Previous Lecture | Lecture 7 | Next Lecture |
Lecture 7, Tue 04/25
Recursion, Python Lists vs. Dictionaries
Partially Recorded Lecture: 4_25_23
- Unfortunately, there was only a partial recording of today’s lecture (but lecture notes are below).
- I will always try to provide recordings, but WIFI was down and I could not capture the recording.
- Note that recordings are a luxury (not a requirement) for this course, so attendance is always encouraged!
Recursion
- Recursion is when a function contains a call to itself
- Recursive solutions are useful when the result is dependent on the result of sub-parts of the problem
The three laws of recursion
- A recursive algorithm must have a base case
- A recursive algorithm must change its state and move toward the base case
- A recursive algorithm must call itself, recursively
Common Example: Factorial
- N! = N * (N-1) * (N-2) * … * 3 * 2 * 1
- N! = N * (N-1)!
- We can solve N! by solving the (N-1)! sub-part
- Note: 0! == 1
def factorial(n):
if n == 0: # base case
return 1
return n * factorial(n-1)
Function Calls and the Stack
- The
Stack
data structure has the First in, Last out (FILO) property - We can only access the elements at the top of the stack
- Analogy: Canister of tennis balls
- In order to remove the bottom ball, we have to remove all the balls on top
- Analogy: Canister of tennis balls
- We won’t go through an implementation yet, but we can conceptualize this on a high-level
- Function calls utilize a stack (“call stack”) and organize how functions are called and how expressions that call functions wait for the functions’ return values
- When a function is called, you can think of that function state being added (or “pushed”) on the stack
- When a function finishes execution, it gets removed (or “popped”) from the stack
- The top of the stack is the function that is currently running
- Example
def double(n):
return 2 * n
def triple(n):
return n + double(n)
print(triple(5))
- Going back to our recursive factorial example, let’s trace
factorial(3)
Common Example: Fibonnaci Sequence
- A fibonacci sequence is the nth number in the sequence is the sum of the previous two (i.e. f(n) = f(n-1) + f(n-2)).
- f(0) = 0, f(1) = 1, f(2) = 1, f(3) = 2, f(4) = 3, f(5) = 5, f(6) = 8, …
def fibonnaci(n):
if n == 1: # base cases
return 1
if n == 0:
return 0
return fibonnaci(n-1) + fibonnaci(n-2)
- Evaluate
fibonnaci(4)
by hand
fib(3) + fib(2)
fib(2) + fib(1) + fib(2)
fib(1) + fib(0) + fib(1) + fib(2)
1 + fib(0) + fib(1) + fib(2)
1 + 0 + fib(1) + fib(2)
1 + 0 + 1 + fib(2)
1 + 0 + 1 + fib(1) + fib(0)
1 + 0 + 1 + 1 + fib(0)
1 + 0 + 1 + 1 + 0
3
Python Lists vs. Python Dictionaries
- Let’s observe the performance between lists and dictionaries with an example.
- The following program counts the number of words in a file using a list and dictionary
- They do the same thing, but the performance is vastly different…
wordlist.txt
: File containing a collection of words, one per line.PeterPan.txt
: You can download classic novels from the Gutenberg Project as a .txt file!
# Set up our data structures
DICT = {}
infile = open("wordlist.txt", 'r')
for x in infile: # x goes through each line in the file
DICT[x.strip()] = 0 # Creates an entry in DICT with key x.strip() and value 0
print(len(DICT))
infile.close() # close the file after we’re done with it.
WORDLIST = []
for y in DICT: # put the DICT keys into WORDLIST
WORDLIST.append(y)
print(len(WORDLIST))
# Algorithm 1 - Lists
from time import time
start = time()
infile = open("PeterPan.txt", 'r')
largeText = infile.read()
infile.close()
counter = 0
words = largeText.split()
for x in words:
x = x.strip("\"\'()[]{},.?<>:;-").lower()
if x in WORDLIST:
counter += 1
end = time()
print(counter)
print("Time elapsed with WORDLIST (in seconds):", end - start)
# Algorithm 2 - Dictionaries
start = time()
infile = open("PeterPan.txt", 'r')
largeText = infile.read()
infile.close()
words = largeText.split()
counter = 0
for x in words:
x = x.strip("\"\'()[]{},.?<>:;-").lower()
if x in DICT: # Searching through the DICT
counter += 1
end = time()
print(counter)
print("Time elapsed with DICT (in seconds):", end - start)
- Python lists are efficient if we know the index of the item we’re looking for
- The reason why adding to the front of the list is costly is because lists have to “make room” for the element to be at index 0
- All existing elements of the list need to shift one index up in order for the inserted element to be placed at index 0
- Adding to the end of the list is not nearly as costly because no shifting of existing elements needs to occur
- The reason why adding to the front of the list is costly is because lists have to “make room” for the element to be at index 0
-
For this example, since we are looking for the value in the list (without knowing the index), we are checking through the entire WORDLIST for every word in
PeterPan.txt
- Python dictionaries are efficient when looking up a key value
- Dictionary values are actually stored in an underlying list
- Keys are converted into a numerical value, which is passed into a hash function
- The purpose of the hash function is to output the index for the underlying list based on the key value
- We do not have to scan the underlying list structure since a key will always be placed into a specific location in the the underlying list
- We won’t go into the implementation now, but we’ll revist this in more detail later
- There are MANY ways to solve a problem in programming, but understanding how certain tools work and making the best decisions is important!