Previous Lecture | Lecture 13 | Next Lecture |
Lecture 13, Tue 05/16
Mergesort
Recorded Lecture: 5_16_23
Divide and Conquer Algorithms
- Subdivide a larger problem into smaller problems
- Solve each smaller part
- Combine solutions of smaller sub problems back into the larger problem
- We’ve seen this pattern with recursion where the problem can be subdivided
- So far, we’ve talked about bubble sort, selection sort, and insertion sort
- These algorithms generally run in O(n2)
- But better sorting algorithms exist!
- We can improve our run time to O(n log n)
Merge Sort
- Idea:
- Break a list into sub sublists where size == 1
- A sublist with 1 element is considered sorted
- Merge each small sublist together to form sorted larger list
- Continue to merge sublists back into the original list
Hand drawn example
Merge Sort Implementation
def mergeSort(alist):
if len(alist) > 1:
mid = len(alist) // 2
# uses additional space to create the left / right
# halves
lefthalf = alist[:mid]
righthalf = alist[mid:]
# recursively sort the lefthalf, then righthalf
mergeSort(lefthalf)
mergeSort(righthalf)
# merge two sorted sublists (left / right halves)
# into the original list (alist)
i = 0 # index for lefthalf sublist
j = 0 # index for righthalf sublist
k = 0 # index for alist
while i < len(lefthalf) and j < len(righthalf):
if lefthalf[i] <= righthalf[j]:
alist[k] = lefthalf[i]
i = i + 1
else:
alist[k] = righthalf[j]
j = j + 1
k = k + 1
# put the remaining lefthalf elements (if any) into
# alist
while i < len(lefthalf):
alist[k] = lefthalf[i]
i = i + 1
k = k + 1
# put the remaining righthalf elements (if any) into
# alist
while j < len(righthalf):
alist[k] = righthalf[j]
j = j + 1
k = k + 1
# pytest
def test_mergeSort():
numList1 = [9,8,7,6,5,4,3,2,1]
numList2 = [1,2,3,4,5,6,7,8,9]
numList3 = []
numList4 = [1,9,2,8,3,7,4,6,5]
numList5 = [5,4,6,3,7,2,8,1,9]
mergeSort(numList1)
mergeSort(numList2)
mergeSort(numList3)
mergeSort(numList4)
mergeSort(numList5)
assert numList1 == [1,2,3,4,5,6,7,8,9]
assert numList2 == [1,2,3,4,5,6,7,8,9]
assert numList3 == []
assert numList4 == [1,2,3,4,5,6,7,8,9]
assert numList5 == [1,2,3,4,5,6,7,8,9]
Merge Sort Analysis
- Merge Sort breaks the list into two equal parts per recursive iteration
- Note that the height (number of levels) of the tree is log n (for 8 nodes, we have a height of 3).
- For each level, the merge step needs to compare
n
elements- So if there are log n levels and we need to do
n
comparisons to merge the elements for each level, then we have: O(n log n) running time
- So if there are log n levels and we need to do
- One disadvantage of Merge Sort is it requires O(n) additional space when creating the left and right sublists
- Can be problematic if the list we’re trying to sort is extremely large