Day 41: start with a copy of day 37 material

thatguyintech · thatguyintech · commit 82cc632869e9 · 2017-05-21T01:02:31.000-07:00
diff --git a/day41/README.md b/day41/README.md
@@ -0,0 +1,43 @@
+[Today's challenge is actually a follow-up on day37's challenge -- using a heap instead of radix sort]
+
+Question of the day: https://leetcode.com/problems/top-k-frequent-elements/#/description
+
+Given a non-empty array of integers, return the k most frequent elements.
+
+For example,  
+Given `[1,1,1,2,2,3]` and `k = 2`, return `[1,2]`.
+
+Note:   
+* You may assume k is always valid, 1 ≤ k ≤ number of unique elements.  
+* Your algorithm's time complexity must be better than O(n log n), where
+  n is the array's size.
+
+## Ideas
+
+Can't do a normal sort, since that alone will take `O(nlogn)` runtime.
+The input array isn't sorted, so we need to keep track of a count and organize
+that count somehow as we iterate through the integers in the array. There
+doesn't seem to be any constraints on the types of integers on the array,
+so I'll assume that possible elements in the array range from -maxInt to maxInt
+. 
+
+I think I can actually use radix sort again. Same idea as the challenge from
+[Day 36](../day36).
+
+## Code
+[Day 37 - Python](../day37/topKFrequent.py)
+
+## Follow-up
+
+Over the past few days ([38](../day38), [39](../day39), [40](../day40)), I got a
+little more familiar with the heap data structure and finally understand why
+heapifying an unsorted array can be done in linear time. The operations involved
+in heapify decrease exponentially over a logarithmic range, resulting in an overall
+linear amount of work. Anyways, I can use this `O(n)` time to heapify an unsorted
+array of frequencies, and then pop off the top `k` frequencies in `O(klogn)` time.
+The overall runtime would now be at most `O(nlogn)` if `k` == `n`. However, the
+real savings is in the `O(n)` space for storing all the elements of the heap.
+Much better than the `O(max value of the input array)` I had before.
+
+## Code
+[Day 41 - Python](./topKFrequen.py)
diff --git a/day41/topKFrequent.py b/day41/topKFrequent.py
@@ -0,0 +1,21 @@
+def topKFrequent(nums, k):
+
+    return ret
+
+def testTopKFrequent():
+    assert set(topKFrequent([], 0)) == set([])
+    assert set(topKFrequent([1], 1)) == set([1])
+    assert set(topKFrequent([-1, -1], 1)) == set([-1])
+    assert set(topKFrequent([1,1,1,2,2,3], 2)) == set([1, 2])
+    assert set(topKFrequent([-1,-1,-1,2,2,3], 2)) == set([-1, 2])
+    assert set(topKFrequent([1,1,1,2,2,3], 3)) == set([1, 2, 3])
+    assert set(topKFrequent([1,1,1,2,2,2,3,3,3], 3)) == set([1, 2, 3])
+    assert set(topKFrequent([1, 2, 3, 4, 5], 1)) == set([1, 2, 3, 4, 5])
+    assert set(topKFrequent([4,1,-1,2,-1,2,3], 2)) == set([-1, 2])
+    assert set(topKFrequent([3,2,3,1,2,4,5,5,6,7,7,8,2,3,1,1,1,10,11,5,6,2,4,7,8,5,6], 10)) == set([1,2,5,3,7,6,4,8,10,11])
+
+def main():
+    testTopKFrequent()
+
+if __name__ == "__main__":
+    main()