295-find-median-from-data-stream¶
Try it on leetcode
Description¶
The median is the middle value in an ordered integer list. If the size of the list is even, there is no middle value and the median is the mean of the two middle values.
- For example, for
arr = [2,3,4]
, the median is3
. - For example, for
arr = [2,3]
, the median is(2 + 3) / 2 = 2.5
.
Implement the MedianFinder class:
MedianFinder()
initializes theMedianFinder
object.void addNum(int num)
adds the integernum
from the data stream to the data structure.double findMedian()
returns the median of all elements so far. Answers within10-5
of the actual answer will be accepted.
Example 1:
Input ["MedianFinder", "addNum", "addNum", "findMedian", "addNum", "findMedian"] [[], [1], [2], [], [3], []] Output [null, null, null, 1.5, null, 2.0] Explanation MedianFinder medianFinder = new MedianFinder(); medianFinder.addNum(1); // arr = [1] medianFinder.addNum(2); // arr = [1, 2] medianFinder.findMedian(); // return 1.5 (i.e., (1 + 2) / 2) medianFinder.addNum(3); // arr[1, 2, 3] medianFinder.findMedian(); // return 2.0
Constraints:
-105 <= num <= 105
- There will be at least one element in the data structure before calling
findMedian
. - At most
5 * 104
calls will be made toaddNum
andfindMedian
.
Follow up:
- If all integer numbers from the stream are in the range
[0, 100]
, how would you optimize your solution? - If
99%
of all integer numbers from the stream are in the range[0, 100]
, how would you optimize your solution?
Solution(Python)¶
from heapq import heappush, heappop, heapify
class MedianFinder:
def __init__(self):
self.minHeap = []
self.maxHeap = []
def addNum(self, num: int) -> None:
if len(self.minHeap) != len(self.maxHeap):
heappush(self.minHeap, -heappushpop(self.maxHeap, -num))
else:
heappush(self.maxHeap, -heappushpop(self.minHeap, num))
def findMedian(self) -> float:
if len(self.minHeap) != len(self.maxHeap):
return -self.maxHeap[0]
else:
return (self.minHeap[0] - self.maxHeap[0]) / 2
# Your MedianFinder object will be instantiated and called as such:
# obj = MedianFinder()
# obj.addNum(num)
# param_2 = obj.findMedian()
#
#
# bruteforce:
# we need to maintain the sorted list of n integers
# in addnum we do insertion sort O(nlog n)
# find median is O(1)
#
# Augumented self-balanced binary search tree
# every node of BST have numbers of arr
# left side or numbers less than current nide right side for numbers greater than node
# only problem is bst gives us sorted data which is extra info not needed
#
# Heaps:
# since the problem needs an online algorithm and median is the essential info
# we can use max heap on the left side to represent values less than effective mean
# right heap to maintain values greater than effective mea
# after incoming input when bot heaps have same number of values take averahe=ge of heap root
# when they are not balanced we take take effective mean from largest heap
# Time Complexity: O(logn) for insertion and O(1) for get median