#問題
二分查找
list.index()無法應對大規模數據的查詢,需要用其它方法解決,這里談的就是二分查找
#思路說明
在查找方面,python中有list.index()的方法。例如:
>>> a=[2,4,1,9,3] #list可以是無序,也可以是有序
>>> a.index(4) #找到后返回該值在list中的位置
1
>>> a.index(5) #如果沒有該值,則報錯
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
ValueError: 5 is not in list
這是python中基本的查找方法,雖然簡單,但是,如果由於其時間復雜度為O(n),對於大規模的查詢恐怕是不足以勝任的。二分查找就是一種替代方法。
二分查找的對象是:有序數組。這點特別需要注意。要把數組排好序先。怎么排序,可以參看我這里多篇排序問題的文章。
基本步驟:
1. 從數組的中間元素開始,如果中間元素正好是要查找的元素,則搜素過程結束;
2. 如果某一特定元素大於或者小於中間元素,則在數組大於或小於中間元素的那一半中查找,而且跟開始一樣從中間元素開始比較。
3. 如果在某一步驟數組為空,則代表找不到。
這種搜索算法每一次比較都使搜索范圍縮小一半。時間復雜度:O(logn)
# 遞歸 def binary_search(lst, value, lo, hi): if lo > hi: return -1 half = (lo + hi)/2 if lst[half] == value: return half elif lst[half] > value: return binary_search(lst, value, lo, half-1) else: return binary_search(lst, value, half+1, hi) # 循環 def binary_search_while(lst, value): lo, hi = 0, len(lst)-1 while lo <= hi: half = (lo + hi)/2 if lst[half] > value: hi = half - 1 elif lst[half] < value: lo = half + 1 else: return half return -1
對於python,不能忽視其強大的標准庫。經查閱,發現標准庫中就有一個模塊,名為:bisect。
- 模塊接受排序后的列表。
- 本模塊同樣適用於長列表項。因為它就是用二分查找方法實現的,有興趣可以看其源碼(源碼是一個很好的二分查找算法的例子,特別是很好地解決了邊界條件極端的問題.)
- 關於bisect模塊的更多內容,可以參看[官方文檔](https://docs.python.org/2/library/bisect.html)
"""Bisection algorithms.""" def insort_right(a, x, lo=0, hi=None): """Insert item x in list a, and keep it sorted assuming a is sorted. If x is already in a, insert it to the right of the rightmost x. Optional args lo (default 0) and hi (default len(a)) bound the slice of a to be searched. """ if lo < 0: raise ValueError('lo must be non-negative') if hi is None: hi = len(a) while lo < hi: mid = (lo+hi)//2 if x < a[mid]: hi = mid else: lo = mid+1 a.insert(lo, x) insort = insort_right # backward compatibility def bisect_right(a, x, lo=0, hi=None): """Return the index where to insert item x in list a, assuming a is sorted. The return value i is such that all e in a[:i] have e <= x, and all e in a[i:] have e > x. So if x already appears in the list, a.insert(x) will insert just after the rightmost x already there. Optional args lo (default 0) and hi (default len(a)) bound the slice of a to be searched. """ if lo < 0: raise ValueError('lo must be non-negative') if hi is None: hi = len(a) while lo < hi: mid = (lo+hi)//2 if x < a[mid]: hi = mid else: lo = mid+1 return lo bisect = bisect_right # backward compatibility def insort_left(a, x, lo=0, hi=None): """Insert item x in list a, and keep it sorted assuming a is sorted. If x is already in a, insert it to the left of the leftmost x. Optional args lo (default 0) and hi (default len(a)) bound the slice of a to be searched. """ if lo < 0: raise ValueError('lo must be non-negative') if hi is None: hi = len(a) while lo < hi: mid = (lo+hi)//2 if a[mid] < x: lo = mid+1 else: hi = mid a.insert(lo, x) def bisect_left(a, x, lo=0, hi=None): """Return the index where to insert item x in list a, assuming a is sorted. The return value i is such that all e in a[:i] have e < x, and all e in a[i:] have e >= x. So if x already appears in the list, a.insert(x) will insert just before the leftmost x already there. Optional args lo (default 0) and hi (default len(a)) bound the slice of a to be searched. """ if lo < 0: raise ValueError('lo must be non-negative') if hi is None: hi = len(a) while lo < hi: mid = (lo+hi)//2 if a[mid] < x: lo = mid+1 else: hi = mid return lo # Overwrite above definitions with a fast C implementation try: from _bisect import * except ImportError: pass
參考資料:https://github.com/qiwsir/algorithm