Python序列刪除重復數據


## 對於列表來說,若不保持原有順序,可以直接轉換為set刪除重復數據

1 nums = [1,2,32,2,2,4,3,2,3,42]
2 nums = list(set(nums))
3 print(nums)
4 # [32, 1, 2, 3, 4, 42]  # 刪除了重復數據,但是原有順序也改變了

 

## 刪除數據並保持原有順序

 1 def dedupe(items, key=None):
 2     """
 3     items: 哈希或者不可哈希的序列
 4     key: 若items為不可哈希的序列(dict等)則需要指定一個函數
 5     """
 6     seen = set()
 7     for item in items:
 8         val = item if key is None else key(item)
 9         if val not in seen:
10             yield item
11             seen.add(val)
12 
13 nums = [1,2,32,2,2,4,3,2,3,42]
14 print(list(dedupe(nums)))
15 # [1, 2, 32, 4, 3, 42]
16 
17 students = [
18     {"name": "Stanley", "score": 88},
19     {"name": "Lily", "score": 92},
20     {"name": "Bob", "score": 91},
21     {"name": "Well", "score": 80},
22     {"name": "Bob", "score": 90},
23     {"name": "Peter", "score": 80}
24 ]
25 deduped_students = list(dedupe(students, key=lambda s: s['name']))
26 print(deduped_students)
27 """
28 [{'name': 'Stanley', 'score': 88},
29 {'name': 'Lily', 'score': 92},
30 {'name': 'Bob', 'score': 91},
31 {'name': 'Well', 'score': 80},
32 {'name': 'Peter', 'score': 80}]   # 刪除了相同姓名的元素
33 """
34 # 刪除姓名和分數都相同的元素
35 deduped_students = list(dedupe(students, key=lambda s: (s['name'], s['score'])))

 

參考資料:
  Python Cookbook, 3rd edition, by David Beazley and Brian K. Jones (O’Reilly).


免責聲明!

本站轉載的文章為個人學習借鑒使用,本站對版權不負任何法律責任。如果侵犯了您的隱私權益,請聯系本站郵箱yoyou2525@163.com刪除。



 
粵ICP備18138465號   © 2018-2025 CODEPRJ.COM