[Python Study Notes 6] Fetching Baidu search results and fixing the "Baidu Security Verification" response

Reposted article. 2020-06-18 01:36. Category: Python study

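1. Fetching a Baidu search results page mostly comes down to setting the query parameters in the Baidu search URL. The search keyword, for example, is passed in the wd parameter.

Example: https://www.baidu.com/s?wd=python returns results related to "python".

For the other parameters, see: https://blog.csdn.net/ZustKe/article/details/83882345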
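When the keyword contains spaces or non-ASCII characters, it has to be percent-encoded before it goes into the URL. Below is a minimal sketch using the standard library's urllib.parse.urlencode; the helper name build_search_url and the constant BAIDU_SEARCH are made up for illustration, and only the wd parameter from the example above is used.

```python
from urllib.parse import urlencode

# Base search endpoint, taken from the example URL above.
BAIDU_SEARCH = "https://www.baidu.com/s"


def build_search_url(keyword: str) -> str:
    """Return a Baidu search URL with the keyword percent-encoded in wd.

    build_search_url is a hypothetical helper name, not part of any library.
    """
    # urlencode encodes the value as UTF-8 and escapes unsafe characters.
    query = urlencode({"wd": keyword})
    return f"{BAIDU_SEARCH}?{query}"


print(build_search_url("python"))
print(build_search_url("百度"))
```

A plain ASCII keyword passes through unchanged, so the first call produces the same URL as the example above.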

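2. When fetching Baidu content from Python, the returned page is sometimes a "Baidu Security Verification" (百度安全驗證) page instead of the search results.

In this case the cause was that the request headers did not include an Accept header. Once Accept was set, the normal results page came back.

As usual, the code follows: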

import urllib.request

# Send browser-like headers; without Accept, Baidu returned the verification page.
headers = {
    'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/83.0.4103.97 Safari/537.36 Edg/83.0.478.50',
    'Accept': 'text/html,application/xhtml+xml,application/xml;q=0.9,image/webp,image/apng,*/*;q=0.8,application/signed-exchange;v=b3;q=0.9'
}
# wd carries the search keyword.
url = "https://www.baidu.com/s?wd=python"

req = urllib.request.Request(url=url, headers=headers)
# Decode the body as UTF-8 and drop any bytes that cannot be decoded.
html = urllib.request.urlopen(req).read().decode('UTF-8', 'ignore')
print(html)
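Since the verification page comes back as an ordinary response, it can help to check the HTML before parsing it. The sketch below is only an assumption about how to detect it: it looks for the page title text in simplified and traditional characters, and both the marker strings and the helper name looks_like_verification_page are illustrative rather than anything Baidu documents.

```python
# Assumed marker strings: the verification page title in simplified and
# traditional characters. Baidu may change this wording at any time.
VERIFICATION_MARKERS = ("百度安全验证", "百度安全驗證")


def looks_like_verification_page(html: str) -> bool:
    """Return True if the HTML contains one of the assumed marker strings.

    Hypothetical helper; a True result only means a marker was found.
    """
    return any(marker in html for marker in VERIFICATION_MARKERS)


# Offline samples standing in for real responses.
blocked = "<html><head><title>百度安全验证</title></head></html>"
normal = "<html><head><title>python_百度搜索</title></head></html>"
print(looks_like_verification_page(blocked))
print(looks_like_verification_page(normal))
```

In the script above, the html string could be passed to this check right after decoding, so that a blocked response is reported instead of being parsed as search results.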
