Python:爬取網頁圖片並保存至本地
python3爬取網頁中的圖片到本地的過程如下:
1、爬取網頁
2、獲取圖片地址
3、爬取圖片內容並保存到本地
實例:爬取百度貼吧首頁圖片。
代碼如下:
import urllib.request import re url = 'https://tieba.baidu.com/' request = urllib.request.Request(url) request.add_header('User-Agent','Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/63.0.3239.132 Safari/537.36 QIHU 360EE') response = urllib.request.urlopen(request) buf = response.read() buf = str(buf, encoding='utf-8') # 獲取所有圖片url地址列表 listurl = re.findall(r'http.+\.jpg', buf) print(listurl) i = 1 for url in listurl: with open(str(i)+'.jpg','wb+') as f: req = urllib.request.urlopen(url) buf = req.read() f.write(buf) i += 1
參考資料:
https://blog.csdn.net/qq_37811638/article/details/84586200