Python 爬蟲：煎蛋網妹子圖

本文轉載自查看原文 2018-03-05 13:59 988 爬蟲/ 我的作品/ Python

使用 Headless Chrome 替代了 PhatomJS。

圖片保存到指定文件夾中。

 1 import requests
 2 from bs4 import BeautifulSoup
 3 from selenium import webdriver
 4 from selenium.webdriver.chrome.options import Options
 5 
 6 chrome_options = Options()
 7 chrome_options.add_argument('--headless')
 8 chrome_options.add_argument('--disable-gpu')
 9 driver = webdriver.Chrome(chrome_options=chrome_options)
10 dir = 'C:/spider-download/jandan-girls/'
11 img_urls = []
12 page_urls = ["http://jandan.net/ooxx/page-{}#comments".format(str(i)) for i in range(5, 6)]
13 
14 def GetImgUrl(u):
15     driver.get(u)
16     html = driver.page_source
17     soup = BeautifulSoup(html, 'lxml')
18     images = soup.select('a.view_img_link')
19     for i in images:
20         t = i.get('href')
21         if str('gif') in str(t):
22             pass
23         else:
24             img_url = 'http:' + t
25             img_urls.append(img_url)
26 
27 def DownloadImg():
28     n = 1
29     for i in img_urls:
30         print('第 ' + str(n) + ' 張 ... ', end='')
31         with open(dir + i[-20:], 'wb') as f:
32             f.write(requests.get(i).content)
33         print('OK!')
34         n = n + 1
35 
36 for u in page_urls:
37     GetImgUrl(u)
38 print('*** 開始下載 ***')
39 DownloadImg()
40 print('*** 下載完成 ***')

免責聲明！

本站轉載的文章為個人學習借鑒使用，本站對版權不負任何法律責任。如果侵犯了您的隱私權益，請聯系本站郵箱yoyou2525@163.com刪除。

猜您在找 python爬蟲–爬取煎蛋網妹子圖片項目: python爬蟲福利煎蛋網妹子圖 python 爬蟲爬取煎蛋網妹子圖煎蛋網妹子圖爬蟲總結 python 爬取煎蛋ooxx妹子圖 [Python爬蟲]煎蛋網OOXX妹子圖爬蟲（1）——解密圖片地址 python爬蟲-妹子圖 python爬煎蛋妹子圖--20多行代碼搞定煎蛋妹子圖庫 Python爬蟲之——爬取妹子圖片爬取煎蛋XXOO妹子圖片