Scraping image paths from a web page with Python 3 and writing them to a file

Reposted article. 2019-04-21 23:48. Tag: python3

import re
import urllib.request

# Fetch the raw page content (bytes) from the given URL
def getHtml(url):
    # Use the url argument rather than a hard-coded address
    response = urllib.request.urlopen(url)
    return response.read()

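# Append data to a file
def writeFile(fileName, data):
    # Mode 'a' appends, so existing data is not overwritten
    with open(fileName, 'a', encoding='utf-8') as htmlFile:
        # Write one entry per line so the entries do not run together
        htmlFile.write(data + '\n')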

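# Extract image URLs ending in .jpg
def getImgSrc(html):
    # decode() converts bytes to str
    # The non-greedy pattern excludes quotes, whitespace and angle brackets,
    # so one match cannot span several URLs on the same line
    imgUrl = re.findall(r'https:[^\s"\'<>]+?\.jpg', html.decode('utf-8'))
    return imgUrl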

html = getHtml('https://www.imooc.com/')
print(html)

imgUrl = getImgSrc(html)
for i in imgUrl:
    print(i)
    writeFile('imgUrl.txt', i)
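
A regular expression run over raw HTML can pick up URLs that are not image sources, or miss relative paths. As a rough alternative sketch, not part of the original script, the standard library's html.parser can read the src attribute of each img tag directly. The names ImgSrcParser, extract_jpg_sources and the sample markup below are hypothetical and only for illustration.

```python
from html.parser import HTMLParser


class ImgSrcParser(HTMLParser):
    """Collect the src attribute of every <img> tag whose value ends in .jpg."""

    def __init__(self):
        super().__init__()
        self.sources = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs; value may be None
        if tag != 'img':
            return
        for name, value in attrs:
            if name == 'src' and value and value.lower().endswith('.jpg'):
                self.sources.append(value)


def extract_jpg_sources(html_text):
    # html_text must be str; decode the bytes returned by urlopen().read() first
    parser = ImgSrcParser()
    parser.feed(html_text)
    return parser.sources


# Hypothetical sample markup, used instead of a live page
sample = (
    '<p><img src="https://example.com/a.jpg">'
    '<img src="https://example.com/b.png">'
    '<img alt="x" src="https://example.com/c.JPG"></p>'
)
print(extract_jpg_sources(sample))
```

The returned list can be passed to writeFile one item at a time, in the same way as the regex results above.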
