python正则图片爬取

本文转载自查看原文 2019-11-23 10:55 284 python爬虫

# conding:utf8
import requests
import re
import time

if __name__ == "__main__":
    # 所有的数据
    url = 'http://www.win4000.com/zt/qsmy.html'

    response = requests.get(url)
    # with open('./qsmy.html', mode='w', encoding='utf-8') as fp:
    #     fp.write(response.text)
    #     print('网页中的内容保存成功')

    # 我们想要的数据
    # <img src="http://static.win4000.com/home/images/placeholder.jpg" data-original = "http://pic1.win4000.com/wallpaper/5/53bcec5b3235b_270_185.jpg" />
    pattern = r'<img src=".*?" data-original = "(.*?)" />'
    html = response.text
    imahe_urls = re.findall(pattern, html)
    print(imahe_urls)
    for img_url in imahe_urls:
        print(img_url)
        response = requests.get(img_url)
        content = response.content
        file = img_url.rsplit('/', maxsplit=1)[1]
        with open('./tupian/%s' % file, mode='wb') as fp:
            fp.write(content)
            print('图片%s保存成功!' % file)
        time.sleep(1)

免责声明！

本站转载的文章为个人学习借鉴使用，本站对版权不负任何法律责任。如果侵犯了您的隐私权益，请联系本站邮箱yoyou2525@163.com删除。

猜您在找 python爬虫学习（四）：爬取网页图片-正则解析数据 python保存爬取的图片 python xpath图片爬取 python 爬取知乎图片 python网络爬虫之解析网页的正则表达式(爬取4k动漫图片)[三] 【Python爬虫】之爬取页面内容、图片以及用selenium爬取 Ins图片爬取（基于python,selenium） python批量爬取猫咪图片用python爬取一张仓鼠图片 python爬虫的图片信息爬取