Three ways to fetch a web page in Python

Reposted article. 2017-12-05 17:30

import urllib.request
import http.cookiejar

url = 'http://www.baidu.com/'

# Method 1: build a Request object so a custom User-Agent header can be sent
print('Method 1')
req_one = urllib.request.Request(url)
req_one.add_header('User-Agent', 'Mozilla/6.0')
res_one = urllib.request.urlopen(req_one)
code_one = res_one.getcode()
html_one = res_one.read().decode('utf-8')
res_one.close()
print('Method 1 status code: %s' % code_one)
print('Method 1 page content: ' + html_one)


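# Method 2: pass the URL string straight to urlopen (default headers)
print('Method 2')
res_two = urllib.request.urlopen(url)
code_two = res_two.getcode()
html_two = res_two.read().decode('utf-8')
res_two.close()  # the original left this response open
print('Method 2 status code: %s' % code_two)
print('Method 2 page content: ' + html_two)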


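# Method 3: install an opener that stores cookies in a cookie jar
print('Method 3')
cj = http.cookiejar.LWPCookieJar()
opener = urllib.request.build_opener(urllib.request.HTTPCookieProcessor(cj))
# install_opener makes this opener the default for every later urlopen call
urllib.request.install_opener(opener)
res_three = urllib.request.urlopen(url)
print(cj)  # cookies set by the server's response
code_three = res_three.getcode()
html_three = res_three.read().decode('utf-8')
res_three.close()
print('Method 3 status code: %s' % code_three)
print('Method 3 page content: ' + html_three)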
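
The three methods above share the same steps: open the URL, read the status code, decode the body, close the response. Below is a minimal sketch that folds them into one helper. The names build_request, decode_body, make_cookie_opener and fetch are hypothetical and not part of the original script. The sketch also makes a few assumptions the original does not: it takes the charset from the response headers instead of always using UTF-8, it returns an empty string on errors, and it passes the cookie-aware opener explicitly rather than installing it globally.

```python
import http.cookiejar
import urllib.error
import urllib.request


def build_request(url, user_agent=None):
    """Return a Request for url, optionally with a custom User-Agent (method 1)."""
    req = urllib.request.Request(url)
    if user_agent:
        req.add_header('User-Agent', user_agent)
    return req


def decode_body(raw, charset=None):
    """Decode bytes with the declared charset, falling back to UTF-8."""
    try:
        return raw.decode(charset or 'utf-8', errors='replace')
    except LookupError:
        # The server declared an encoding name that Python does not know.
        return raw.decode('utf-8', errors='replace')


def make_cookie_opener():
    """Return (opener, cookie_jar) without changing the global opener (method 3)."""
    jar = http.cookiejar.LWPCookieJar()
    opener = urllib.request.build_opener(urllib.request.HTTPCookieProcessor(jar))
    return opener, jar


def fetch(url, user_agent=None, timeout=10, opener=None):
    """Return (status_code, text); on failure return (code or None, '')."""
    req = build_request(url, user_agent)
    open_url = opener.open if opener is not None else urllib.request.urlopen
    try:
        # The with-statement closes the response even if read() raises.
        with open_url(req, timeout=timeout) as res:
            charset = res.headers.get_content_charset()
            return res.getcode(), decode_body(res.read(), charset)
    except urllib.error.HTTPError as err:
        # 4xx/5xx responses are raised as exceptions that carry the status code.
        return err.code, ''
    except urllib.error.URLError:
        # DNS failure, refused connection, connect timeout, and similar.
        return None, ''
```

A call such as fetch(url, user_agent='Mozilla/5.0') corresponds to method one, fetch(url) to method two, and fetch(url, opener=opener) with an opener from make_cookie_opener() to method three. HTTPError is caught before URLError because it is a subclass of URLError.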
