python3簡單爬蟲

本文轉載自查看原文 2014-05-25 15:09 8562 python/ python 爬蟲

最近在抽空學了一下python，於量就拿爬是練了下手，不得不說python的上手非常簡單。在網上找了一下，大都是python2的帖子，於是隨手寫了個python3的。代碼非常簡單就不解釋了，直接貼代碼。

#test rdp
import urllib.request
import re

#登錄用的帳戶信息
data={}
data['fromUrl']=''
data['fromUrlTemp']=''
data['loginId']='12345'
data['password']='12345'
user_agent='Mozilla/4.0 (compatible; MSIE 5.5; Windows NT)'
#登錄地址 
#url='http://192.168.1.111:8080/loginCheck'
postdata = urllib.parse.urlencode(data)  
postdata = postdata.encode('utf-8')
headers = { 'User-Agent' : user_agent } 
#登錄  
res = urllib.request.urlopen(url,postdata)
#取得頁面html
strResult=(res.read().decode('utf-8'))
#用正則表達式取出所有A標簽
p = re.compile(r'<a href="(.*?)".*?>(.*?)</a>')
for m in p.finditer(strResult):
    print (m.group(1))#group(1)是href里面的內容，group(2)是a標簽里的文字

關於cookie、異常等處理看了一下，沒有花時間去處理，畢竟只是想通過寫爬蟲來學習python。

想要深入的去看這個系列的文章，寫得非常詳細了。

[Python]網絡爬蟲

下面是python語法教程，真的只要幾分鍾就能看完。

Python3 入門教程

免責聲明！

本站轉載的文章為個人學習借鑒使用，本站對版權不負任何法律責任。如果侵犯了您的隱私權益，請聯系本站郵箱yoyou2525@163.com刪除。

猜您在找 Python3 爬蟲實例（一）-- 簡單網頁抓取 Python3簡單爬蟲抓取網頁圖片 Python3簡單爬蟲抓取網頁圖片 Python3簡單爬蟲抓取網頁圖片 Python3實現簡單的爬蟲功能【Python3爬蟲】12306爬蟲 python3 爬蟲 python3定時爬蟲 Python3爬蟲介紹【Python3爬蟲】斗魚彈幕爬蟲