python爬蟲下載某網站156個網頁小游戲素材


有哪些游戲自己看吧↓
一波網頁小游戲(摸魚專用)
https://www.52pojie.cn/thread-1269936-1-1.html

 

懶得看代碼的可以直接下載成品,分成了兩個包
https://wwi.lanzoui.com/iwGxvgqiwzc
密碼:d89r
https://wwi.lanzoui.com/i7WQvgqisqj
密碼:dg3j

以下為python代碼

 

import requestsfrom bs4 import BeautifulSoup
 
 
def get_Url(url):
    str_list = []
    content = requests.get(url).content
    soup = BeautifulSoup(content, 'lxml')
    find = soup.find('span', attrs={'class': 'current'})
    sum = int(find.text.split('/')[1])
    for i in range(sum):
        if i == 0:
            str_list.append('https://www.mycodes.net/166/')
            continue
        str_list.append('https://www.mycodes.net/166/' + str(i + 1) + '.htm')
    return str_list
 
 
def get_document(url):
    soup = BeautifulSoup(requests.get(url).content, 'lxml')
    find_all = soup.find_all('a', attrs={'style': 'color:#006BCD;font-size:14px;'})
    a = ''
    for value in find_all:
        if a.__eq__(str(value['href'])):
            continue
        a = value['href']
        document = BeautifulSoup(requests.get(value['href']).content, 'lxml')
        text = document.find('td', attrs={'class': 'a0'}).text
        print(text+":")
        td_s = document.find_all('td', attrs={'class': 'b1'})
        for td in td_s:
            find = td.find('a')
            if find is not None:
                print(find['href'])
 
 
if __name__ == '__main__':
    url_list = get_Url('https://www.mycodes.net/166/')
    for url in url_list:
        get_document(url)

 

  

 


免責聲明!

本站轉載的文章為個人學習借鑒使用,本站對版權不負任何法律責任。如果侵犯了您的隱私權益,請聯系本站郵箱yoyou2525@163.com刪除。



 
粵ICP備18138465號   © 2018-2025 CODEPRJ.COM