獲取美拍視頻的鏈接--JS分析

本文轉載自查看原文 2019-12-31 15:37 1091 Z ------ 爬蟲練習

美拍鏈接：https://www.meipai.com/

找到視頻鏈接的標簽，源代碼中沒有這個div

通過Fiddler抓包，找到class="mp-h5-player-layer-video"的div由哪個js文件生成的

打開對應的js文件，對其進行斷點，找到src生成的方式

發現src參數在這個位置

此時需要找到字符串的來源、再模擬出這個方法

最后發現字符串是一開始就存在於網頁中的

在請求網頁時，提取出視頻對應的字符串，再通過模擬出的方法即可得到URL

import threading
import requests
import base64
import re


#   解密video的URL
def Decrypt_video_url(content):
    str_start = content[4:]

    list_temp = []
    list_temp.extend(content[:4])
    list_temp.reverse()
    hex = ''.join(list_temp)

    dec = str(int(hex, 16))
    list_temp1 = []
    list_temp1.extend(dec[:2])
    pre = list_temp1

    list_temp2 = []
    list_temp2.extend(dec[2:])
    tail = list_temp2

    str0 = str_start[:int(pre[0])]
    str1 = str_start[int(pre[0]):int(pre[0]) + int(pre[1])]

    result1 = str0 + str_start[int(pre[0]):].replace(str1, '')

    tail[0] = len(result1) - int(tail[0]) - int(tail[1])

    a = result1[:int(tail[0])]
    b = result1[int(tail[0]):int(tail[0]) + int(tail[1])]
    c = (a + result1[int(tail[0]):].replace(b, ''))

    return base64.b64decode(c).decode()


#   獲取網頁的內容
def Page_text(url):
    headers = {
        'User-Agent': 'Mozilla/5.0 (Windows NT 6.1; WOW64; rv:21.0) Gecko/20130331 Firefox/21.0'
    }
    return requests.get(url, headers=headers).text


#   解析單個網頁
def Parse_url(video_title, url_tail):
    page_url = 'https://www.meipai.com' + url_tail
    video_page = Page_text(page_url)
    #   獲取視頻加密后的的URL
    data_video = re.findall(r'data-video="(.*?)"', video_page, re.S)[0]
    video_url = Decrypt_video_url(data_video)
    print("{}\n{}\n{}\n".format(video_title, page_url, video_url))


def Get_url(url):
    index_page = Page_text(url)
    #   各個視頻的標題
    videos_title = re.findall(r'class="content-l-p pa" title="(.*?)">', index_page, re.S)
    #   各個播放網頁的URL
    urls = re.findall(r'<div class="layer-black pa"></div>\n\s*<a hidefocus href="(.*?)"', index_page, re.S)

    t_list = []
    for video_title, url_tail in zip(videos_title, urls):
        t = threading.Thread(name='GetUrl', target=Parse_url, args=(video_title, url_tail,))
        t_list.append(t)

    for i in t_list:
        i.start()


if __name__ == '__main__':
    Get_url('https://www.meipai.com/')

![](https://img2018.cnblogs.com/blog/821307/201912/821307-20191231153323604-1746369418.png)

免責聲明！

本站轉載的文章為個人學習借鑒使用，本站對版權不負任何法律責任。如果侵犯了您的隱私權益，請聯系本站郵箱yoyou2525@163.com刪除。

猜您在找 Python爬蟲：爬取美拍小姐姐視頻獲取youku視頻下載鏈接（wireshark抓包分析） base64隨機字符混淆加密、解密-美拍視頻地址解密(兼容ie、中文) 獲取騰訊視頻鏈接的方法 js獲取鏈接參數秒拍產品分析美團本地生活場景的短視頻分析 js獲取視頻截圖 B站最新視頻下載鏈接的獲取 JS 獲取鏈接中的參數