python3 requests爬取gbk时候遇到编码的坑

本文转载自查看原文 2019-08-31 17:49 498 python

python3默认是utf8的，爬取gbk网页的时候会出现乱码

解决办法

test.encoding="gbk"
test.text

text不转换会出现错误，python3字符集不支持转码

第二种方法

test.content.decode("gbk")

decode的作用是将其他编码的字符串转换成unicode编码，如str1.decode('gb2312')，表示将gb2312编码的字符串str1转换成unicode编码。解码

encode的作用是将unicode编码转换成其他编码的字符串，如str2.encode('gb2312')，表示将unicode编码的字符串str2转换成gb2312编码。编码

本站转载的文章为个人学习借鉴使用，本站对版权不负任何法律责任。如果侵犯了您的隐私权益，请联系本站邮箱yoyou2525@163.com删除。

猜您在找 python3爬虫-使用requests爬取起点小说 python3使用requests爬取新浪热门微博 python3爬虫-通过requests爬取西刺代理 Python+requests 爬取网站遇到中文乱码怎么办？ Requests爬取网页的编码问题 [实战演练]python3使用requests模块爬取页面内容 Python3网络爬虫：requests爬取动态网页内容 python3 requests_html 爬取智联招聘数据（简易版） Python3爬虫--两种方法（requests(urllib)和BeautifulSoup）爬取网站pdf python3爬虫-6.使用requests和BeautifulSoup爬取豆瓣Top250电影