Python使用content.encode("utf-8").decode("unicode-escape")导致中文乱码的解决方法 - 码上欢乐

相关内容简体繁体

Python使用content.encode("utf-8").decode("unicode-escape")导致中文乱码的解决方法

本文转载自查看原文 2020-09-09 12:01 855 Python

当想要把一个字符串中的\u002F这样的字符串转成正常字符串时，如果字符串中存在中文字符，将导致中文被转成乱码。
例如：

content = "\\u002F哈哈"
content = content.encode("utf-8").decode("utf-8") 
==> \u002F哈哈  无法进行转码

如果使用.decode(“unicode-escape”)

content = "\\u002F哈哈"
content = content.encode("utf-8").decode("unicode-escape")
==> /å“ˆå“ˆ   中文被转码导致乱码

解决方法是逐段解码，只对\uxxxx这样的字符串进行unicode-escape解码，代码如下

import re
content = "\\u002F哈哈"
content = re.sub(r'(\\u[\s\S]{4})',lambda x:x.group(1).encode("utf-8").decode("unicode-escape"),content)
==> /哈哈

补充：自己

content = "\u002F哈哈"
content.encode("utf-8").decode("unicode-escape")
print(content)
==> /哈哈

原文：https://blog.csdn.net/wang785994599/article/details/97653329

免责声明！

本站转载的文章为个人学习借鉴使用，本站对版权不负任何法律责任。如果侵犯了您的隐私权益，请联系本站邮箱yoyou2525@163.com删除。

猜您在找 Python爬虫:decode('utf-8')之后还是乱码的解决 mysql加密与解密decode与encode乱码解决方法(转) Java 使用URLEncoder.encode和URLDecoder.decode编解码(utf-8)中文及特殊字符 python3：(unicode error) 'utf-8' codec can't decode python正则中如何匹配汉字以及encode(‘utf-8’)和decode(‘utf-8’)的互转 idea软件编码已经设置好了为utf-8，但是svn中down下来的文件格式本身不是utf-8的，此时打开后会出现中文乱码解决方法 Sublime text 2/3 [Decode error - output not utf-8] 完美解决方法 python写入mysql时候出现'latin-1' codec can't encode character 问题解决方法以及python设置utf-8 postman测试接口报Content type 'text/plain;charset=UTF-8' not supported解决方法使用vscode运行python出现中文乱码的解决方法

粤ICP备18138465号 © 2018-2026 CODEPRJ.COM