Python decode與encode

本文轉載自查看原文 2016-12-24 18:53 1651 python syntax/ Python

字符串在Python內部的表示是unicode編碼(8-bit string)，因此，在做編碼轉換時，通常需要以unicode作為中間編碼，即先將其他編碼的字符串解碼（decode）成unicode，再從unicode編碼（encode）成另一種編碼。

decode的作用是將其他編碼的字符串轉換成unicode編碼，如str1.decode('gb2312')，表示將gb2312編碼的字符串str1轉換成unicode編碼。

encode的作用是將unicode編碼轉換成其他編碼的字符串，如str2.encode('gb2312')，表示將unicode編碼的字符串str2轉換成gb2312編碼。

因此，轉碼的時候一定要先搞明白，字符串str是什么編碼，然后decode成unicode，然后再encode成其他編碼.

如：s='中文'

如果是在utf8的文件中，該字符串就是utf8編碼，如果是在gb2312的文件中，則其編碼為gb2312。這種情況下，要進行編碼轉換，都需要先用decode方法將其轉換成unicode編碼，再使用encode方法將其轉換成其他編碼。通常，在沒有指定特定的編碼方式時，都是使用的系統默認編碼創建的代碼文件。

如果字符串是這樣定義：s=u'中文'

則該字符串的編碼就被指定為unicode了，即Python的內部編碼，而與代碼文件本身的編碼無關。因此，對於這種情況做編碼轉換，只需要直接使用encode方法將其轉換成指定編碼即可。

獲得當前環境默認編碼

>>> import sys
>>> print sys.getdefaultencoding()
ascii

修改當前編碼

>>> isinstance(s,unicode)
False
>>> sys.setdefaultencoding("gbk")
>>> unicode(s)
u'\u4e2d\u6587'
>>> s.decode()
u'\u4e2d\u6587'

免責聲明！

本站轉載的文章為個人學習借鑒使用，本站對版權不負任何法律責任。如果侵犯了您的隱私權益，請聯系本站郵箱yoyou2525@163.com刪除。

猜您在找 python編碼encode和decode Python3的decode()與encode() python的encode()和decode()函數 python encode與decode Python encode和decode python的encode()、decode()方法 python3編碼（encode,decode） python中decode和encode的區別在python3 encode和decode 的使用 Python encode()、decode()方法詳解