The difference between decode and encode in Python

Reposted article, 2017-12-28 14:42. Tag: python

# -*- coding: utf-8 -*-
import sys
'''
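First, be clear about one thing: in Python 2, text is represented internally as unicode objects, while an ordinary str is a sequence of bytes in some encoding. Encoding conversions therefore normally use unicode as the intermediate form:
first decode the byte string from its source encoding into unicode, then encode the unicode into the target encoding.
decode converts a byte string in some encoding into unicode. For example, str1.decode('gb2312') converts the gb2312-encoded string str1 into unicode.
encode converts unicode into a byte string in some encoding. For example, str2.encode('gb2312') converts the unicode string str2 into gb2312.
In short: to convert another encoding to utf-8, first decode it to unicode and then re-encode it as utf-8; unicode is the intermediary.
Example: s='中文'
In a file saved as utf8 this string is utf8-encoded; in a file saved as gb2312 it is gb2312-encoded. Either way, to convert it you first
call decode to get unicode and then call encode to get the target encoding. When no encoding is specified, the source file is usually created in the system default encoding.
For example:
s.decode('utf-8').encode('utf-8')
decode(): decodes (byte string to unicode)
encode(): encodes (unicode to byte string)
isinstance(s,unicode): checks whether s is a unicode object; returns True if it is, False otherwise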

'''
'''
s='中文'
s=s.decode('utf-8')   # decode the utf-8 byte string into unicode
print isinstance(s,unicode)   # prints True at this point
s=s.encode('utf-8')           # encode the unicode back into utf-8
print isinstance(s,unicode)   # prints False at this point
'''
print sys.getdefaultencoding()

s='中文'
if isinstance(s,unicode):   # already unicode: encode directly, no decode needed
    print s.encode('utf-8')
else:                       # byte string: decode from utf-8 first, then encode as gb2312
    print s.decode('utf-8').encode('gb2312')

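print sys.getdefaultencoding()    # get the system default encoding
reload(sys)                       # restores setdefaultencoding, which site.py deletes at startup
sys.setdefaultencoding('utf8')    # change the system default encoding
print sys.getdefaultencoding()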
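
The script above is Python 2. For readers on Python 3, the same decode-then-encode idea might be sketched as follows. This is only a minimal sketch under the assumption that Python 3 is used, where str is already Unicode text and bytes holds encoded data; the helper name transcode is hypothetical and does not come from the original article.

```python
def transcode(data, src_encoding, dst_encoding):
    """Re-encode bytes from src_encoding to dst_encoding via Unicode text.

    Hypothetical helper for illustration only; assumes Python 3.
    """
    text = data.decode(src_encoding)   # bytes -> str (Unicode)
    return text.encode(dst_encoding)   # str -> bytes in the target encoding


# In Python 3 a string literal is already Unicode, so it can be encoded directly.
utf8_bytes = '中文'.encode('utf-8')

# utf-8 bytes -> Unicode -> gb2312 bytes, mirroring the decode/encode chain above.
gb_bytes = transcode(utf8_bytes, 'utf-8', 'gb2312')

print(isinstance(utf8_bytes, bytes))   # the encoded form is bytes, not str
print(gb_bytes)                        # the raw gb2312 bytes
```

Note that in Python 3 str has no decode method and bytes has no encode method, so the direction of each call is enforced by the types themselves.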
