The difference between decode and encode in Python

This article is a repost. 2019-11-12 09:10 Python

# -*- coding: utf-8 -*-
import sys
'''
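First, be clear about how Python 2 represents text: a unicode object holds decoded text, while a plain str holds encoded bytes.
When converting between encodings, unicode therefore serves as the intermediate form:
first decode the byte string from its source encoding into unicode, then encode the unicode object into the target encoding.
decode converts a byte string in some encoding into unicode. For example, str1.decode('gb2312') converts the gb2312-encoded string str1 into unicode.
encode converts unicode into a byte string in some encoding. For example, str2.encode('gb2312') converts the unicode string str2 into gb2312.
In short: to convert a string from another encoding to utf-8, decode it to unicode first and then encode it as utf-8. unicode is the intermediary.
Example: s='中文'
In a file saved as utf-8, this string literal is utf-8 encoded; in a file saved as gb2312, it is gb2312 encoded. Either way, converting it
means calling decode to get unicode and then encode to get the target encoding. When no encoding is chosen explicitly, editors usually save
the file in the system default encoding, so the coding declaration at the top of the file should match the encoding the file was actually saved in.
For example:
s.decode('utf-8').encode('utf-8')
decode(): decodes a byte string into unicode
encode(): encodes unicode into a byte string
isinstance(s,unicode): checks whether s is a unicode object; returns True if it is, otherwise False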

'''
'''
s='中文'
s=s.decode('utf-8') # decode the utf-8 bytes into unicode
print isinstance(s,unicode) # prints True
s=s.encode('utf-8') # encode the unicode object back into utf-8 bytes
print isinstance(s,unicode) # prints False
'''
print sys.getdefaultencoding()

s='中文'
if isinstance(s,unicode): # already unicode: encode directly, no decoding needed
    print s.encode('utf-8')
else: # byte string: decode from utf-8 first, then encode as gb2312
    print s.decode('utf-8').encode('gb2312')

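print sys.getdefaultencoding() # get the interpreter's default encoding
reload(sys) # reload restores sys.setdefaultencoding, which site.py removes at startup
sys.setdefaultencoding('utf8') # change the default encoding (Python 2 only)
print sys.getdefaultencoding()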
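
The script above is Python 2 (print statements, the unicode type). As a rough sketch of the same decode-then-encode idea in Python 3, where bytes holds encoded data and str is always Unicode text, something like the following should work. The helper name transcode is hypothetical and does not come from the original post.

```python
def transcode(data: bytes, source: str, target: str) -> bytes:
    """Re-encode bytes from one encoding to another, using str as the intermediate form."""
    text = data.decode(source)   # bytes -> str (Unicode text)
    return text.encode(target)   # str -> bytes in the target encoding


utf8_bytes = "中文".encode("utf-8")                   # 3 bytes per character in utf-8
gb_bytes = transcode(utf8_bytes, "utf-8", "gb2312")   # 2 bytes per character in gb2312

# isinstance(x, str) plays the role that isinstance(x, unicode) has in Python 2.
is_text_before = isinstance(utf8_bytes, str)
is_text_after = isinstance(utf8_bytes.decode("utf-8"), str)

print(len(utf8_bytes), len(gb_bytes), is_text_before, is_text_after)
```

The conversion never goes directly from one byte encoding to another; the str value in the middle is what makes the two codecs independent of each other.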

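The reload(sys) / sys.setdefaultencoding trick applies to Python 2 only. Assuming a Python 3 interpreter, a minimal sketch of the usual alternative is to leave the default alone and name the codec explicitly at every decode or encode call. The variable names below are illustrative only.

```python
import sys

# On Python 3 the default string encoding is 'utf-8' and cannot be changed:
# sys.setdefaultencoding does not exist there.
default = sys.getdefaultencoding()
has_setter = hasattr(sys, "setdefaultencoding")

gb_bytes = "中文".encode("gb2312")

# Decoding with the wrong codec raises an error instead of silently guessing.
try:
    gb_bytes.decode("utf-8")
    wrong_codec_ok = True
except UnicodeDecodeError:
    wrong_codec_ok = False

# Naming the codec explicitly keeps the result independent of any default.
text = gb_bytes.decode("gb2312")

print(default, has_setter, wrong_codec_ok, len(text))
```

Passing the encoding by name at each call keeps the behaviour the same on every machine, which is the problem the default-encoding change in the Python 2 script was working around.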
Original link: https://blog.csdn.net/qq_34162294/article/details/53727357
