python處理emoji表情
爬到有表情的內容存mysql會報錯,mongodb則沒有事,解決辦法 去除表情后存儲。
pypi有現成的庫emoji來處理emoji字符串 : https://pypi.org/project/emoji/
安裝:
pip install emoji --upgrade
eg:
>> import emoji >> print(emoji.emojize('Python is :thumbs_up:')) Python is 👍 >> print(emoji.emojize('Python is :thumbsup:', use_aliases=True)) Python is 👍 >> print(emoji.demojize('Python is 👍')) Python is :thumbs_up:
替換函數
def filter_emoji(desstr,restr=''): #過濾表情 try: co = re.compile(u'[\U00010000-\U0010ffff]') except re.error: co = re.compile(u'[\uD800-\uDBFF][\uDC00-\uDFFF]') return co.sub(restr, desstr)