python實現中文驗證碼識別方法（親測通過）

本文轉載自查看原文 2018-05-31 14:20 3937 python


驗證碼截圖如下：


# coding:utf-8
from PIL import Image,ImageEnhance
import pytesseract
#上面都是導包，只需要下面這一行就能實現圖片文字識別
im = Image.open('merge_source.jpg')
#下面為增強部分
enh_con = ImageEnhance.Contrast(im)
contrast = 1.5
image_contrasted = enh_con.enhance(contrast)
#image_contrasted.show()

#增強亮度
enh_bri = ImageEnhance.Brightness(image_contrasted)
brightness = 1.5
image_brightened = enh_bri.enhance(brightness)
#image_brightened.show()
#增強對比度
enh_col = ImageEnhance.Color(image_brightened)
color = 1.5
image_colored = enh_col.enhance(color)
#image_colored.show()
#增強銳度
enh_sha = ImageEnhance.Sharpness(image_colored)
sharpness = 3.0
image_sharped = enh_sha.enhance(sharpness)
#image_sharped.show()

#灰度處理部分
im2=image_sharped.convert("L")
im2.show()
text=pytesseract.image_to_string(im2,lang='chi_sim').strip() #使用image_to_string識別驗證碼
print(text)
打印結果：

知識擴展：

1.需要安裝的庫Pillow 庫（PIL）和pytesseract ，安裝方式直接通過pip安裝

2.需要安裝windows版本的TesseractOCRiOS(光學字符識別)程序，下載地址：http://xza.198424.com/tesserract.zip

特別注意，ocr安裝完成后需要配置環境變量

　① 在安裝位置時，需要在語言選擇

　　展開，找到簡體中文進行勾選，如果有其他字體需要也可以勾選安裝，語言會安裝到安裝目錄下的tessdata文件夾中

　① tesseract的環境變量 C:\Program Files (x86)\Tesseract-OCR ，根據實際安裝位置填寫

　②C:\Program Files (x86)\Tesseract-OCR\tessdata

#打碼的路上還很長

免責聲明！

本站轉載的文章為個人學習借鑒使用，本站對版權不負任何法律責任。如果侵犯了您的隱私權益，請聯系本站郵箱yoyou2525@163.com刪除。

猜您在找 Python實現圖片驗證碼識別 python實現圖文驗證碼識別 Python 代碼實現驗證碼識別 Python驗證碼識別 Python驗證碼識別 python識別驗證碼基於LeNet網絡的中文驗證碼識別 Python驗證碼識別的幾種方法 python 實現（附帶驗證碼識別）的模擬登陸 Python 實現自動識別驗證碼