python中文ocr方案-pytesseract

本文轉載自查看原文 2017-07-26 11:31 4197 tesseract/ 機器學習/ pytesseract/ OCR

pytesseract是google維護的具有學習功能的OCR引擎，3.0以后支持中文識別。

安裝：

1. 安裝tesseract-ocr組件；記得同步下載簡體中文與英文語言包。

2. 安裝PIL，需注意Windows64位版本

3. pip install pytesseract

使用:

image = Image.open("1.jpg")  # 打開圖片
image.load()  # 加載一下圖片，防止報錯，此處可省略
image.show()  # 調用show來展示圖片，調試用，可省略
tessdata_dir_config = '--tessdata-dir "C:\\Program Files (x86)\\Tesseract-OCR\\tessdata"'
vcode = pytesseract.image_to_string(image, lang='chi_sim', config=tessdata_dir_config)
print vcode

免責聲明！

本站轉載的文章為個人學習借鑒使用，本站對版權不負任何法律責任。如果侵犯了您的隱私權益，請聯系本站郵箱yoyou2525@163.com刪除。

猜您在找 Python 進行 OCR識別 -- pytesseract庫 python中ocr軟件pytesseract使用 tesseract-OCR + pytesseract安裝 Tesseract-ocr視覺學習-驗證碼識別及python import pytesseract使用一個 Python 包 pytesseract ，幾行代碼實現 OCR 文本識別技術！ Python驗證碼識別安裝Pillow、tesseract-ocr與pytesseract模塊的安裝以及錯誤解決使用python的pytesseract調用谷歌tesseract-ocr識別中英文字符 Python3實現自動查詢成績（主要使用的包有Tesseract-OCR、PIL、execjs、pytesseract、BeautifulSoup） python 安裝 pytesseract python pytesseract使用