Error 1
pytesseract.pytesseract.TesseractNotFoundError: tesseract is not installed or it's not in your path
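Solution
Download and install 'Tesseract-OCR', choosing the build that matches your system.
I downloaded the Windows build of Tesseract.
In the pytesseract source code, find the line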
tesseract_cmd = 'tesseract'
and change it to
tesseract_cmd = r'D:\Program Files (x86)\Tesseract-OCR\tesseract.exe'
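As an alternative to editing the installed package, the same path can usually be set from your own script through the module attribute pytesseract.pytesseract.tesseract_cmd. The sketch below is only an illustration: find_tesseract, configure_pytesseract and CANDIDATE_PATHS are hypothetical names, and the candidate path is the install location used in this post, so adjust it to your machine.

```python
import os
import shutil

# Hypothetical list of places to look; this one is the install path from this post.
CANDIDATE_PATHS = [
    r"D:\Program Files (x86)\Tesseract-OCR\tesseract.exe",
]


def find_tesseract(candidates=CANDIDATE_PATHS):
    """Return a path to the tesseract executable, or None if none is found."""
    # First check whether tesseract is already reachable through PATH.
    found = shutil.which("tesseract")
    if found:
        return found
    # Otherwise fall back to the explicit candidate locations.
    for path in candidates:
        if os.path.isfile(path):
            return path
    return None


def configure_pytesseract(cmd_path):
    """Point pytesseract at the given executable without editing its source."""
    import pytesseract  # imported here so the lookup above works without it

    pytesseract.pytesseract.tesseract_cmd = cmd_path


if __name__ == "__main__":
    cmd = find_tesseract()
    if cmd is None:
        print("tesseract executable not found; install it or extend CANDIDATE_PATHS")
    else:
        configure_pytesseract(cmd)
        print("using", cmd)
```

Setting the attribute at runtime means the change survives a reinstall or upgrade of pytesseract, which an edit to the package source would not.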
Error 2
E:\BuildFolder\tesseract-ocr\testing>tesseract-dlld.exe eurotext.tif eurotext
Error opening data file ./tessdata/eng.traineddata
Please make sure the TESSDATA_PREFIX environment variable is set to the parent directory of your "tessdata" directory.
Failed loading language 'eng'
Tesseract couldn't load any languages!
Could not initialize tesseract.
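Solution
- Put the tessdata directory in the same directory as tesseract.exe, or
- Add TESSDATA_PREFIX=D:\Program Files (x86)\Tesseract-OCR to your environment variables.
To test this, set the environment variable temporarily in cmd: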
set TESSDATA_PREFIX=D:\Program Files (x86)\Tesseract-OCR
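The variable can also be set from Python before Tesseract is invoked, since child processes inherit os.environ. This is a minimal sketch under the assumption stated in the error message above, namely that TESSDATA_PREFIX must be the parent directory of tessdata; some newer Tesseract releases expect the tessdata directory itself, so follow whatever your own error message says. set_tessdata_prefix is a hypothetical helper name.

```python
import os


def set_tessdata_prefix(prefix, lang="eng"):
    """Set TESSDATA_PREFIX to `prefix` if <prefix>/tessdata/<lang>.traineddata exists.

    Returns True when the variable was set, False when the data file is missing.
    Assumes the layout from the error message: prefix is the PARENT of tessdata.
    """
    data_file = os.path.join(prefix, "tessdata", lang + ".traineddata")
    if not os.path.isfile(data_file):
        return False
    # Only affects this process and any child processes it starts.
    os.environ["TESSDATA_PREFIX"] = prefix
    return True


if __name__ == "__main__":
    # Install path used in this post; adjust to your machine.
    ok = set_tessdata_prefix(r"D:\Program Files (x86)\Tesseract-OCR")
    print("TESSDATA_PREFIX set" if ok else "eng.traineddata not found under that prefix")
```

Checking for the language file first gives a clearer failure than the "Failed loading language" message, and like the cmd `set` command the change is temporary: it lasts only for the running process.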