環境vs 2019 .Net 4.8
新建一個wpf工程,拖放上一個button一個textbox

nuget下載tesseract,版本信息如圖所示

MainWindow.xaml.cs文件代碼如下:
using Microsoft.Win32;
using System.Windows;
using Tesseract;
namespace TestOCR
{
/// <summary>
/// Interaction logic for MainWindow.xaml
/// </summary>
public partial class MainWindow : Window
{
public MainWindow()
{
InitializeComponent();
}
private void Button_Click(object sender, RoutedEventArgs e)
{
OpenFileDialog ofd = new OpenFileDialog();
if (ofd.ShowDialog() == true)
{
var pix = Pix.LoadFromFile(ofd.FileName);
var ocr = new TesseractEngine("./tessdata", "eng",EngineMode.TesseractAndCube);
var page = ocr.Process(pix);
textbox.Text = page.GetText();
textbox.FontSize = 30;
}
}
}
}
去下載tessdata訓練文件:
https://github.com/tesseract-ocr/tesseract/wiki/Data-Files


下載后解壓,然后一層層文件夾進去找到“tessdata”文件夾,把該文件夾放在wpf工程bin/debug目錄里:

然后准備一個帶文字的圖片,測試:

