C# - Tesseract OCR: scan multiple languages at once
Any idea how to do this?
TesseractEngine engine = new TesseractEngine("./tessdata", "eng", EngineMode.Default);
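Normally, for a single language, adding its abbreviation is enough. But what if I want to scan an image that contains several languages? By the way, I'm using Charles Weld's package. Thanks.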
According to here, the + syntax is supported, so you just need to add a + sign, like this:
TesseractEngine engine = new TesseractEngine("./tessdata", "jpn+eng", EngineMode.Default); // jpn+eng for Japanese and English
The output can be different based on the order of languages, so -l eng+hin can give different result than -l hin+eng.
As far as I know, the language you specify first is recognized more accurately.
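Since the order matters and every language in the string needs its own data file, it can help to build the string in one place. The following is only a sketch: LanguageSpec and Build are hypothetical names, not part of Charles Weld's package, and it assumes the usual layout where each language has a <lang>.traineddata file inside the tessdata folder.

```csharp
using System;
using System.IO;

// Hypothetical helper (not part of the Tesseract package): builds a
// "lang1+lang2" string and checks that each language's data file exists.
public static class LanguageSpec
{
    public static string Build(string tessdataDir, params string[] languages)
    {
        if (languages == null || languages.Length == 0)
            throw new ArgumentException("At least one language is required.", nameof(languages));

        foreach (var lang in languages)
        {
            // Assumption: data files are named <lang>.traineddata in tessdataDir.
            var path = Path.Combine(tessdataDir, lang + ".traineddata");
            if (!File.Exists(path))
                throw new FileNotFoundException("Missing language data: " + path, path);
        }

        // Argument order is preserved, so the first argument is listed first.
        return string.Join("+", languages);
    }
}
```

The result can be passed as the second argument of the TesseractEngine constructor, for example LanguageSpec.Build("./tessdata", "jpn", "eng"). Swapping the arguments gives the other ordering, which makes it easy to try both on the same image and keep whichever reads better.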