C#如何OCR离线识别文本

本项目是基于开源项目PaddleOCR的C++代码修改并封装的.NET库，包含文本OCR功能。同时也提供了C++的调用示例代码，可以供C++开发者使用。

PaddleOCR.dll文件是基于开源项目PaddleOCR的C++代码修改的C++动态库，基于opencv的x64编译而成的。.

为了降低部署文件大小，PaddleOCR.dll使用了openblas依赖编译，paddle_inference.dll是官方提供的openblas模式下的库。

模型库支持轻量版（本项目）、服务器版模型库（更准确），可以自行更改模型库适用实际需求。

PS：有更好的方式欢迎推荐。

使用方式

文件夹结构

Cpp //PaddleOCR.dll的头文件和库文件，方便C++调用PaddleOCR.dll

一、C++代码如下。

#include <iostream>#include <Windows.h>#include "include/PaddleOCR.h"#include "include/OCRResult.h"#include <tchar.h>#include "string"#pragma comment (lib,"PaddleOCR.lib")using namespace std;int main(){  LpOCRResult lpocrreult;  modeldata md;  OCRParameter parameter;  char path[MAX_PATH];  GetCurrentDirectoryA(MAX_PATH, path);  string cls_infer(path);  cls_infer += "\\inference\\ch_ppocr_mobile_v2.0_cls_infer";  string rec_infer(path);  rec_infer += "\\inference\\ch_PP-OCRv2_rec_infer";  string det_infer(path);  det_infer += "\\inference\\ch_PP-OCRv2_det_infer";  string ocrkeys(path);  ocrkeys += "\\inference\\ppocr_keys.txt";  string imagefile(path);  imagefile += "\\test.png";  md.cls_infer = const_cast<char*>(cls_infer.c_str());  md.rec_infer = const_cast<char*>(rec_infer.c_str());  md.det_infer = const_cast<char*>(det_infer.c_str());  md.keys = const_cast<char*>(ocrkeys.c_str());  md.imagefile = const_cast<char*>(imagefile.c_str());  int  cout =Detect(md.det_infer, md.cls_infer, md.rec_infer, md.keys, md.imagefile, parameter, &lpocrreult);  for (size_t i = 0; i < cout; i++)  {    wstring ss =(WCHAR*)(lpocrreult->pOCRText[i].ptext);    std::wcout <<ss;  }  FreeDetectMem(lpocrreult);}

二、添加引用PaddleOCRSharp.dll System.Drawing.dll

运行需要用的库文件目录如下。

PaddleOCRLib  //OCR运行需要的文件|--inference     //OCR的模型库文件夹|--openblas.dll   //第三方引用库|--paddle_inference.dll   //飞桨库|--PaddleOCR.dll   //基于开源项目PaddleOCR修改的C++动态库PaddleOCRSharp  //.NET封装库

C#如何OCR离线识别文本

二、新建 MainWindow.xaml 代码点击按钮调用OCR识别如下

using PaddleOCRSharp;private void button1_Click(object sender, EventArgs e)        {            OpenFileDialog ofd = new OpenFileDialog();            ofd.Filter = "*.*|*.bmp;*.jpg;*.jpeg;*.tiff;*.tiff;*.png";            if (ofd.ShowDialog() != DialogResult.OK) return;            var imagebyte = File.ReadAllBytes(ofd.FileName);            Bitmap bitmap = new Bitmap(new MemoryStream(imagebyte));            OCRResult ocrResult = PaddleOCRSharp.PaddleOCRHelper.DetectText(bitmap);            if (ocrResult != null)            {                MessageBox.Show(ocrResult.Text,"识别结果");            }
        }

源码地址如下

gitee：https://gitee.com/raoyutian/paddle-ocrsharp

使用方式

董川民