"Open Source Selection" is a column where we share high-quality projects from open source communities such as GitHub and Gitee, covering technology, learning, practical tools, and other interesting content. This issue recommends an open source OCR tool that converts images to text: Umi-OCR.

Features

Free: All code in this project is open source and completely free.
Convenient: Unzip and run. It works offline, with no network connection required.
Batch: Images can be imported and processed in batches, and the results can be saved locally in several formats, including txt, md, and jsonl files. You can also take a screenshot and have it recognized immediately.
Efficient: It uses the PaddleOCR-json C++ recognition engine. On a sufficiently powerful computer, it is usually faster than online OCR services.
Precise: It uses the PPOCR-v3 model library by default. Besides recognizing regular text accurately, it also achieves a good recognition rate on handwriting, rotated text, and cluttered backgrounds. You can set ignore areas to exclude watermarks, and enable text block post-processing to merge blocks into properly ordered paragraphs.

Easy to get started

Screenshot recognition

Click the screenshot button, or press a custom shortcut key, to invoke screenshot recognition.

Paste the picture to the software

Copy an image from anywhere (such as a file manager, a web page, or WeChat), click the paste button in the software, and the image is recognized automatically.

Batch recognition of local image files

Drag images or folders into the software to convert them to text in batches. You can also click the button to open a file browser window and import them from there.

The recognition results are saved locally. You can choose the output format: plain text txt files, Markdown files with links, jsonl files with the raw information, and others. The software can also be configured to shut down the computer or put it on standby once the task is complete.
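To make the difference between these output formats concrete, here is a minimal sketch in Python. It is not Umi-OCR's own export code: the record layout (a dict with `file` and `text` keys) and the function names `to_txt`, `to_markdown`, and `to_jsonl` are assumptions made up for this illustration.

```python
import json

# Hypothetical record layout: one dict per recognized image.
# The keys "file" and "text" are assumptions, not Umi-OCR's actual schema.

def to_txt(results):
    """Plain text: only the recognized text, one image per line."""
    return "\n".join(r["text"] for r in results) + "\n"

def to_markdown(results):
    """Markdown: a link to each source image followed by its text."""
    return "\n".join(
        f"[{r['file']}]({r['file']})\n\n{r['text']}\n" for r in results
    )

def to_jsonl(results):
    """JSON Lines: one complete JSON object per line, raw information kept."""
    return "".join(json.dumps(r, ensure_ascii=False) + "\n" for r in results)
```

The jsonl form keeps every field of each record, so it is the one to choose if the results will be processed by another program later.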

Text block post-processing (typesetting optimization)

The text recognized by OCR is returned in "blocks". Usually one line of text forms one block, but sometimes a single line is mistakenly split into several blocks, which makes the result harder to read. Text block post-processing reprocesses these blocks: it merges the text that belongs to the same line or the same paragraph and sorts it into the correct reading order.
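The idea of merging blocks on the same line can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the algorithm Umi-OCR actually uses: the `Block` type, the `merge_lines` function, and the `overlap_ratio` threshold are all hypothetical, and blocks are assumed to be axis-aligned rectangles.

```python
from dataclasses import dataclass

@dataclass
class Block:
    """Hypothetical text block: recognized text plus its bounding box."""
    text: str
    x: float  # left edge
    y: float  # top edge
    w: float  # width
    h: float  # height

def merge_lines(blocks, overlap_ratio=0.5):
    """Group blocks that overlap vertically into lines, top to bottom.

    A block joins an existing line when its vertical overlap with the
    line's first block is at least overlap_ratio of the shorter height.
    """
    lines = []
    for b in sorted(blocks, key=lambda b: b.y):
        for line in lines:
            ref = line[0]
            overlap = min(ref.y + ref.h, b.y + b.h) - max(ref.y, b.y)
            if overlap >= overlap_ratio * min(ref.h, b.h):
                line.append(b)
                break
        else:
            # No existing line matched, so this block starts a new line.
            lines.append([b])
    # Within each line, order blocks left to right and join their text.
    return [
        " ".join(b.text for b in sorted(line, key=lambda b: b.x))
        for line in lines
    ]

blocks = [
    Block("world", 60, 11, 40, 10),
    Block("Hello", 10, 10, 40, 10),
    Block("Second", 10, 30, 50, 10),
]
print(merge_lines(blocks))  # ['Hello world', 'Second']
```

Joining with a space suits English text. For Chinese, the blocks would be joined without a separator, and multi-column or vertical layouts need a different ordering rule, which is why different processing schemes exist for different layouts.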

The following figure shows which processing scheme should be used for different typesetting:

Ignore area features

The ignore area is a special feature of this software. It removes interference from watermarks in the image, so that only the text you need remains in the recognition result.
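Conceptually, an ignore area works as a filter over the recognized text blocks. The Python sketch below shows one possible rule, dropping every block whose bounding box lies entirely inside an ignore rectangle. That rule, the `(left, top, right, bottom)` box format, and the names `inside` and `drop_ignored` are assumptions for illustration, not Umi-OCR's actual code.

```python
def inside(box, area):
    """True if box lies completely within area.

    Both are (left, top, right, bottom) tuples in pixel coordinates
    (an assumed format for this sketch).
    """
    return (
        area[0] <= box[0]
        and area[1] <= box[1]
        and box[2] <= area[2]
        and box[3] <= area[3]
    )

def drop_ignored(blocks, ignore_areas):
    """Keep only blocks that are not fully inside any ignore area."""
    return [
        b for b in blocks
        if not any(inside(b["box"], area) for area in ignore_areas)
    ]

blocks = [
    {"text": "watermark", "box": (5, 5, 95, 25)},
    {"text": "Subtitle line", "box": (100, 400, 500, 440)},
]
ignore_areas = [(0, 0, 100, 30)]  # top-left corner, where the watermark sits
print([b["text"] for b in drop_ignored(blocks, ignore_areas)])  # ['Subtitle line']
```

Under this rule, a block that only partly overlaps an ignore area is kept, so text that merely touches the edge of a watermark region is not lost.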

Easily exclude watermarks from video screenshots

Exclude the two UIs in game screenshots

Download & Instructions

Download

https://github.com/hiroi-sora/Umi-OCR/releases/tag/v1.3.0

Instructions

https://github.com/hiroi-sora/Umi-OCR#%E7%AE%80%E5%8D%95%E4%B8%8A%E6%89%8B

-END-

Open source license: MIT

Open source address: https://github.com/hiroi-sora/Umi-OCR
