Settings

Theme

TurboOCR: 270–1200 img/s OCR with Paddle and TensorRT (C++/CUDA, FP16)

github.com

7 points by pfdomizer 12 days ago · 4 comments

Reader

leechii1337 12 days ago

how does this compare to e.g. docling, mineru. hard to keep track of all the ocr libs that are being posted.

  • pfdomizerOP 12 days ago

    Docling and MinerU are great for structured output like markdown and table extraction, but they run at 1-5 pages/s because of the VLMs under the hood.

    Turbo-OCR gives you bounding boxes, text, and layout regions at multiple hundred img/s depending on the text density. When you have many PDFs to process, it makes a huge difference. You can always pipe the output into a VLM for the pages that need deeper extraction. Structured extraction and markdown output are on the roadmap (without sacrificing too much speed).

armando1514 12 days ago

Exactly what I was looking for!

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection