Original article excerpt
Server-side extracted preview paragraphs from the original source.
A Blog post by Technology Innovation Institute on Hugging Face
We also relase Falcon OCR, a 0.3B-parameter model which reaches a score of 80.3 and 88.6 on the olmOCR benchmark and OmniDocBench respectively, while having the highest throughput of any open source OCR model.
This post is a brief, practical write-up of what we built, why we built it this way, and what we learned along the way.