Posts tagged with "document-processing"

All posts related to document-processing

olmOCR

In olmOCR, Simon Willison shares an interesting piece of software. OCR is, as far as I know, a resource-heavy and costly process. Unfortunately, the article doesn’t mention the latency of the process. I find the concept of “document anchoring” really interesting, and I appreciate that they released a training dataset, which is still relatively rare these days.

Read more