Mistral OCR 4 with structured document extraction, 170 languages and self-hosting launched

Mistral AI has introduced Mistral OCR 4, a new optical character recognition (OCR) model designed for enterprise document understanding. Unlike previous versions that mainly converted documents into text and tables, OCR 4 produces structured document outputs with bounding boxes, block classification, and confidence scores for every page and word.