Mistral OCR 4 Hackernews Viewer

Mistral OCR 4

364 points by meetpateltech 6 hours ago | 93 comments

Comments

ericyd 2 hours ago

I’ve always thought the US Postal Service is such a technological marvel. They somehow manage to identify and route billions of pieces of mail and I have to imagine their tech is significantly more primitive than this. Not only that but US addresses are absurdly non-standardized, you can often write the same address multiple ways and have it deliver to the same location. I’m sure there’s plenty of published knowledge in this area, but whenever I see announcements about OCR it feels like this should be a solved problem if it’s been accomplished at the scale of USPS for many years.

andrewmutz 4 hours ago

A tangential observation: the video on the linked page wasn't what I expected. I thought Mistral was a european AI company, so I didnt expect the video to be filmed in San Francisco featuring three people who don't seem to be european.

I'm not against them being a global organization, that's wonderful. I was just surprised. I expected a parisian office and european accents.

beklein 2 hours ago

All AI labs really need to stop using truncated y-axes for benchmark bar charts...

https://mistral.ai/_astro/cm-engish_ZhlvoT.webp?dpl=6a3a94bd...

mdrzn 5 hours ago

It'll be interesting to see how this ranks against https://github.com/baidu/Unlimited-OCR

themanmaran 4 hours ago

It's cheap at $4/1k, but I'm hesitant to even benchmark this one again since the previous versions were all "98% accurate based on internal benchmarks of 4 pdfs" and ended up falling short of almost everything else on the market [1].

Even in this one, they just report that OlmOCRBench and OmniDocBench have "known limitations" and that's why they report flagship numbers from their internal benchmark.

https://getomni.ai/blog/benchmarking-open-source-models-for-...

sreekanth850 3 hours ago

Tested with Malayalam, normal handwriting got accurate but a slight different style got detected as kannada. Have samples if required, which sarvam got done with 99% accuracy leaving one text error.

mcbetz 5 hours ago

Little on differences other than bounding boxes and double the price compared to their previous OCR v3 model from December - https://mistral.ai/news/mistral-ocr-3/ - other benchmarks were used back then.

utopiah 5 hours ago

"A note on out-of-scope use. OCR 4 is a document-understanding model, not a decision-maker. It is not intended for medical diagnosis, legal advice or judgment, high-stakes financial decisions, safety-critical systems, real-time/latency-sensitive processing, or non-document inputs (raw audio, video, etc.). "

Can't wait for the "oh so innovative" manager who will suggest during the next meeting "Ok... but what if WE used it for high-stakes financial decisions on non-document inputs like a photo from my phone?"

I guarantee you somebody on HN is going to comment about this "idea" next week.

Insanity 5 hours ago

Recently I tied OCR with Opus 4.8. (I know, not technically right tool for the job). All I needed to do was extract dates from receipts. It got about 20% of the dates wrong yet rated all as “high confidence”.

Should have probably tried a more OCR specific model

bastawhiz 3 hours ago

The comparisons rank it against GPT and Gemini but not Claude. Is Claude's vision support simply not competitive when it comes to OCR tasks?

remus 1 hour ago

Given this a test on some scans of magazines, generally pretty impressed with the results. Mags are generally pretty whacky layouts and it does a reasonable job working out what is where and pulling it together into a single coherent md file. The way it crops relevant pics and puts them into the doc is pretty nice.

Haven't compared it with any other high tech OCR estups, but it's way better than the jank that comes as standard with my scanner.

Ducki 5 hours ago

I was processing 55 year old paper files, most of them severely degraded, with its predecessor model. I was very impressed! I also tried Abbyy Finereader but it didn't even come close in my experience.

trilogic 2 hours ago

Mistral keeps reminding us that doesn´t just brew great coffee, they can build great AI too. Hats off to the team. Mistral O.C.R. (Only Cool Results)

pmxi 5 hours ago

This has been a niche where Mistral has actually been successful. Btw, Hindi and Japanese are bucketed in "Rare Languages," which is odd.

nickvec 2 hours ago

Naive question: is Claude no good at OCR? Was surprised to see that none of Anthropic's models were included in the benchmark comparisons.

MostlyStable 5 hours ago

Does anyone know of OCR benchmarks that include hand-written documents? I'm currently using Gemini pro 3 for this, and error rates are quite good, but it's a little bit pricey, and I'd be interested in a cheaper model that could perform as well, but almost all the OCR benchmarks I'm aware of (and I believe all the ones included in this announcement) are about printed/typeset text.

JGB100 2 hours ago

Not well tested. It switched all U.S. (") double quotation marks to UK-style (') single quotation marks, ignoring the source document. Useless in the US.

stri8ted 5 hours ago

Way too expensive. Google vision OCR (which they failed to compare against), is $1.50 per 1k pages. Vs $4 from Mistral.

coulix 4 hours ago

I wonder how it does compare to reducto, pulse, extendai.

jppope 5 hours ago

Is there something wrong with their certificate? Chromium is saying https isn't valid

mrkn1 4 hours ago

This runs for free on CPU https://github.com/kouhxp/textsnap

tdubey 5 hours ago

Are there benchmarks for how this performs on charts, or maybe more accurately, plots? I've yet to find a model that can digitize a plot into X,Y points with some accuracy in my use case of digitizing old datasheets.

Ninjinka 3 hours ago

Is there a complete list of the languages they support, and benchmarks by language, instead of just "Rare Languages"?

ge96 5 hours ago

1000 pages for $4? damn how does it compare to llama parse I wonder

gpm 5 hours ago

Do these models (this one or its competitors) do handwriting recognition?

sscaryterry 2 hours ago

Why the chart crimes?!

v3ss0n 3 hours ago

Not opensource right?

dominotw 3 hours ago

starting y axis from 50 and 95 is a bit mileading

greenleafone7 5 hours ago

After paying for Mistral and using it for a while I genuinely hated it. It's a productivity black hole and can't realistically compete with anyone. I chose it only because it was European, but no. I'd rather let my one year subscription go to waste than use anything 'Mistral'.