Show HN: Qwen-2.5-32B is now the best open source OCR model
12 by themanmaran | 1 comments on Hacker News.
Last week was big for open source LLMs. We got: - Qwen 2.5 VL (72b and 32b) - Gemma-3 (27b) - DeepSeek-v3-0324 And a couple weeks ago we got the new mistral-ocr model. We updated our OCR benchmark to include the new models. We evaluated 1,000 documents for JSON extraction accuracy. Major takeaways: - Qwen 2.5 VL (72b and 32b) are by far the most impressive. Both landed right around 75% accuracy (equivalent to GPT-4o’s performance). Qwen 72b was only 0.4% above 32b. Within the margin of error. - Both Qwen models passed mistral-ocr (72.2%), which is specifically trained for OCR. - Gemma-3 (27B) only scored 42.9%. Particularly surprising given that it's architecture is based on Gemini 2.0 which still tops the accuracy chart. The data set and benchmark runner is fully open source. You can check out the code and reproduction steps here: - https://ift.tt/BHNhUC6... - https://ift.tt/1FnkPHJ - https://ift.tt/sKiISyT
Post Top Ad
Tuesday, April 1, 2025

Home
Hacker News
New top story on Hacker News: Show HN: Qwen-2.5-32B is now the best open source OCR model
New top story on Hacker News: Show HN: Qwen-2.5-32B is now the best open source OCR model
Tags
# Hacker News
Share This
About Unknown
Templatesyard is a blogger resources site is a provider of high quality blogger template with premium looking layout and robust design. The main mission of templatesyard is to provide the best quality blogger templates which are professionally designed and perfectlly seo optimized to deliver best result for your blog.
Hacker News
Labels:
Hacker News
Subscribe to:
Post Comments (Atom)
No comments:
Post a Comment