Commit cea0cbf

Update Chandra model link in ocr-open-models.md (#3161)
1 parent 7bfc0cc commit cea0cbf

1 file changed, +1 -1 lines changed

ocr-open-models.md

Lines changed: 1 addition & 1 deletion
@@ -137,7 +137,7 @@ Many of the models in this collection have been fine-tuned from Qwen2.5-VL or Qw
 | [OlmOCR-2](https://huggingface.co/allenai/olmOCR-2-7B-1025) | Markdown, HTML, LaTeX | Grounding<br>Optimized for large-scale batch processing | 8B | ❎English-only | 82.3 ± 1.1 |
 | [Granite-Docling-258M](https://huggingface.co/ibm-granite/granite-docling-258M) | DocTags | Prompt-based task switching<br>Ability to prompt element locations with location tokens<br>Rich output | 258M | ✅Supports English, Japanese, Arabic and Chinese. | N/A |
 | [DeepSeek-OCR](https://huggingface.co/deepseek-ai/DeepSeek-OCR) | Markdown, HTML | Supports general visual understanding<br>Can parse and re-render all charts, tables, and more into HTML<br>Handles handwriting<br>Memory-efficient, solves text through image | 3B | ✅Supports nearly 100 languages | 75.4 ± 1.0 |
-| [Chandra](https://huggingface.co/deepseek-ai/DeepSeek-OCR) | Markdown, HTML, JSON | Grounding<br>Extracts and inserts images as is | 9B | ✅Supports 40+ languages | 83.1 ± 0.9 |
+| [Chandra](https://huggingface.co/datalab-to/chandra) | Markdown, HTML, JSON | Grounding<br>Extracts and inserts images as is | 9B | ✅Supports 40+ languages | 83.1 ± 0.9 |
 | [Qwen3-VL](https://huggingface.co/collections/Qwen/qwen3-vl) | Vision Language Model can output in all formats | Can recognize ancient text<br>Handles handwriting<br>Extracts and inserts images as is | 9B | ✅Supports 32 languages | N/A |

 While Qwen3-VL itself is a powerful and versatile vision-language model post-trained for document understanding and other tasks, it isn’t optimized for a single, universal OCR prompt. In contrast, the other models were fine-tuned using one or a few fixed prompts specifically designed for OCR tasks. So to use Qwen3-VL, we recommend experimenting with prompts.
