[feat] OCR based application

### What problem does this solve?

Right now, LEANN is using text embedding only. We have two other options for multimodal data:
1. Use DeepSeek OCR or [MinerU](https://github.com/opendatalab/MinerU) to process all into text space
2. maintain both image vectors and text vectors separately

### Proposed solution

RAGanything repo, MinerU

### Example usage

```python
To RAG over vision-rich task
```

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[feat] OCR based application #158

What problem does this solve?

Proposed solution

Example usage

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

[feat] OCR based application #158

Description

What problem does this solve?

Proposed solution

Example usage

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions