-
Notifications
You must be signed in to change notification settings - Fork 753
Open
Labels
enhancementNew feature or requestNew feature or requestgood first issueGood for newcomersGood for newcomers
Description
What problem does this solve?
Right now, LEANN is using text embedding only. We have two other options for multimodal data:
- Use DeepSeek OCR or MinerU to process all into text space
- maintain both image vectors and text vectors separately
Proposed solution
RAGanything repo, MinerU
Example usage
To RAG over vision-rich taskMetadata
Metadata
Assignees
Labels
enhancementNew feature or requestNew feature or requestgood first issueGood for newcomersGood for newcomers