[RAGFlow] RAG Evaluation feature request #1259
SUNbrightness
started this conversation in
Ideas
Replies: 0 comments 3 replies
-
We intend to create an international community, so we encourage using English for questions and answers to help others with similar queries. 😊 So, I have translated the original text into English, so if anything is wrong or missing, please correct the description. |
Beta Was this translation helpful? Give feedback.
0 replies
-
Thank you for your advice. Here are some details that I would like to discuss with you.
|
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Now the RAG development every phase need to manually test each QA, assess the RAG result, which cost too much. All of these are hard to quantify.
I would like to have a "test evaluation function". Manually maintain multiple test sets containing questions, answers, index_ids, and use NLP or a LLM to do the evaluation, record the document hit rate and the answer correct rate. Having such a metric is very beneficial for RAG tuning and project delivery.
Now Dify, FastGPT, etc. are missing such a feature!
Beta Was this translation helpful? Give feedback.
All reactions