[RAGFlow] RAG Evaluation feature request #1259

SUNbrightness · 2024-05-18T02:44:49Z

SUNbrightness
May 18, 2024

Now the RAG development every phase need to manually test each QA, assess the RAG result, which cost too much. All of these are hard to quantify.

I would like to have a "test evaluation function". Manually maintain multiple test sets containing questions, answers, index_ids, and use NLP or a LLM to do the evaluation, record the document hit rate and the answer correct rate. Having such a metric is very beneficial for RAG tuning and project delivery.
Now Dify, FastGPT, etc. are missing such a feature!

JinHai-CN · 2024-05-19T02:28:51Z

JinHai-CN
May 19, 2024
Maintainer

We intend to create an international community, so we encourage using English for questions and answers to help others with similar queries. 😊 So, I have translated the original text into English, so if anything is wrong or missing, please correct the description.

0 replies

JinHai-CN · 2024-05-19T02:33:42Z

JinHai-CN
May 19, 2024
Maintainer

Thank you for your advice.

Here are some details that I would like to discuss with you.

Which data sets do you suggest? Or what data sets do you typically use to evaluate RAG system?
How to assess the quality of data chunking?

1 reply

SUNbrightness May 21, 2024
Author

thank you for the response.

Our Rag Project require our customer offer a excel which include four colums
question, answer, ref file, documents fragment
Now, of course, we do some of the evaluation manually using these QA
I'm sure you have more experience with the methods of evaluation, because there are a lot of theoretical methods.
I have some simple ideas.
Using the above evaluation data, the index hit rate and answer accuracy are evaluated. This can be done using some of the methods of Nlp assessment or using gpt.

The index hit ratio and answer accuracy are the two metrics that I find most meaningful。

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

InfiniFlow

[RAGFlow] RAG Evaluation feature request #1259

{{title}}

{{editor}}'s edit

{{editor}}'s edit

Replies: 0 comments 3 replies

{{title}}

{{title}}

{{title}}

Select a reply

InfiniFlow

[RAGFlow] RAG Evaluation feature request #1259

SUNbrightness May 18, 2024

Replies: 0 comments · 3 replies

JinHai-CN May 19, 2024 Maintainer

JinHai-CN May 19, 2024 Maintainer

SUNbrightness May 21, 2024 Author

SUNbrightness
May 18, 2024

Replies: 0 comments 3 replies

JinHai-CN
May 19, 2024
Maintainer

JinHai-CN
May 19, 2024
Maintainer

SUNbrightness May 21, 2024
Author