In this module, we'll cover techniques that can improve your RAG pipeline:
- Small-to-Big chunk retrieval
- Leveraging document metadata
- Hybrid search
- User query rewriting
- Document reranking
Links:
- Slides (TBD)
- Five Techniques for Improving RAG Chatbots - Nikita Kozodoi [Video]
- Survey on RAG techniques [Article]
Links:
- Reciprocal Rank Fusion (RRF) method [Elasticsearch Guide]
- RRF method [Article]
- Elasticsearch subscription plans
To use reranking based on the Reciprocal Rank Fusion (RRF) algorithm, pull and run a Docker container with Elasticsearch 8.9.0 or higher:
docker run -it \
--rm \
--name elasticsearch \
-m 4GB \
-p 9200:9200 \
-p 9300:9300 \
-e "discovery.type=single-node" \
-e "xpack.security.enabled=false" \
docker.elastic.co/elasticsearch/elasticsearch:8.9.0
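To build intuition for what RRF does when it combines result lists, here is a minimal pure-Python sketch. It is not Elasticsearch's implementation. The function name `reciprocal_rank_fusion`, the sample document IDs, and the choice of `k=60` (a constant commonly used for RRF) are assumptions made for illustration. Each document receives a score of `1 / (k + rank)` from every list it appears in, and the scores are summed.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of document IDs into one ranking.

    rankings: list of lists, each ordered from most to least relevant.
    k: smoothing constant (assumed value; tune it for your data).
    """
    scores = defaultdict(float)
    for ranking in rankings:
        # Ranks are 1-based: the top hit contributes 1 / (k + 1).
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first.
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Hypothetical results from a keyword search and a vector search.
keyword_hits = ["doc_a", "doc_b", "doc_c"]
vector_hits = ["doc_c", "doc_a", "doc_d"]

fused = reciprocal_rank_fusion([keyword_hits, vector_hits])
for doc_id, score in fused:
    print(f"{doc_id}: {score:.5f}")
```

Documents that rank well in both lists (`doc_a`, `doc_c`) end up ahead of documents that appear in only one, which is the behavior hybrid search relies on.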
TBD