Custom RAG Application

Retrieval-Augmented Generation (RAG) is a technique that enhances LLM responses by first retrieving relevant documents or data from an external knowledge source, then passing that retrieved context along with the user's query to the model. This allows the LLM to ground its answers in up-to-date or domain-specific information beyond what was baked into its training weights.

Setup

This application assumes you have Docker and the Docker Model Runner installed.
Deployment assumes you have already created an S3 bucket for RAG content, as well as a AWS security key and secret key. If running locally, copy .env.example to .env and provide credentials. If running in EC2 using an IAM role with permission to the bucket, comment out those lines in .env.
Pull the relevant model manually from the CLI:
```
docker model pull ai/qwen2.5:latest
```
Run the complete stack using docker compose. This includes a compiled ReactJS front-end and a Python-based FastAPI back-end.
```
docker compose up --build
```

Work with RAG

Open a browser to port 3000 of the host machine: http://localhost:3000 or in EC2 http://12.34.56.78:3000/
From the web UI, upload a PDF to the application by dragging it to the "UPLOAD PDF" area of the page. Alternatively, you can upload documents to S3 using the CLI or boto3, etc.
Each new document will be parsed as part of the model's context, expanding its ability to reply to prompts related to the uploaded content.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
backend		backend
frontend		frontend
pdfs		pdfs
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
docker-compose.yml		docker-compose.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Custom RAG Application

Setup

Work with RAG

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 1

Languages

Folders and files

Latest commit

History

Repository files navigation

Custom RAG Application

Setup

Work with RAG

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 1

Languages

Packages