RAG-space

RAG-space is a Retrieval-Augmented Generation (RAG) model that combines document retrieval and language generation to provide informative responses based on a custom knowledge base.

Project Description

This project implements a RAG model using Python, FastAPI, LangChain, Llama 2 (via Ollama), FAISS, and SentenceTransformers. It retrieves relevant information from a local knowledge base and generates responses using the Llama 2 language model.

For more detailed technical information about the project, please refer to the info.txt file.

Installation

Clone this repository:

git clone https://github.com/yourusername/RAG-space.git
cd RAG-space

Install the required Python packages:
```
pip install -r requirements.txt
```
Install Ollama: Follow the instructions at Ollama's official website to install Ollama for your operating system.
Pull the Llama 2 model using Ollama:
```
ollama pull llama2
```

Running the Server

Start the Ollama service (if not already running):
```
ollama serve
```
Run the FastAPI server:
```
python app.py
```

The server will start on http://localhost:8000.

Getting a Response

To get a response from the RAG model, send a POST request to the /generate endpoint:

curl -X POST "http://localhost:8000/generate" -H "Content-Type: application/json" -d '{"text": "What color is Mars?"}'

This will return a JSON response containing the generated answer and the retrieved documents used for context.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RAG-space

Project Description

Installation

Running the Server

Getting a Response

FilesExpand file tree

README.md

Latest commit

History

README.md

File metadata and controls

RAG-space

Project Description

Installation

Running the Server

Getting a Response