YouTube Video Question Answering

This project allows you to perform question answering on YouTube videos by transcribing the video content and using a language model to answer questions based on the transcription.

Features

Download audio from YouTube videos.
Transcribe audio to text.
Split transcriptions into manageable chunks.
Create embeddings for document chunks.
Store documents in a vector store for efficient retrieval.
Answer questions based on the transcribed content.

Installation

To get started with this project, follow these steps:

Clone the repository:

git clone <repository-url>
cd <repository-directory>

Create a virtual environment:

python3 -m venv venv
source venv/bin/activate

Install the required dependencies:
```
pip install -r requirements.txt
```
Set up environment variables:

Create a .env file in the root directory of the project and add the following environment variables:
```
GROQ_API_KEY=<your-groq-api-key>
HUGGINGFACEHUB_API_TOKEN=<your-huggingfacehub-api-token>
```

Why Transcribe Audio This Way?

The transcription process involves converting audio to 16kHz mono and splitting it into chunks before transcribing. This approach is taken for several reasons:

Consistency and Quality: Converting audio to a standard format (16kHz mono) ensures consistent quality and compatibility with the transcription API.
Manageable Chunks: Splitting the audio into smaller chunks makes the transcription process more manageable and reduces the likelihood of errors or timeouts.

Usage

Run the Streamlit app:
```
streamlit run rag_app.py
```
Provide the YouTube video link:

When prompted, enter the YouTube link of the video you want to transcribe and analyze.
Ask questions:

After the transcription process is complete, you can start asking questions based on the transcribed content. Leave the question blank to quit the program.

How It Works

Download Audio:
- The script downloads the audio from the provided YouTube link using pytubefix.
Transcribe Audio:
- The audio is converted to 16kHz mono and split into chunks.
- Each chunk is transcribed using the Groq API.
Process Transcription:
- The transcription is saved to a file and loaded for further processing.
- The text is split into chunks for efficient retrieval.
Create Embeddings:
- Embeddings for the document chunks are created using a HuggingFace model.
Store Documents:
- The documents are stored in a vector store for efficient retrieval.
Answer Questions:
- A retrieval chain is set up to answer questions based on the transcribed content using a language model.

Dependencies

langchain
langchain[docarray]
langchain_groq
langchain_community
langchain_huggingface
docarray
pydantic==1.10.8
pytubefix
python-dotenv
tiktoken
ruff
pypdf
groq
streamlit

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.gitignore		.gitignore
README.md		README.md
rag_app.py		rag_app.py
rag_cli.py		rag_cli.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

YouTube Video Question Answering

Features

Installation

Why Transcribe Audio This Way?

Usage

How It Works

Dependencies

About

Releases

Packages

Languages

rudradeep22/Youtube-video-quiz

Folders and files

Latest commit

History

Repository files navigation

YouTube Video Question Answering

Features

Installation

Why Transcribe Audio This Way?

Usage

How It Works

Dependencies

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages