Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

💡[Feature]: A complete end to end RAG chatbot application using Gemini #1354

Closed
4 tasks done
pranavvb03 opened this issue Oct 10, 2024 · 3 comments
Closed
4 tasks done
Labels
enhancement New feature or request

Comments

@pranavvb03
Copy link
Contributor

Is there an existing issue for this?

  • I have searched the existing issues

Feature Description

Develop a Q & A RAG (Retrieval Augmented generation) chatbot on PDF document extraction using Langchain , sentence transformers and FAISS vector indexing with Gemini API that is capable of handling multiple pdf files and answering users queries using information present in pdf files. The model is deployed on Streamlit.

Workflow:

  • Document Parsing: Extract text from the PDF and divide it into chunks (paragraphs or sections).
  • Vector Embedding: Convert the chunks into vector embeddings using a pre-trained sentence transformer.
  • Indexing with FAISS: Store the vectorized chunks in a FAISS index for fast similarity searches.
  • User Query: Convert the user query into a vector and use FAISS to retrieve the most relevant text chunks from the document.
  • Answer Generation: Feed the retrieved chunks and the query into the Gemini API (language model) to generate a coherent answer.
  • Response: The chatbot returns the generated answer to the user.

Use Case

  • Document Analysis: Answering questions based on large reports, research papers, or legal documents.
  • Customer Support: Assisting customers by retrieving relevant sections from manuals, FAQs, or documentation.
  • Educational Tools: Providing students with answers based on textbook content or study materials.

Benefits

Efficient Retrieval: FAISS enables rapid retrieval of relevant information from large documents, making the chatbot responsive even with complex or long queries.

Add ScreenShots

No response

Priority

High

Record

  • I have read the Contributing Guidelines
  • I'm a GSSOC'24 contributor
  • I want to work on this issue
@pranavvb03 pranavvb03 added the enhancement New feature or request label Oct 10, 2024
Copy link

Thank you for creating this issue! 🎉 We'll look into it as soon as possible. In the meantime, please make sure to provide all the necessary details and context. If you have any questions reach out to LinkedIn. Your contributions are highly appreciated! 😊

Note: I Maintain the repo issue twice a day, or ideally 1 day, If your issue goes stale for more than one day you can tag and comment on this same issue.

You can also check our CONTRIBUTING.md for guidelines on contributing to this project.
We are here to help you on this journey of opensource, any help feel free to tag me or book an appointment.

@sanjay-kv
Copy link
Member

#1367

Copy link

Hello @pranavvb03! Your issue #1354 has been closed. Thank you for your contribution!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants