Skip to content

A multi-step LangGraph workflow that retrieves, validates, and synthesizes information from user documents and web sources using tool-based search, cross-source verification, and mixed LLMs for optimized reasoning.

Notifications You must be signed in to change notification settings

SriviharReddy/Insight-Fusion-Agent

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

1 Commit
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

Insight Fusion Agent 🧠

Insight Fusion is an advanced research-automation agent designed to synthesize deep insights from a hybrid of user-uploaded documents and real-time online sources. Built on LangChain and LangGraph, it employs an iterative, self-correcting workflow to ensure retrieval accuracy, claim validation, and evidence-grounded summarization.

πŸš€ Features

  • Hybrid Context Retrieval: Seamlessly blends proprietary data (PDFs/Text) with Tavily/SerpAPI web search results.
  • Iterative Refinement: Uses a cyclic LangGraph workflow to critique and refine search queries until sufficient evidence is gathered.
  • Fact-Checking & Validation: Dedicated graph nodes effectively cross-reference claims against retrieved sources.
  • Streamlit Interface: A clean, modern UI for interacting with the agent and visualizing the synthesis process.

πŸ—οΈ Architecture

The core of the agent is a state machine defined in src/agent.py. The workflow proceeds as follows:

  1. Query Decomposition: Breaks down complex user requests into sub-questions.
  2. Hybrid Search: Fetches documents from vector store and web.
  3. Relevance Graduation: Evaluates retrieved documents for relevance.
  4. Generation & Hallucination Check: Generates an answer and verifies it against the documents.

πŸ› οΈ Usage

  1. Clone the repository.
  2. Install dependencies:
    pip install -r requirements.txt
  3. Set up your environment variables:
    cp .env.example .env
    # Add OPENAI_API_KEY, TAVILY_API_KEY etc.
  4. Run the Streamlit app:
    streamlit run app.py

πŸ“¦ Tech Stack

  • Orchestration: LangChain, LangGraph
  • LLM: GPT-4o / Claude 3.5 Sonnet
  • Vector Store: ChromaDB / FAISS
  • Frontend: Streamlit
  • Search: Tavily API

πŸ“ Roadmap

  • Add multi-modal support (charts/images).
  • Implement persistent memory/checkpointing with SQLite.
  • Add export to PDF feature for research reports.

About

A multi-step LangGraph workflow that retrieves, validates, and synthesizes information from user documents and web sources using tool-based search, cross-source verification, and mixed LLMs for optimized reasoning.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published