Skip to content

kurtjallo/Clarify-Hackville-2026

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

169 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

Clarify ๐Ÿ

Built at Hackville 2026

Clarify is a document-aware web application that helps Canadians understand complex government and legal forms in plain language. Instead of relying on generic AI summaries, Clarify uses retrieval-augmented generation (RAG) to ensure every response is grounded directly in the uploaded document.


๐ŸŒŸ Overview

Public services often fail not because they are unavailable, but because they are difficult to understand. Canadian government and legal forms are frequently long, dense, and written in institutional language that assumes prior knowledge most people do not have.

Clarify addresses this gap by turning static PDFs into interactive, understandable documents. Users upload a form and ask questions in plain language, receiving clear explanations sourced strictly from the document itself. This reduces misinterpretation, builds trust, and lowers the cognitive barrier to accessing essential services.

Clarify is designed as a gov-tech accessibility tool, not a generic chatbot.


โœจ Key Features

  • ๐Ÿ“„ Document Upload & Parsing
    Upload government or legal PDFs and convert them into structured, searchable content.

  • ๐Ÿง  RAG-Powered Chatbot
    Ask questions about a form and receive responses grounded directly in the document.

  • ๐Ÿ–๏ธ Highlight-to-Explain
    Select specific sections of text to receive simplified explanations.

  • ๐Ÿ‡จ๐Ÿ‡ฆ Canada-Focused
    Designed specifically for Canadian government and legal workflows.

  • โ™ฟ Accessibility-First Design
    Reduced cognitive load, plain language, and clarity prioritized over verbosity.

  • ๐ŸŒ— Dark Mode & Responsive UI
    Fully usable across devices and lighting conditions.


๐Ÿ“ธ Screenshots

1 2 3 4 5 6 7 8 9

๐ŸŽฏ The Problem

Canadians frequently struggle with essential forms due to:

  • Dense legal and bureaucratic language
  • Long, unstructured PDF documents
  • Ambiguous terminology and unclear requirements
  • Fear of making irreversible mistakes
  • Lack of accessible, trustworthy explanations

These barriers disproportionately affect newcomers, students, low-income individuals, and anyone unfamiliar with legal or governmental processes.


๐Ÿง  Why Retrieval-Augmented Generation (RAG)?

Using a standard chatbot for legal or government forms is risky. Hallucinated or incorrect information can have real consequences.

Clarify uses RAG to prioritize accuracy and trust:

  • Responses are generated only after retrieving relevant document sections
  • The LLM is constrained to the uploaded formโ€™s content
  • Reduces hallucinations and unsupported claims
  • Improves transparency and explainability

This makes Clarify suitable for high-stakes contexts where correctness matters more than creativity.


๐Ÿ› ๏ธ Tech Stack

Frontend

  • Next.js
  • React.js
  • TypeScript
  • Tailwind CSS
  • Zustand (state management)
  • Lucide-React (icons)

Backend

  • Node.js
  • Express.js
  • TypeScript

AI & Document Intelligence

  • Retrieval-Augmented Generation (RAG)
  • Vector embeddings for semantic document search

Database

  • MongoDB Atlas
  • MongoDB Vector Search
  • Mongoose

๐Ÿ’ก How It Works

  1. User uploads a government or legal PDF
  2. Document is parsed, chunked, and embedded
  3. Embeddings are stored in MongoDB Atlas
  4. User queries are embedded and matched semantically
  5. Relevant document sections are retrieved
  6. The LLM generates a response constrained to that context

This architecture balances usability with reliability while remaining scalable.


๐Ÿ† Achievements

  • Built a functional RAG system in under 24 hours
  • Implemented vector search using MongoDB Atlas
  • Designed for accessibility and public-service use cases
  • Delivered a polished, demo-ready gov-tech prototype
  • Demonstrated practical application of AI safety concepts

๐ŸŽ“ What We Learned

  • Implementing end-to-end RAG pipelines
  • Working with vector databases in production-like systems
  • Handling real-world PDF document complexity
  • Designing AI systems for trust and correctness
  • Rapid full-stack development under hackathon constraints

๐Ÿ”ฎ Future Improvements

  • Inline citations and source highlighting
  • Multi-language support (French priority)
  • OCR support for scanned documents
  • Document history and saved conversations
  • Stronger accessibility tooling and screen reader support

๐Ÿ‘ฅ Team


๐Ÿ™ Acknowledgments

  • Hackville 2026 organizers and mentors
  • MongoDB Atlas for enabling scalable vector search
  • Open-source tools that made rapid experimentation possible
  • Anyone who has struggled to understand essential paperwork

Built with care at Hackville 2026

Releases

No releases published

Packages

 
 
 

Contributors

Languages