Add RAG Example using FAISS and Harmony Prompts #207
  Add this suggestion to a batch that can be applied as a single commit.
  This suggestion is invalid because no changes were made to the code.
  Suggestions cannot be applied while the pull request is closed.
  Suggestions cannot be applied while viewing a subset of changes.
  Only one suggestion per line can be applied in a batch.
  Add this suggestion to a batch that can be applied as a single commit.
  Applying suggestions on deleted lines is not supported.
  You must change the existing code in this line in order to create a valid suggestion.
  Outdated suggestions cannot be applied.
  This suggestion has been applied or marked resolved.
  Suggestions cannot be applied from pending reviews.
  Suggestions cannot be applied on multi-line comments.
  Suggestions cannot be applied while the pull request is queued to merge.
  Suggestion cannot be applied right now. Please check back later.
  
    
  
    
Overview
This PR introduces a minimal Retrieval-Augmented Generation (RAG) example that integrates FAISS-based retrieval with gpt-oss models using Harmony-style prompts.
It is completely self-contained, non-invasive, and designed as an educational reference for ML engineers who want to ground open LLMs in local or private data sources.
🧠 What’s Included
New files only (no core modifications):
examples/rag_gpt_oss.py— main example script implementing FAISS indexing, retrieval, and Harmony promptingexamples/utils/harmony_helpers.py— helper functions for constructing and validating Harmony-formatted messagesexamples/requirements-rag.txt— isolated dependencies for RAG exampleexamples/data/— small local documents for FAISS indexing and retrievaldocs/examples/rag_gpt_oss.md— setup and usage guide⚙️ Key Features
examples/data/.faiss/)all-MiniLM-L6-v2) for lightweight retrievalOPENAI_BASE_URLOPENAI_API_KEYGPT_OSS_MODEL--no-streaminference modesexamples/data/runs/) with metadata and latency🧩 Example Usage
✅ Validation Checklist
Before submitting the PR, the following items have been verified:
examples/,examples/utils/,examples/data/, anddocs/examples/pyproject.toml, core libraries, or CI configurationexamples/requirements-rag.txtOPENAI_BASE_URLOPENAI_API_KEYGPT_OSS_MODELharmony_helpers.py--no-streamwork as expectedexamples/data/runs/with latency and metadatadocs/examples/rag_gpt_oss.mdblackand checked withruff(if available)transformersandvLLMbackends