
Conversation

Ujjwal-Bajpayee

Overview

This PR introduces a minimal Retrieval-Augmented Generation (RAG) example that integrates FAISS-based retrieval with gpt-oss models using Harmony-style prompts.

It is completely self-contained, non-invasive, and designed as an educational reference for ML engineers who want to ground open LLMs in local or private data sources.


🧠 What’s Included

New files only (no core modifications):

  • examples/rag_gpt_oss.py — main example script implementing FAISS indexing, retrieval, and Harmony prompting
  • examples/utils/harmony_helpers.py — helper functions for constructing and validating Harmony-formatted messages
  • examples/requirements-rag.txt — isolated dependencies for RAG example
  • examples/data/ — small local documents for FAISS indexing and retrieval
  • docs/examples/rag_gpt_oss.md — setup and usage guide

⚙️ Key Features

  • FAISS-based semantic search with persistent index (examples/data/.faiss/)
  • SentenceTransformer embeddings (all-MiniLM-L6-v2) for lightweight retrieval
  • Harmony-format chat construction for structured prompts
  • OpenAI-compatible endpoint via environment variables
    • OPENAI_BASE_URL
    • OPENAI_API_KEY
    • GPT_OSS_MODEL
  • Supports streaming and --no-stream inference modes
  • Automatic JSONL logging (examples/data/runs/) with metadata and latency
  • Graceful fallbacks for missing dependencies (clear CLI messages)

🧩 Example Usage

pip install -r examples/requirements-rag.txt

export OPENAI_BASE_URL="http://localhost:8000/v1"
export OPENAI_API_KEY="dummy"
export GPT_OSS_MODEL="gpt-oss-mini"

python examples/rag_gpt_oss.py --query "Explain FAISS-based vector search" --top_k 3


Expected Output:
[Assistant]
FAISS (Facebook AI Similarity Search) is a library for efficient vector similarity search...
Sources: [1] intro_vector_search.md, [2] embeddings_and_faiss.md

✅ Validation Checklist

Before submitting the PR, the following items have been verified:

  • Scope Safety: Only new files are added under examples/, examples/utils/, examples/data/, and docs/examples/
  • No Core Modifications: No changes made to pyproject.toml, core libraries, or CI configuration
  • Dependency Isolation: All example dependencies are pinned in examples/requirements-rag.txt
  • Env Vars Handling: Script checks and validates:
    • OPENAI_BASE_URL
    • OPENAI_API_KEY
    • GPT_OSS_MODEL
  • Error Handling: Clear, user-friendly errors for missing dependencies or environment variables (exit code 2)
  • Harmony Prompting: Messages constructed and validated via harmony_helpers.py
  • Retrieval Functionality: FAISS index builds, persists, and reuses correctly
  • Inference Modes: Both streaming and --no-stream work as expected
  • Logging: JSONL logs created in examples/data/runs/ with latency and metadata
  • Docs: Usage instructions and setup steps included in docs/examples/rag_gpt_oss.md
  • Formatting: Code formatted with black and checked with ruff (if available)
  • Local Testing: Script tested locally on GPU with transformers and vLLM backends
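The JSONL logging item in the checklist can be sketched as below. The field names (`ts`, `latency_s`, `sources`, and so on) are illustrative assumptions, not the PR's actual schema; only the output directory `examples/data/runs/` comes from the PR description.

```python
# Sketch of appending one run record per line to a JSONL log file.
# Field names are hypothetical; the directory matches the PR description.
import json
import time
from pathlib import Path

run_dir = Path("examples/data/runs")
run_dir.mkdir(parents=True, exist_ok=True)

record = {
    "ts": time.time(),                     # wall-clock timestamp
    "query": "Explain FAISS-based vector search",
    "model": "gpt-oss-mini",
    "latency_s": 1.23,                     # measured per-request latency
    "sources": ["intro_vector_search.md"], # retrieved chunks cited
}

# Append-mode writes keep one JSON object per line across runs.
with (run_dir / "run.jsonl").open("a") as f:
    f.write(json.dumps(record) + "\n")
```

Appending one self-contained JSON object per line keeps logs greppable and lets each run be parsed independently without loading the whole file.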


@chatgpt-codex-connector chatgpt-codex-connector bot left a comment


💡 Codex Review

Here are some automated review suggestions for this pull request.


Comment on lines +135 to +139
def retrieve(query: str, index, chunks: List[Dict], model_name: str, top_k: int) -> List[Dict]:
    model = SentenceTransformer(model_name)
    qvec = model.encode([query], normalize_embeddings=True)
    D, I = index.search(qvec, top_k)
    results = []


P1: Validate top_k before calling FAISS search

The retrieval function forwards the user-provided top_k directly to index.search without clamping it to the number of indexed chunks or ensuring it is positive. When the corpus is small (e.g., only three chunks) and the CLI is invoked with --top_k 100 or --top_k 0, faiss.IndexFlatIP.search raises a Faiss assertion 'k <= index.ntotal' failed (or similar) before any error handling runs, terminating the program instead of emitting the friendly error messages used elsewhere. Validating top_k against index.ntotal and requiring it to be > 0 would avoid the crash.
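One way to apply the suggested guard is a small validation helper called before `index.search`. This is a sketch, not the PR's code; the function name `safe_top_k` is hypothetical, and the script would translate the `ValueError` into its friendly CLI message and exit code 2.

```python
def safe_top_k(requested: int, ntotal: int) -> int:
    """Clamp a user-supplied top_k to [1, ntotal].

    Raises ValueError for non-positive values so the caller can emit
    a friendly error instead of hitting a FAISS assertion failure.
    """
    if requested <= 0:
        raise ValueError("--top_k must be a positive integer")
    return min(requested, ntotal)

# With only 3 indexed chunks, --top_k 100 is clamped rather than crashing:
print(safe_top_k(100, 3))  # -> 3
```

`index.search(qvec, safe_top_k(top_k, index.ntotal))` then never requests more neighbors than the index contains.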


@Ujjwal-Bajpayee
Author

@simonw @seratch @romainhuet @bojanbabic This PR is designed to help anyone understand how to ground gpt-oss responses in external data using a minimal RAG example.
Please review whenever you get a chance. I’ve verified that it runs locally.
