LiveKit RAG Voice Agent with Qdrant

Changes in this Fork

This repository is a forked version of the original Cartesia Voice Agent example. The main enhancements include the integration of Retrieval-Augmented Generation (RAG) using OpenAI embeddings and Qdrant vector database for semantic search capability. Specifically:

Document Ingestion: Added injest.py script that processes text files, generates embeddings with OpenAI's text-embedding-3-small model, and populates the Qdrant vector database (knowledge_base).
Custom Retrieval Function: Implemented retrieve_info in main.py to enable the agent to query the vector database for exact and semantic matches, returning concise, context-aware responses.
Enhanced Logging: Improved logging throughout ingestion and retrieval processes to assist debugging and ensure visibility into internal operations.
Async Handling: Ensured all blocking I/O operations (e.g., Qdrant queries, OpenAI embedding requests) run in separate threads (asyncio.to_thread) to maintain asynchronous efficiency.

Things to improve

The thinking_messages in agent\main.py aren't being used at all. They should be used while the agent is processing the user's query.

Feel Free to contribute to this repository.

Original README (modified requirements and instructions)

This is a demo of a LiveKit Voice Pipeline Agent using Cartesia and GPT-4o-mini.

The example includes a custom Next.js frontend and Python agent.

Live Demo (original w/o RAG and Qdrant)

Live Demo

Running the example

Prerequisites

Node.js
Python 3.9-3.12
LiveKit Cloud account (or OSS LiveKit server)
Cartesia API key (for speech synthesis)
OpenAI API key (for LLM)
Deepgram API key (for speech-to-text)
Qdrant API key, Cluster_ID, and URL (for semantic search)

Frontend

Copy .env.example to .env.local and set the environment variables. Then run:

cd frontend
npm install
npm run dev

Agent

Copy .env.example to .env and set the environment variables. Then run:

cd agent
python3 -m venv venv
source venv/bin/activate
pip install -r requirements.txt

# This will ingest the document into the Qdrant Vector Database (Only to Run Once)
python injest.py

python main.py dev

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
.github		.github
agent		agent
frontend		frontend
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LiveKit RAG Voice Agent with Qdrant

Changes in this Fork

Things to improve

Original README (modified requirements and instructions)

Live Demo (original w/o RAG and Qdrant)

Running the example

Prerequisites

Frontend

Agent

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

LiveKit RAG Voice Agent with Qdrant

Changes in this Fork

Things to improve

Original README (modified requirements and instructions)

Live Demo (original w/o RAG and Qdrant)

Running the example

Prerequisites

Frontend

Agent

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages