avnlp/rag-pipelines

RAG Pipelines

Datasets

We evaluate the RAG pipelines on the following datasets:

| Dataset | Description |
| --- | --- |
| HealthBench | A comprehensive benchmark for evaluating medical AI, featuring multi-turn conversations and expert assessments. |
| MedCaseReasoning | A collection of medical case studies that include detailed, step-by-step reasoning processes. |
| MetaMedQA | A medical question-answering dataset where contexts are sourced from USMLE textbooks. |
| PubMedQA | A biomedical question-answering dataset derived from abstracts in PubMed articles. |

Developing

Installing dependencies

The development environment is set up with uv. Make sure uv is installed, then run:

uv sync
source .venv/bin/activate
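
Since the commands above depend on uv being available, it can help to check for it first. The following is a minimal sketch, not part of the repository: the function name `check_uv` is hypothetical, and it only assumes a POSIX shell and the standard `uv --version` flag.

```shell
# Hypothetical helper (not part of this repository): report whether uv is on PATH.
check_uv() {
  if command -v uv >/dev/null 2>&1; then
    # uv is installed; print its version for reference.
    echo "uv found: $(uv --version)"
    return 0
  fi
  # uv is missing; tell the user what to do and signal failure.
  echo "uv not found on PATH; install it before running 'uv sync'" >&2
  return 1
}
```

One way to use it is to chain it with the setup command, for example `check_uv && uv sync`, so that the sync only runs when uv is present.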

To install the dependencies for testing (code style, unit tests, integration tests), run:

uv sync --dev
source .venv/bin/activate

To skip installing the packages of a specific dependency group (e.g. docs), run:

uv sync --no-group docs

About

Advanced RAG Pipelines and Evaluation
