Mind the data gap: Missingness Still Shapes Large Language Model Prognoses

This repository allows reproducing the results presented in Mind the data gap: Missingness Still Shapes Large Language Model Prognoses. In this work, we investigate the impact of missingness serialization on the zero-shot performance of LLMs.

Experimental setup

The proposed experiments consist of providing clinical data as inputs and prompting two LLMs (Qwen 3 and OSS-GPT) to predict an outcome of interest. To measure the impact of missingness, we employ two strategies to serialize the data: with and without missingness indicators in the serialized input.

To reproduce the paper's results:

Generate MIMIC-IV MEDS build and task cohorts
Use the following tutorial to construct the MIMIC-IV MEDS build and downstream task cohorts.
Follow the instructions in that repository to create the required inputs.
Create the final evaluation cohort

This step extracts the clinical measurements and formats them into the final evaluation cohort used for inference. From this repository, run:
```
python main.py --experiment mimic --mode generate_cohort
```
Run inference

Generate LLM predictions by running:
```
python main.py --experiment mimic --mode test
```

Requirements

Python 3.11
vLLM for efficient inference.

To install with conda:

conda env create -f environment.yml
conda activate vllm_env

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
configs		configs
src		src
README.md		README.md
environment.yml		environment.yml
main.py		main.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Mind the data gap: Missingness Still Shapes Large Language Model Prognoses

Experimental setup

Requirements

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

reAIM-Lab/EHR-missingness

Folders and files

Latest commit

History

Repository files navigation

Mind the data gap: Missingness Still Shapes Large Language Model Prognoses

Experimental setup

Requirements

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages