Overhaul LLM simulator: persona, RAG, student-split eval by patelfagun1998 · Pull Request #68 · aims-foundations/dynamic-irt

patelfagun1998 · 2026-05-20T20:15:42Z

Summary

Refactor run.py (net -500 lines): simplified grading/evaluation loop
Add student persona generation (persona.py) for behavioral simulation
Add retrieval-augmented generation (rag.py) for context-aware prompts
Add attempt summarization (summarize.py) for multi-attempt tracking
Add prompt strategy abstraction (prompt_strategies.py)
Add student-split simulation (run_student_split.py, student_split_loader.py) bridging to temporal_eval data splits
Add result analysis and plotting (analyze_results.py, plot_attempt_accuracy.py, power_analysis.py)
Update data_loader.py with temporal filtering, course params, CSV dedup
Update prompts.py and runners.py

Files changed (13)

llm_simulator/run.py — major refactor
llm_simulator/prompts.py — updated prompt assembly
llm_simulator/runners.py — updated model wrappers
llm_simulator/data_loader.py — temporal filtering, course param
llm_simulator/persona.py — new
llm_simulator/rag.py — new
llm_simulator/summarize.py — new
llm_simulator/prompt_strategies.py — new
llm_simulator/run_student_split.py — new
llm_simulator/student_split_loader.py — new
llm_simulator/analyze_results.py — new
llm_simulator/plot_attempt_accuracy.py — new
llm_simulator/power_analysis.py — new

Test plan

python3 -c "import llm_simulator.run" — imports clean
python3 -c "import llm_simulator.run_student_split" — imports clean
python3 -c "import llm_simulator.persona" — imports clean

- Make grounded (real student trajectory) the only execution mode - Remove prompt_strategies.py (unused) and power_analysis.py - Remove dead functions: load_eval_items, _load_trajectory, _load_zero_shot, retrieve_examples, retrieve_context, _retrieve_self_similar, summarize_history, summarize_rag_context, SUMMARY_PROMPT - Remove early_stop_patience (unused in grounded mode)

patelfagun1998 added 8 commits May 20, 2026 16:15

Overhaul LLM simulator: persona, RAG, summarization, student-split eval

4dd8fd4

Remove plot_attempt_accuracy.py (superseded by plot_filtered_accuracy)

65e3e2a

Remove analyze_results.py (not used for paper figures)

9b85aa4

Remove pricing tracking from ClaudeRunner

fab93dc

Rename run_student_split.py to eval_student_split.py, update README

47756af

Rename claude model key to opus in runners and README

635e523

Merge student_split_loader.py into data_loader.py

7d80ca4

patelfagun1998 merged commit 5aa10d1 into main May 20, 2026
3 checks passed

patelfagun1998 deleted the llm-simulator-overhaul branch May 20, 2026 21:03

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Overhaul LLM simulator: persona, RAG, student-split eval#68

Overhaul LLM simulator: persona, RAG, student-split eval#68
patelfagun1998 merged 8 commits into
mainfrom
llm-simulator-overhaul

patelfagun1998 commented May 20, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

patelfagun1998 commented May 20, 2026

Summary

Files changed (13)

Test plan

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant