Skip to content

Overhaul LLM simulator: persona, RAG, student-split eval#68

Merged
patelfagun1998 merged 8 commits into
mainfrom
llm-simulator-overhaul
May 20, 2026
Merged

Overhaul LLM simulator: persona, RAG, student-split eval#68
patelfagun1998 merged 8 commits into
mainfrom
llm-simulator-overhaul

Conversation

@patelfagun1998
Copy link
Copy Markdown
Collaborator

Summary

  • Refactor run.py (net -500 lines): simplified grading/evaluation loop
  • Add student persona generation (persona.py) for behavioral simulation
  • Add retrieval-augmented generation (rag.py) for context-aware prompts
  • Add attempt summarization (summarize.py) for multi-attempt tracking
  • Add prompt strategy abstraction (prompt_strategies.py)
  • Add student-split simulation (run_student_split.py, student_split_loader.py) bridging to temporal_eval data splits
  • Add result analysis and plotting (analyze_results.py, plot_attempt_accuracy.py, power_analysis.py)
  • Update data_loader.py with temporal filtering, course params, CSV dedup
  • Update prompts.py and runners.py

Files changed (13)

  • llm_simulator/run.py — major refactor
  • llm_simulator/prompts.py — updated prompt assembly
  • llm_simulator/runners.py — updated model wrappers
  • llm_simulator/data_loader.py — temporal filtering, course param
  • llm_simulator/persona.pynew
  • llm_simulator/rag.pynew
  • llm_simulator/summarize.pynew
  • llm_simulator/prompt_strategies.pynew
  • llm_simulator/run_student_split.pynew
  • llm_simulator/student_split_loader.pynew
  • llm_simulator/analyze_results.pynew
  • llm_simulator/plot_attempt_accuracy.pynew
  • llm_simulator/power_analysis.pynew

Test plan

  • python3 -c "import llm_simulator.run" — imports clean
  • python3 -c "import llm_simulator.run_student_split" — imports clean
  • python3 -c "import llm_simulator.persona" — imports clean

- Make grounded (real student trajectory) the only execution mode
- Remove prompt_strategies.py (unused) and power_analysis.py
- Remove dead functions: load_eval_items, _load_trajectory, _load_zero_shot,
  retrieve_examples, retrieve_context, _retrieve_self_similar,
  summarize_history, summarize_rag_context, SUMMARY_PROMPT
- Remove early_stop_patience (unused in grounded mode)
@patelfagun1998 patelfagun1998 merged commit 5aa10d1 into main May 20, 2026
3 checks passed
@patelfagun1998 patelfagun1998 deleted the llm-simulator-overhaul branch May 20, 2026 21:03
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant