A simple machine learning project that trains and evaluates models to predict student performance using the provided Students Performance dataset.
- File:
data/StudentsPerformance.csv(also available innotebook/data/) - Short description: student demographics and exam scores (math, reading, writing).
-
Install dependencies:
pip install -r requirements.txt
-
Reproduce experiments or train models:
- Open and run the notebooks in
notebook/(recommended for step-by-step work), or - Use the training pipeline in
src/(seesrc/components/model_trainer.pyand pipeline modules).
- Open and run the notebooks in
-
Use Docker (optional): common commands are in the
Makefile(e.g.,make build,make up).
data/— dataset filesnotebook/— EDA and model training notebookssrc/— code for ingestion, transformation, and model trainingartifacts/— generated models and outputsapp.py— small app / entrypoint
Keep it simple and reproducible. For more details, open the notebooks or check src/.