This project is focused on validating risk models by analyzing delivery rates and their statistical properties. It includes data processing, variance calculations, t-tests, and visualizations to assess the performance of different models and technologies.
- Overall Results Display: Summarizes total offered volume, expected value, and delivery rates.
- Delivery Rate Analysis: Calculates the difference between issued and model delivery rates.
- Variance Calculation: Computes variances of delivery rates and checks for significant differences.
- T-Test for Statistical Significance: Performs a t-test to compare delivery rates and assess whether differences are statistically significant.
- Visualization: Plots delivery rate differences and trends by technology category.
-
Variance Calculation:
- Variances of
CORE DEL RATEandISSUED DEL RATEare computed. - A comparison is made to check if the variances are significantly different.
- Variances of
-
T-Test:
- A t-test is performed to determine if the mean delivery rates (
CORE DEL RATEvs.ISSUED DEL RATE) are statistically different. - This helps validate whether the model delivery rate aligns with the issued delivery rate.
- A t-test is performed to determine if the mean delivery rates (
-
Ensure the required Python libraries are installed:
pandasmatplotlibscipy(for t-tests)
-
Ensure that the
validation.csvdataset is in the Data subfolder. -
Run the notebook to:
- View overall results.
- Analyze delivery rates, variances, and t-test results.
- Visualize the data.