This repository is dedicated to my book 📕 𝐀𝐩𝐩𝐥𝐢𝐞𝐝 𝐃𝐚𝐭𝐚 𝐒𝐜𝐢𝐞𝐧𝐜𝐞 𝐟𝐨𝐫 𝐂𝐫𝐞𝐝𝐢𝐭 𝐑𝐢𝐬𝐤: 𝗔 𝗣𝗿𝗮𝗰𝘁𝗶𝗰𝗮𝗹 𝗚𝘂𝗶𝗱𝗲 𝗶𝗻 𝗥 𝗮𝗻𝗱 𝗣𝘆𝘁𝗵𝗼𝗻 and other topics related to credit risk modeling. It will be regularly updated with GitHub pages, slides, and PDF documents covering various modeling subjects.
The motivation behind writing this book and creating the repository stems from the observed gap between academic literature, industry practices, and the evolving landscape of data science. While there's been a notable increase in literature on credit risk modeling, discrepancies persist. The evolution of data science has led to significant automation in processes. Still, it has also brought the risk of overreliance on pre-programmed procedures, sometimes leading to the misuse of statistical methods. Moreover, many practitioners entering credit risk modeling often overlook fundamental principles, hindering their professional development. Hence, the repository aims to serve as a centralized hub for continuous education and consolidating essential concepts.
The repository and book will encompass practical examples utilizing both R
and Python
.
Below are links providing an overview of the repository's main topics, which include summaries from the book and insights gleaned from practical experience.
The Vasicek Distribution (Probability of Default Models):
- The Functional Form and Parameters Estimation Methods (pdf, presentation)
- Shiny Application for Estimating the Parameters of the Vasicek Distribution
- Asset Correlation Estimation - Maximum Likelihood: Analytical vs Numerical Optimization Approach (pdf, presentation)
- The Logistic Vasicek Distribution (pdf, presentation)
- Asset Correlation Estimation - Maximum Likelihood: Normal vs Logistic Vasicek Distribution (pdf, presentation)
- Asset Correlation Estimators and Bias Quantification (pdf, presentation)
- The Vasicek PD Model and Transition Matrices - Optimization of the Systemic Factor Z
Loss Given Default:
- Loss Given Default as a Function of the Default Rate (pdf, presentation)
- The Vasicek LGD Model - The Functional Form and Parameters Estimation Method (pdf, presentation)
- The Vasicek LGD Model - Simulating the Distribution of the Parameters (pdf, presentation)
- The Vasicek LGD Model - The Bias Quantification of the Sensitivity Parameter (pdf, presentation)
Low Default Portfolios:
- Likelihood Approaches to Low Default Portfolios - Andrija Djurovic's Adjustment of Alan Forrest's Method to the Multi-Year Period Design (pdf, presentation)
- Conservative Estimation of Default Probabilities - BCR Method (pdf presentation,
R
&Python
code)
Measuring Concentration Risk:
- Measuring Concentration Risk - A Partial Portfolio Approach (pdf presentation,
R
&Python
code) - Simulation dataset
Effective Interest Rate:
Loan Repayment Plan:
Model Development and Validation:
- IRB PD Periodic Model Validation - Quantitative Testing Procedures (pdf, presentation)
- Validation of Credit Risk Models - Does the P-value Provides Sufficient Insight? (pdf, presentation)
- Validation of Credit Risk Models - On Favorable P-values in Statistical Tests? (pdf, presentation)
- Margin of Conservatism Type C in PD Modeling - Central Tendency Uncertainty in the Presence of Autocorrelation (pdf, presentation)
- Hypothesis Testing in Credit Risk - A Visual Approach for Deeper Understanding (pdf, presentation)
- On Testing the Concentration in the Rating Grades - The Initial and Periodic PD Model Validation (pdf, presentation)
- The Binomial Tests for PD Model Validation - The Independent and Correlated Binomial Distributions (pdf, presentation)
- The Model-Based Heterogeneity Testing (pdf, presentation)
- Principal Component Analysis for IFRS9 Forward-Looking Modeling (pdf,
R
&Python
code) - Bootstrap Hypothesis Tests (html, GitHub page)
- Bootstrap Hypothesis Tests (pdf, presentation)
- Statistical Binning of Numeric Risk Factors - PD Modeling (pdf, presentation)
- Hosmer-Lemeshow VS Z-score Test on Portfolio Level (html, GitHub slides)
- Hosmer-Lemeshow VS Z-score Test on Portfolio Level (pdf, presentation)
- Power Play: Probability of Default Predictive Ability Testing (pdf, presentation)
- Nested Dummy Encoding (pdf, presentation)
- Marginal Information Value (pdf, presentation)
OLS Regression:
- Consequences of Violating the Normality Assumption for OLS Regression (pdf,
R
&Python
code) - Consequences of Heteroscedasticity for OLS Regression (pdf,
R
&Python
code) - Consequences of Multicollinearity for OLS Regression (pdf,
R
&Python
code) - Consequences of Autocorrelation for OLS Regression (pdf,
R
&Python
code)