-
Notifications
You must be signed in to change notification settings - Fork 2
Open
Labels
priority/highHigh priority issueHigh priority issue
Description
Summary
Add student loan (SLC) balance imputation from the Wealth and Assets Survey (WAS) to the FRS, following the same pattern as existing wealth imputations in wealth.py.
WAS Data Available
Student loan balance can be derived from WAS Round 7 (April 2018 - March 2020):
Tot_LosR7_aggr(total loans) minusTot_los_exc_SLCR7_aggr(total loans excluding SLC) = SLC debt
WAS Statistics
| Metric | Value |
|---|---|
| Unweighted households with SLC debt | 701 |
| Weighted households with SLC debt | 1.66m |
| Mean balance (those with debt) | £20,028 |
| Median balance | £14,000 |
| Max | £210,000 |
| Total weighted | £33.4bn |
Comparison to Admin Statistics (~2019-20)
| Metric | WAS | Admin (SLC) |
|---|---|---|
| Borrowers/HH with debt | 1.66m HH | ~4.5m individuals |
| Total outstanding | £33.4bn | ~£140bn UK |
| Coverage | 24% | 100% |
WAS undercounts due to young graduate underrepresentation and household vs individual unit. However, it captures the distribution shape which is valuable for imputation.
Implementation
- Add to
wealth.pyRENAMES dict:
"Tot_LosR7_aggr": "total_loans",
"Tot_los_exc_SLCR7_aggr": "total_loans_exc_slc",- Derive SLC debt in
generate_was_table():
was["student_loan_balance"] = was["total_loans"] - was["total_loans_exc_slc"]-
Add
"student_loan_balance"toIMPUTE_VARIABLES -
Consider scaling imputed values to match SLC admin totals (currently £267bn for England, March 2025)
Why Needed
- FRS
student_loans(tuborr) is amount borrowed THIS YEAR by current students, not outstanding balance - 98.7% of FRS repayers have
student_loans = 0 - Without balance data, we can't cap repayments at outstanding balance (high earners get unrealistic modelled repayments)
- Needed for interest accrual calculations
Related Issues
- Add student loan repayment calibration targets #237 - Add student loan repayment calibration targets
- policyengine-uk will need a
student_loan_balancevariable to use the imputed data
Sources
Metadata
Metadata
Assignees
Labels
priority/highHigh priority issueHigh priority issue