Han-co (en)

Field notes on credit & finance data science, from a data scientist working in Japan.
https://han-co.com/en

[Basics] Part 5. Ranking isn't enough: three axes for evaluating a credit model
https://han-co.com/en/blog/part5-evaluation-metrics/
How do you know whether you built a good model? In credit you don't just check whether it ranks well (discrimination). You read discrimination with AUC and PR-AUC, check whether the probabilities match reality with calibration, and check whether it holds up over time with PSI. Here are the two axes ordinary ML tends to skip.
Thu, 02 Jul 2026 00:00:00 GMT

[Review] Can Google's new tabular foundation model TabFM beat GBM in credit? I tested it on public data
https://han-co.com/en/blog/tabfm-credit/
Google's zero-shot tabular foundation model TabFM claims to beat even a well-tuned GBM with no training and no tuning. Can it actually be used on credit losses? A practitioner's review, pitting it against a carefully built GBM on public credit-card data.
Thu, 02 Jul 2026 00:00:00 GMT

[Basics] Part 4. Building a credit model: scorecards and trees
https://han-co.com/en/blog/part4-credit-modeling/
If Part 3 was about choosing a model, this piece is about actually building one. How to build a scorecard with logistic regression (WOE, IV, score scaling) and how to build one with trees (features, SHAP, monotone constraints), where the two diverge, and the reject inference and calibration you have to run no matter which model you picked.
Mon, 29 Jun 2026 00:00:00 GMT

[Deep Dive] Where do rejected applicants go? Reject inference and rejectkit
https://han-co.com/en/blog/rejectkit/
A credit model learns only from the people it approved, yet it's judged on every applicant, rejects included. I bundled eight reject-inference techniques for correcting that sample-selection bias behind one API and, more importantly, built rejectkit, a Python library that measures whether the correction actually helps on your own data. Both are now public.
Mon, 29 Jun 2026 00:00:00 GMT

[Paper] SSL falls short of GBM on credit data. But combined, it helps
https://han-co.com/en/blog/ssl-credit-risk/
Can self-supervised learning (SSL) beat GBM at credit default prediction? I ran the experiment on public data (AMEX). On its own, SSL falls short of GBM, but bolted onto GBM's features, it lifts performance by a statistically meaningful margin. And that lift was concentrated in the hidden defaults among customers GBM thought were safe.
Sun, 28 Jun 2026 00:00:00 GMT

[Basics] Part 3. Where deep learning doesn't win: machine learning for scoring
https://han-co.com/en/blog/part3-ml-for-scoring/
Credit data is tabular. And on tabular data, the winner isn't a flashy deep net; it's tree-based boosting. Here's why picking on performance lands you at a tree as the final model, why logistic regression is still in use, and why cross-validation in finance has to be done differently.
Thu, 25 Jun 2026 00:00:00 GMT

[Deep Dive] Does raising a credit limit increase defaults? A test on three public datasets
https://han-co.com/en/blog/credit-limit-debiasing/
If you raise someone's credit limit, does their probability of default go up or down? Intuition says up, but the data says the opposite: down. This post untangles that paradox with debiasing, tests it on three public datasets, and works out when the sign of the limit effect actually flips.
Mon, 22 Jun 2026 00:00:00 GMT

[Basics] Part 2. Statistics first: how to read credit data
https://han-co.com/en/blog/part2-statistics-probability/
Before you reach for machine learning, statistics comes first. In credit, you ask 'is this difference real or just noise?' far more often than 'does the model fit well?' Here's the shape of financial data, the trap of multiple testing, how to handle small samples, and the bias that's baked in by default.
Sun, 21 Jun 2026 00:00:00 GMT

[Basics] Part 1. The card business and credit risk: where underwriting models begin
https://han-co.com/en/blog/part1-credit-card-business/
What a model should optimize for comes, in the end, from the business. Here's a walk through where a card issuer earns and where it loses, how credit loss breaks into parts, and how regulation makes its way inside the model. Consider it the domain groundwork for understanding an underwriting model.
Fri, 19 Jun 2026 00:00:00 GMT

[Basics] Part 0. 7 ways finance data science differs from ordinary ML
https://han-co.com/en/blog/part0-finance-ds-7-differences/
People who are great at everything from building ML models to evaluating them still trip up when they reach credit underwriting. It isn't a skill gap: the field runs on different rules. From selection bias to regulation, here are 7 ways finance data science is structurally different from ordinary ML.
Wed, 17 Jun 2026 00:00:00 GMT