Overview

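A quant researcher tests 100 hypotheses, and one shows a Sharpe ratio that looks significant at the 5% level. Did they discover alpha, or is it p-hacking, a false positive produced by multiple testing? With 100 tests at the 5% level, about five false positives are expected by chance alone, even when no strategy has any real edge. A hold-out test set detects this: if the strategy is overfit to the training data, its performance collapses on data it has never seen. Out-of-sample validation therefore exposes p-hacking.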
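
The effect can be illustrated with a small simulation. The sketch below is not a production backtest. It assumes 100 strategies whose daily returns are pure Gaussian noise (zero mean, 1% daily volatility), a zero risk-free rate, 252 trading days per year, and a split of 504 training days and 252 hold-out days. The names sharpe and simulate are hypothetical helpers introduced here for illustration. The code picks the strategy with the best in-sample Sharpe ratio and then scores that same strategy on the hold-out period.

```python
import math
import random
import statistics

TRADING_DAYS = 252  # assumed annualization factor


def sharpe(returns):
    """Annualized Sharpe ratio of daily returns (risk-free rate assumed to be 0)."""
    sd = statistics.stdev(returns)
    if sd == 0:
        return 0.0
    return statistics.mean(returns) / sd * math.sqrt(TRADING_DAYS)


def simulate(n_strategies=100, n_train=504, n_test=252, seed=0):
    """Select the best of n pure-noise strategies in-sample, then score it out-of-sample.

    Returns (best in-sample Sharpe, hold-out Sharpe of that same strategy).
    """
    rng = random.Random(seed)
    best_train, best_test = -math.inf, None
    for _ in range(n_strategies):
        # Zero-mean daily returns: by construction, no strategy has real alpha.
        rets = [rng.gauss(0.0, 0.01) for _ in range(n_train + n_test)]
        train, test = rets[:n_train], rets[n_train:]
        s_train = sharpe(train)
        if s_train > best_train:
            # Keep the hold-out score of whichever strategy wins in-sample.
            best_train, best_test = s_train, sharpe(test)
    return best_train, best_test


if __name__ == "__main__":
    train_sr, test_sr = simulate()
    print(f"best in-sample Sharpe:          {train_sr:.2f}")
    print(f"same strategy, hold-out Sharpe: {test_sr:.2f}")
```

Because every strategy here is noise, the winning in-sample Sharpe ratio is purely a selection effect. Averaged over many seeds, the hold-out Sharpe of the selected strategy should sit near zero, which is the collapse described above. A single run can still show a positive hold-out number by luck, so one hold-out pass is evidence rather than proof.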