Trading the Breaking

Trading the Breaking

Quant Lectures

[Quant Lecture] The Hypothesis-Testing Framework

Statistics for algorithmic traders

πš€πšžπšŠπš—πš π™±πšŽπšŒπš”πš–πšŠπš—'s avatar
πš€πšžπšŠπš—πš π™±πšŽπšŒπš”πš–πšŠπš—
Sep 22, 2025
βˆ™ Paid
1
1
Share

The Hypothesis-Testing Framework

This chapter turns estimation into a decision process: from a single backtest path to a formal, falsifiable test that separates repeatable edge from noise and supports real capital decisions. It adopts a skeptical starting point (β€œno edge”) and advances with robust test statistics, defensible p-values/alphas, confidence-interval thinking, and explicit power analysis.

What’s inside:

  1. From estimation to decision. Why measuring uncertainty (point & interval estimates) isn’t enoughβ€”and how hypothesis testing supplies the courtroom-style procedure for a go/no-go verdict.

  2. Defining the claims. How to state precise null vs. alternative hypotheses for traders: mean excess return, Sharpe, regression alpha, correlations, and model comparisons (including costs via β€œΒ΅ ≀ friction”).

  3. Test statistics done right. Distilling performance into β€œsignal Γ· noise,” using HAC/Newey–West errors and joint tests (F-tests) when factors are assessed together.

  4. From numbers to evidence. Mapping a test statistic to a p-value and choosing Ξ± as a business threshold; what p-values areβ€”and aren’t.

  5. Intervals as decisions. Duality between tests and confidence intervals; using bootstrap CIs (e.g., Sharpe) and deciding on the lower bound vs. a cost-aware hurdle.

  6. Errors and power. Type I (false edge) vs. Type II (missed edge), their costs, and the levers that set statistical power: Ξ±, effect size, variance, and sample size.

  7. Backtest length trade-off. Long samples boost power but threaten stationarity; short samples fit regime but risk underpowered conclusionsβ€”plus ways to balance the tension.

  8. Advanced testing via Likelihood Ratio. A general, nested-model framework (with χ² reference) to test structural breaks, factor redundancy, and justified model complexity.

A disciplined path from β€œlooks good” to statistically defensibleβ€”and economically relevantβ€”deployment decisions.

Check a sample of what you will find inside:

Chapter Sample
677KB βˆ™ PDF file
Download
Download

This post is for paid subscribers

Already a paid subscriber? Sign in
Β© 2025 Quant Beckman
Privacy βˆ™ Terms βˆ™ Collection notice
Start writingGet the app
Substack is the home for great culture