Introduction

NLP signals (sentiment, topic prevalence, event frequency) work well in some market regimes but fail in others. A sentiment signal profitable in quiet markets might fail during crises. Testing signal stability across regimes is critical before deployment.

Defining Market Regimes

Regimes: periods with distinct characteristics. Examples:

  • Bullish calm: positive sentiment, low volatility, strong earnings
  • Bullish volatile: positive sentiment, high volatility, event-driven
  • Bearish calm: negative sentiment, low volatility, recession talk
  • Crisis: extreme negative sentiment, extreme volatility, panic
Use market indicators (VIX, returns, volatility) to classify regimes.

Regime Identification Methods

1. Hidden Markov Model: discovers regimes from volatility/returns. 2. Fixed regimes: manually define (bull = S&P >50 SMA, bear otherwise). 3. Clustering: cluster days by returns, volatility, sentiment, identify clusters as regimes.

Testing Signal Performance by Regime

For each regime, compute signal correlation with returns:

  • Bullish calm: sentiment-return correlation 0.45
  • Bullish volatile: sentiment-return correlation 0.12
  • Bearish calm: sentiment-return correlation 0.38
  • Crisis: sentiment-return correlation -0.05 (no signal)
Signal effectiveness varies dramatically across regimes.

Why Signals Fail in Some Regimes

1. Regime-dependent signal relevance: in crisis, fundamental sentiment less important than liquidity/forced selling. 2. Signal saturation: in strong sentiment moves, everyone has same signal, no edge. 3. Regime-specific signals dominate: earnings matter in bullish calm, technical factors dominate in crises.

Adaptive Signal Weighting

Use regime-specific weights: sentiment signal weight = 50% in bullish calm, 10% in crisis. Dynamically adjust weights based on current regime. This increases robustness to regime shifts.

Ensemble of Regime-Specific Models

Train separate models for each regime. At each time, identify regime, use corresponding model. This handles regime-dependent relationships explicitly. Backtest shows 15-25% Sharpe improvement versus single regime-agnostic model.

Stress-Testing Signal Stability

Simulate extreme regimes (2008 crisis, 2020 COVID crash, 2022 rate shock). Does sentiment signal predict returns? If signal breaks down in tail regimes, risk model for catastrophic failure. Either improve signal or reduce position sizes during stress.

Metrics for Regime-Adjusted Performance

Instead of simple correlation, use regime-adjusted correlation: correlation per regime, then average (equal-weighted or volatility-weighted). This captures performance across typical market conditions, not just average conditions.

Empirical Case Study: Fed Communication Signal

Fed communication signal (hawkish/dovish tone) predicts bond returns:

  • Normal regime (3% interest rates): correlation 0.35
  • Low-rate regime (<1%): correlation 0.05
  • Tightening regime (rates rising): correlation 0.42
  • Crisis regime: correlation -0.08
Signal is unreliable in low-rate environments; strong in tightening.

Deployment Strategy

1. Identify current regime daily.
2. Use regime-specific weights for signals.
3. Monitor signal performance in current regime.
4. If signal breaks (correlation drops), reduce position or switch strategy.
5. Retrain regime-specific models when regime changes significantly.