
US Index Prediction: A Multi-Index Framework for DJIA, S&P 500, and NAS100

Empirical Studies · Rahul S. P.

Abstract

A literature review and research framework for predicting US equity index movements using cross-index dynamics. We identify several unstudied research gaps including price-weighted vs cap-weighted divergence signals and trivariate cointegration regime models. Empirical phases are in progress.

Work in Progress — Phase 2 complete, Phase 3 in progress. Dual-system architecture converged. Shorts: specialist model achieves +$127,633 (PF 1.90, 59.5% WR, Run 3L). Longs: dip-buying model achieves +$4,683 (PF 1.16, 40.2% WR, Run 3N), the first profitable long model. Runs 3J through 3O established that barrier labels are fundamentally wrong for equity longs (drift is invisible to barriers), while shorts exploit recognisable panic patterns. See Sections 7.5-7.14 for full progression.

Project Roadmap

Phase   | Description                                                                 | Status
Phase 1 | Literature Review                                                           | Complete
Phase 2 | Data Collection & Feature Engineering (7 gap studies completed; see Section 6 for full results) | Complete
Phase 3 | Model Development & Backtesting (dual-system: short specialist +$127,633, PF 1.90, Run 3L; dip-buy long model +$4,683, PF 1.16, Run 3N, first profitable longs; 13 runs documented, see Sections 7.5-7.14) | In Progress
Phase 4 | Walk-Forward Validation                                                     | Planned

1. Introduction

The three dominant US equity indices — the Dow Jones Industrial Average (DJIA, traded as US30), the S&P 500 (US500), and the NASDAQ-100 (NAS100) — are often treated as interchangeable proxies for "the US stock market." In practice, they differ profoundly in construction methodology, sector composition, and constituent overlap. The DJIA is price-weighted across 30 blue-chip stocks; the S&P 500 is float-adjusted market-cap-weighted across roughly 500 companies; the NAS100 is modified market-cap-weighted across 100 non-financial firms with heavy technology exposure. These structural differences create persistent, non-trivial divergences in short-horizon returns that are largely absent from the academic literature.

Most published research on US equity index prediction treats each index in isolation: momentum strategies on the S&P 500, mean-reversion on the DJIA, or machine learning forecasts for the NASDAQ. The cross-index dimension — how information propagates between the three indices, how their spreads behave across market regimes, and whether structural differences create exploitable signals — remains substantially understudied. This is surprising given that the futures on these three indices (ES, YM, NQ) are among the most liquid instruments in the world, and that relative-value trades between them are a staple of institutional desks (CME Group, "Stock Index Spread Opportunities").

This project aims to fill that gap. We begin with a comprehensive literature review covering cross-index dynamics, multi-index trading strategies, and structural differences that create tradeable opportunities. We then identify specific research gaps — several of which appear to be entirely unstudied in the academic literature — and outline a phased research plan to test them empirically. The data constraint is deliberate: we restrict ourselves to OHLCV data at minute resolution from MetaTrader 5, ensuring that any findings are reproducible without proprietary data feeds.

2. Cross-Index Dynamics

2.1 Lead-Lag Relationships

The foundational work on lead-lag in equity markets comes from Lo and MacKinlay (1990), who documented that returns of large-capitalisation stocks lead returns of smaller stocks, attributing the effect partly to nonsynchronous trading and partly to differential speed of adjustment to information. Chordia and Swaminathan (2000) refined this finding by showing that high-volume portfolios lead low-volume portfolios at daily and weekly horizons, even after controlling for firm size. The mechanism is not purely mechanical: high-volume stocks adjust faster to market-wide information because they attract more attention from informed traders and algorithmic market makers.

In the futures-spot domain, the evidence is decisive. Stoll and Whaley (1990) found that S&P 500 and Major Market Index futures returns lead the corresponding cash indices by approximately five minutes on average, with occasional leads exceeding ten minutes. Lower transaction costs, leverage, and the ease of short-selling in futures explain why price discovery concentrates there. Hasbrouck (2003) quantified this precisely: roughly 90% of price discovery in the S&P 500 occurs in E-mini futures (information share IS = 0.89 to 0.93). For the NASDAQ-100, E-mini futures similarly dominate. The SPY ETF contributes to sector ETF price discovery, but not the reverse.

At the tick level, Huth and Abergel (2011) demonstrated that the most liquid assets lead smaller and less liquid stocks, and that the lead-lag structure is not constant intraday but shows seasonality around macroeconomic announcements and the US market open. By the early 2020s, median lead-lag durations in major equity markets have compressed to under ten milliseconds.

Despite this extensive literature on futures-spot and large-small cap lead-lag, direct studies of information flow between the three major US equity indices are sparse. Because the DJIA contains only 30 price-weighted stocks while the NAS100 is technology-heavy and the S&P 500 is broadly cap-weighted, differential information absorption speeds should exist during sector-specific news events. For instance, technology earnings may move the NAS100 first, with the signal propagating to the S&P 500 and the DJIA lagging if the relevant stocks carry low price-weighting in the Dow. This hypothesis has not been formally tested.

2.2 Correlation Structure and Regime Dependence

Engle (2002) introduced the Dynamic Conditional Correlation (DCC-GARCH) framework, which has become the standard tool for estimating time-varying correlations between financial assets. The model proceeds in two stages: univariate GARCH for each series, followed by a parsimonious correlation model on the standardised residuals. For any study of cross-index dynamics, DCC-GARCH provides the natural starting point for measuring how tightly the three indices co-move and whether that co-movement is stable.
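The two-stage structure can be sketched in a few lines of numpy. The GARCH and DCC parameters below (omega, alpha, beta, a, b) are illustrative assumptions, not estimates; a real application would fit them by quasi-maximum likelihood, e.g. with the arch package.

```python
import numpy as np

def garch11_vol(r, omega=1e-6, alpha=0.08, beta=0.90):
    """Stage 1: conditional volatility from a GARCH(1,1) filter
    with fixed (assumed, not estimated) parameters."""
    h = np.empty(len(r))
    h[0] = r.var()
    for t in range(1, len(r)):
        h[t] = omega + alpha * r[t - 1] ** 2 + beta * h[t - 1]
    return np.sqrt(h)

def dcc_correlation(returns, a=0.05, b=0.93):
    """Stage 2: DCC recursion on standardised residuals.
    returns: (T, N) array of demeaned returns; output: (T, N, N)
    time-varying correlation matrices."""
    z = np.column_stack([returns[:, i] / garch11_vol(returns[:, i])
                         for i in range(returns.shape[1])])
    S = np.corrcoef(z, rowvar=False)   # unconditional correlation target
    Q = S.copy()
    corrs = []
    for t in range(z.shape[0]):
        d = 1.0 / np.sqrt(np.diag(Q))
        corrs.append(d[:, None] * Q * d[None, :])  # rescale Q to a correlation matrix
        Q = (1 - a - b) * S + a * np.outer(z[t], z[t]) + b * Q
    return np.array(corrs)
```

Applied to the three indices, the output is a per-bar 3x3 correlation matrix whose off-diagonal paths show how tightly US30, US500, and NAS100 co-move through time.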

A critical methodological insight comes from Forbes and Rigobon (2002), who demonstrated that raw correlation coefficients are biased upward during high-volatility periods. After adjusting for this bias, they found no significant increase in unconditional correlation during the 1997 Asian crisis, the 1994 Mexican devaluation, or the 1987 US crash. What appeared to be crisis-driven contagion was in fact pre-existing interdependence made visible by elevated variance. This finding has direct implications for anyone studying cross-index correlation during stress periods: naive rolling correlations will systematically overstate the degree of regime change.

Hamilton (1989) introduced the Markov-switching model for macroeconomic time series, where model parameters depend on an unobservable regime variable that follows a first-order Markov chain. This framework underpins all subsequent regime-switching work in finance. Ang and Bekaert (2002) applied it to portfolio choice, documenting that correlations and volatilities increase in bear markets. Despite this, diversification retains value even under regime switching because the increase in correlation is not perfect.

Regarding the three indices specifically, a Nasdaq (2020) white paper documents that NAS100 correlation with DJIA and S&P 500 was weakest during the Tech Bubble and the low-volatility period of 2017, and strongest during and after the 2008 Financial Crisis. In low-volatility environments, correlations decline naturally as there is no strong macroeconomic signal forcing co-movement. Fry-McKibbin and Hsiao (2018) applied Markov-switching models to US indices and identified three regimes — tranquil, volatile, and turbulent — with the tranquil regime being most frequent, the volatile regime dominating 2008, and the turbulent regime dominating the first four months of 2020.

2.3 Sector Rotation Patterns

The three indices differ structurally in sector exposure. The DJIA tilts toward industrials, healthcare, consumer staples, and financials. The S&P 500 has approximately 30% technology, 13% healthcare, and 13% financials. The NAS100 is roughly 45% technology with significant communications and consumer discretionary exposure, but excludes financials entirely and has minimal energy and utilities representation. These are not minor differences: they mean that sector rotation directly translates into cross-index relative performance.

Barberis and Shleifer (2003) formalised this intuition in their style investing framework. They showed that investors categorise assets into styles and allocate capital at the category level rather than the individual-asset level. Assets within the same style co-move excessively; assets in different styles co-move too little relative to fundamentals. Importantly, style-level momentum and value strategies are more profitable than their asset-level counterparts. This framework maps directly onto the DJIA (value/industrial style) versus NAS100 (growth/technology style) distinction.

Moskowitz and Grinblatt (1999) found that industry momentum is highly profitable even after controlling for size, book-to-market, and individual stock momentum. The sector composition differences across the three indices create natural momentum and rotation opportunities. The 2025 to 2026 "Great Rotation" provides a real-time illustration: capital shifted from technology (NAS100 underperformed the S&P 500 by approximately 6% year-to-date in 2025) into financials, industrials, energy, and precious metals, with the DJIA outperforming as traditional sectors led.

2.4 Dispersion and Convergence Dynamics

The dispersion trading literature, reviewed by Drechsler, Moreira, and Savov (2018), documents that implied correlation among index constituents tends to exceed realised correlation. The core dispersion trade — buying straddles on individual stocks and selling straddles on the index — exploits this wedge. A study on S&P 500 constituents from 2000 to 2017 found statistically significant returns of 14.5% to 26.5% per annum after transaction costs. Dispersion trades are concave in correlation: they profit when individual stocks diverge and lose during stress periods when correlation spikes, making them inherently short the volatility of correlation.

While traditional dispersion trading operates at the single-stock versus index level, the concept extends naturally to a three-index framework. If the three indices are temporarily dislocated — for example, the NAS100 rallying while the DJIA falls — a convergence trade betting on mean-reversion of the spread exploits the same correlation premium at the index level.

2.5 Index Arbitrage and Constituent Overlap

The overlap structure between the three indices is asymmetric. All 30 DJIA stocks are constituents of the S&P 500 (100% overlap). Approximately 79 of the 100 NAS100 stocks also appear in the S&P 500. However, only six stocks appear in all three indices. Roughly 20% of DJIA weight maps to about 30% of NAS100 weight. This partial overlap means that the indices are neither independent nor identical — they share enough common constituents to co-move, but differ enough to diverge meaningfully during sector-specific events.

Greenwood and Sammon (2023) documented that the index inclusion/exclusion effect has diminished over time as passive investing has grown, but that discretionary S&P 500 deletions still beat additions by 22% in the following year. Index fund long-short rebalancing portfolios continue to earn 4.61% annualised. Each index follows its own rebalancing calendar: the S&P 500 rebalances quarterly with ad hoc additions, the DJIA changes infrequently at the committee's discretion, and the NAS100 rebalances annually in December with special rebalancing triggered when the largest stock exceeds 24% weight. These rebalancing events create predictable flow demands that can temporarily dislocate cross-index relationships.

3. Multi-Index Strategies in the Literature

3.1 Pairs and Spread Trading

Gatev, Goetzmann, and Rouwenhorst (2006) established the academic foundation for pairs trading. Using minimum-distance matching on normalised prices across the period 1962 to 2002, they found that a simple two-standard-deviation divergence trigger yielded average annualised excess returns of up to 11% for self-financing portfolios. More recently, Zhu (2024) found that trading cointegrated near-parity pairs generates 58 basis points per month after costs, with 71% convergence probability, outperforming distance-based selection methods.

Applied to index spreads, CME Group details the methodology for constructing intermarket spreads between ES, YM, and NQ futures. A trader who believes technology is overvalued relative to the broad market sells NQ and buys ES, capturing relative sector performance without directional exposure. These spreads benefit from reduced margin requirements (as low as 10% of outright) reflecting their lower risk profile.

3.2 Time-Series Momentum and Rotation

Moskowitz, Ooi, and Pedersen (2012) documented significant time-series momentum across 58 liquid instruments including equity index futures. A diversified time-series momentum (TSMOM) portfolio delivers substantial abnormal returns and performs best during extreme market moves. Applied to a three-index rotation framework — allocating to the index with the strongest trailing momentum at each rebalancing point — this is one of the most robust findings in quantitative finance, yet its specific application to DJIA/S&P 500/NAS100 rotation is untested.

Barberis and Shleifer (2003) showed that style rotation is more profitable than individual asset rotation. The DJIA-as-value versus NAS100-as-growth mapping provides a natural style rotation pair. Rothe (2023) formalised sector rotation using macroeconomic indicators to time sector ETF allocation, while Mamais (2025) showed that momentum profitability varies across sectors and time, with macroeconomic conditions predicting these shifts.

3.3 Risk-On/Risk-Off Regime Detection

Chari, Stedman, and Lundblad (2025) proposed a composite risk-on/risk-off (RORO) index using credit spreads, equity returns, implied volatility, funding liquidity, and currency/gold signals. NBER Working Paper 31907 (2023) argues for measuring RORO as a combination of risk aversion (the price of risk) and macroeconomic uncertainty (the quantity of risk). Li (2025) found that the largest negative VIX-to-S&P 500 correlation occurs when both markets are in a high-volatility state, a result directly applicable to regime-conditional hedging.

A particularly promising signal, used by practitioners but never formally studied, is the NAS100/DJIA ratio as a risk-on/risk-off indicator. When the NAS100 outperforms the DJIA, capital is flowing into growth and technology stocks, signalling risk-on conditions. When the DJIA outperforms the NAS100, capital is rotating into value and defensive sectors, signalling risk-off. The 2025 to 2026 "Great Rotation" episodes provide vivid real-time illustrations of this dynamic. Despite its widespread use on trading desks, no academic study has validated the NAS100/DJIA ratio as a regime indicator or tested whether conditioning on it improves strategy selection.

4. Research Gaps Identified

Our literature review reveals several research gaps, ranging from entirely unstudied phenomena to well-known effects that have never been rigorously validated on this specific set of instruments. We restrict attention to gaps that can be tested with OHLCV data at minute resolution — the data we have available from MetaTrader 5. The following four gaps carry the highest combination of novelty, feasibility, and practical value.

4.1 Price-Weighted vs. Cap-Weighted Divergence Signal

The DJIA is the only major US equity index that uses price-weighting. This construction methodology creates mechanical, non-fundamental divergences from cap-weighted indices around stock splits, constituent additions and deletions, and divisor adjustments. A stock split, which is economically neutral, changes a company's DJIA weight but has no effect on its S&P 500 or NAS100 weight. Passive DJIA-tracking funds must rebalance in response; S&P 500 and NAS100 trackers do not.

No published study has systematically tested this divergence as a mean-reversion trading signal. The weighting methodology difference is structural and permanent — it cannot be arbitraged away because it stems from index construction rules, not from mispricing. The divergence is directly observable as the spread between normalised US30 and US500 (or NAS100) price series, making it testable with standard OHLCV data. The planned methodology involves constructing the normalised spread, testing z-score mean-reversion entry and exit thresholds, identifying whether divergence events cluster around known structural events, and validating out of sample with walk-forward windows.
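The divergence signal described above can be prototyped directly from OHLCV closes. The window and threshold values below are placeholders for illustration, not the parameters the study will settle on.

```python
import numpy as np
import pandas as pd

def zscore_spread_signals(p1, p2, window=500, entry=2.0, exit_band=0.5):
    """Mean-reversion signals on the z-score of the normalised spread
    between two index price series (e.g. US30 vs US500)."""
    n1 = p1 / p1.iloc[0]                 # normalise both series to 1.0 at the start
    n2 = p2 / p2.iloc[0]
    spread = n1 - n2
    z = (spread - spread.rolling(window).mean()) / spread.rolling(window).std()
    signal = pd.Series(np.nan, index=spread.index)
    signal[z < -entry] = 1               # spread unusually low: long p1 / short p2
    signal[z > entry] = -1               # spread unusually high: short p1 / long p2
    signal[z.abs() < exit_band] = 0      # flat once the spread reverts
    return z, signal.ffill().fillna(0)   # hold position between entry and exit
```

The forward-fill at the end holds each position from the entry threshold until the z-score re-enters the exit band, which is the standard way to avoid re-triggering on every bar.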

4.2 Trivariate Cointegration Regime Model

Most cointegration studies in the pairs-trading literature test bivariate relationships (e.g., SPY/IWM). However, the Johansen (1991) multivariate vector error correction model (VECM) framework allows testing cointegration among all three indices simultaneously. Trivariate cointegration can reveal cointegrating vectors that no bivariate test would detect — relationships where the three-way spread mean-reverts even though no two-way spread does.

Furthermore, no study examines how trivariate cointegration stability changes across market regimes. Cointegration can break down during crisis periods or structural breaks. A Markov-switching VECM that detects regime transitions and adjusts trading rules accordingly would be a novel contribution. The planned methodology involves Johansen trace and eigenvalue tests at multiple timeframes (M5, M15, H1, D1), estimation of cointegrating vectors and error-correction speeds, and regime-switching models to detect when cointegration breaks down.

4.3 NAS100/DJIA Ratio as a Regime Indicator

As discussed in Section 3.3, the NAS100/DJIA ratio is widely used by practitioners as a risk-on/risk-off proxy, but it has never been formally validated. Zero academic studies exist. The planned empirical work will construct the ratio time series, define regimes based on the direction and magnitude of ratio changes across multiple lookback windows, and test whether regime identification predicts which index has the highest forward returns, whether momentum or mean-reversion strategies perform better in each regime, and whether volatility is expanding or contracting. The 2025 to 2026 "Great Rotation" provides a natural out-of-sample test period.
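As an illustration of the regime construction, the ratio and a simple trailing-change classification might look like the following; the lookback and the dead band are hypothetical values, not the ones the study will test.

```python
import pandas as pd

def ratio_regime(nas100, us30, lookback=20, band=0.01):
    """Label each bar risk-on / risk-off / neutral from the trailing
    change in the NAS100/DJIA ratio. `band` is a dead zone that
    filters out small, noisy moves."""
    ratio = nas100 / us30
    chg = ratio.pct_change(lookback, fill_method=None)
    regime = pd.Series("neutral", index=ratio.index)
    regime[chg > band] = "risk-on"     # growth/tech outperforming
    regime[chg < -band] = "risk-off"   # value/defensive outperforming
    return ratio, regime
```

The planned work would repeat this across several lookback windows and test whether the resulting labels predict forward returns and strategy selection.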

4.4 Cross-Index Lead-Lag at Minute Frequency

The academic lead-lag literature focuses on futures versus spot or large-cap versus small-cap stocks. No study directly measures information flow between US30, US500, and NAS100 at minute frequency, conditional on the type of move. During sector-specific events, differential absorption speeds should exist: technology earnings may move the NAS100 first, with the signal propagating to the S&P 500 and reaching the DJIA last. The planned methodology involves Granger causality tests at lags of one to ten minutes, time-varying lead-lag estimation via rolling window cross-correlation, conditioning on volatility regime and time of day, and testing whether detected lead-lag patterns are exploitable after spread costs.
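The rolling cross-correlation step can be sketched as a lag profile over minute returns; a peak at positive lag k would indicate that the first series leads the second by k minutes.

```python
import numpy as np

def lead_lag_profile(x, y, max_lag=10):
    """Cross-correlation of two return series at lags -max_lag..+max_lag.
    A peak at positive k means x leads y by k bars."""
    x = (np.asarray(x) - np.mean(x)) / np.std(x)
    y = (np.asarray(y) - np.mean(y)) / np.std(y)
    n = len(x)
    prof = {}
    for k in range(-max_lag, max_lag + 1):
        if k >= 0:
            prof[k] = float(np.mean(x[:n - k] * y[k:]))   # corr(x_t, y_{t+k})
        else:
            prof[k] = float(np.mean(x[-k:] * y[:n + k]))
    best = max(prof, key=prof.get)
    return prof, best
```

Conditioning this profile on volatility regime and time of day, as planned, amounts to computing it on the corresponding subsamples.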

4.5 Additional Gaps

Beyond the four primary gaps, our review identified several secondary opportunities:

  • DJIA stock-split event arbitrage — when a DJIA constituent splits, its index weight drops mechanically while its weight in the S&P 500 and NAS100 is unaffected, creating a multi-index relative-value window that has never been formally studied.
  • Joint multi-index Hidden Markov Model — most HMMs in the financial literature use single-index returns; a joint HMM on all three indices could capture cross-index states such as "technology-led rally," "broad selloff," "sector rotation," or "convergence."
  • Anomaly decay rates on the DJIA — calendar effects, Dogs of the Dow, and moving average crossover strategies have all weakened over time, but no meta-study quantifies the rate at which published anomalies lose their edge on this liquid blue-chip index.
  • NAS100 concentration-conditional strategy selection — whether momentum versus mean-reversion performance varies as a function of mega-cap concentration levels (Magnificent 7 weight approximately 40%) is an open question with no peer-reviewed evidence.

5. Planned Methodology

The empirical work is organised into three subsequent phases, each building on the previous.

Phase 2: Data Collection and Feature Engineering. We will collect M1 OHLCV bars for US30, US500, and NAS100 from MetaTrader 5 and CSV archives covering at least five years. Features will include normalised cross-index spreads (US30/US500, US30/NAS100, NAS100/US500), the NAS100/DJIA ratio and its rolling changes, volatility estimators (ATR, Garman-Klass, Parkinson, Yang-Zhang) for each index, rolling Johansen cointegration test statistics at multiple timeframes, and lead-lag estimates from rolling cross-correlation and Granger causality. Feature engineering will follow the same rigorous pipeline used in our gold trading research, with cache invalidation tied to feature column signatures.
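As one example of the listed volatility features, a Garman-Klass estimator over OHLC bars might look like this; the window and annualisation factor are illustrative choices.

```python
import numpy as np

def garman_klass_vol(o, h, l, c, window=20, periods_per_year=252):
    """Garman-Klass OHLC volatility estimator, annualised.
    o, h, l, c: per-bar open/high/low/close arrays."""
    log_hl = np.log(h / l)
    log_co = np.log(c / o)
    # per-bar variance estimate from the range and the open-to-close move
    var = 0.5 * log_hl ** 2 - (2 * np.log(2) - 1) * log_co ** 2
    # rolling mean of the per-bar variance, then annualise
    kernel = np.ones(window) / window
    roll = np.convolve(var, kernel, mode="valid")
    return np.sqrt(roll * periods_per_year)
```

The Parkinson and Yang-Zhang estimators follow the same pattern with different per-bar variance formulas, so all three can share one rolling pipeline.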

Phase 3: Model Development and Backtesting. We will test the four primary research gaps as standalone strategies: z-score mean-reversion on the price-weighted/cap-weighted divergence, trivariate VECM spread trading with regime-conditional entry and exit, NAS100/DJIA ratio as a regime filter for momentum versus mean-reversion selection, and cross-index lead-lag exploitation at minute frequency. Each strategy will be evaluated against a buy-and-hold baseline with realistic transaction costs (MT5 spreads of 1 to 3 points for US30, 0.5 to 1 point for US500 and NAS100).

Phase 4: Walk-Forward Validation. All strategies that show promise in Phase 3 will undergo walk-forward out-of-sample testing with expanding or rolling training windows. We will report Sharpe ratios, maximum drawdowns, profit factors, and statistical significance via bootstrap. Any strategy that fails to outperform buy-and-hold after costs in the walk-forward test will be documented as a negative result.

6. Phase 2: Empirical Gap Studies

Seven empirical gap studies were conducted to test the research questions identified in Section 4. Studies are presented in order of increasing complexity, from simple single-index strategies to multi-index structural models, with a final Granger causality validation study bridging Phase 2 and Phase 3.

6.1 Gap Study #8: IBS/RSI Mean-Reversion Replication

Objective

The first empirical study in Phase 2 replicates two of the most cited OHLCV-only mean-reversion strategies on US equity indices: the Internal Bar Strength (IBS) strategy from Pagonidis (2014) and the RSI(2) strategy from Connors and Alvarez (2009). Both strategies are tested on US30, US500, and NAS100 using daily bars from MetaTrader 5 with realistic CFD spread costs applied to every round-trip. The purpose is to establish whether these well-known edges survive transaction costs on MT5 CFDs before building more complex models on top of them.

Simulated Results Disclaimer. All results below are from historical backtests on MT5 CFD daily bars with spread costs deducted on every entry. They do not account for slippage, overnight financing, or execution latency. Past performance does not predict future results.

Full-Sample Results (Literature Parameters)

The IBS strategy enters long when the Internal Bar Strength $\text{IBS} = (\text{Close} - \text{Low}) / (\text{High} - \text{Low})$ falls below 0.20 and exits the next trading day. The RSI(2) strategy enters long when the two-period RSI drops below 5 and holds for five trading days. Both use the exact parameter values from their respective publications.
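For reference, the two indicators can be computed from OHLCV bars as follows; this is a sketch, with `rsi` using Wilder-style exponential smoothing.

```python
import numpy as np
import pandas as pd

def ibs(high, low, close):
    """Internal Bar Strength: where the close sits within the bar's range."""
    rng = (high - low).replace(0, np.nan)   # guard against zero-range bars
    return (close - low) / rng

def rsi(close, period=2):
    """RSI with Wilder-style exponential smoothing; period=2 for Connors/Alvarez."""
    delta = close.diff()
    gain = delta.clip(lower=0).ewm(alpha=1 / period, adjust=False).mean()
    loss = (-delta.clip(upper=0)).ewm(alpha=1 / period, adjust=False).mean()
    rs = gain / loss
    return 100 - 100 / (1 + rs)

def entries(high, low, close):
    """Entry flags matching the literature parameters used in this study."""
    return pd.DataFrame({
        "ibs_long": ibs(high, low, close) < 0.20,
        "rsi_long": rsi(close, 2) < 5,
    })
```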

IBS (buy < 0.20, sell > 0.80, hold 1 day)

Index  | Trades | Win Rate | Profit Factor | Total Points | Buy & Hold Points
US30   | 360    | 49.4%    | 1.15          | +8,764       | +19,167
US500  | 547    | 50.3%    | 1.26          | +2,846       | +4,055
NAS100 | 603    | 49.4%    | 1.25          | +12,516      | +18,027

RSI(2) < 5, hold 5 days

Index  | Trades | Win Rate | Profit Factor | Total Points | Buy & Hold Points
US30   | 47     | 57.4%    | 1.48          | +7,243       | +19,167
US500  | 61     | 67.2%    | 1.64          | +1,501       | +4,055
NAS100 | 62     | 59.7%    | 1.45          | +4,959       | +18,027

Both strategies are profitable in-sample across all three indices, but neither comes close to matching buy-and-hold returns. IBS captures roughly 46% to 70% of buy-and-hold points depending on the index, while RSI(2) captures 27% to 38%. The RSI(2) strategy shows higher win rates and profit factors but trades far less frequently (47 to 62 trades versus 360 to 603 for IBS).

Walk-Forward Out-of-Sample Results

To test robustness, both strategies were evaluated using a nine-fold walk-forward framework with expanding training windows. At each fold, the strategy parameters were re-optimised on the training window and evaluated on the subsequent out-of-sample period.
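The expanding-window fold construction can be sketched as follows; the initial training fraction is an assumption for illustration, and the study's actual split sizes may differ.

```python
def expanding_walk_forward(n, n_folds=9, min_train_frac=0.3):
    """Yield (train_slice, test_slice) pairs for an expanding-window
    walk-forward over n observations. The first training window covers
    min_train_frac of the sample; each fold's test window becomes part
    of the next fold's training window."""
    first_train = int(n * min_train_frac)
    fold_len = (n - first_train) // n_folds
    for i in range(n_folds):
        train_end = first_train + i * fold_len
        test_end = n if i == n_folds - 1 else train_end + fold_len
        yield slice(0, train_end), slice(train_end, test_end)
```

At each fold, parameters are re-optimised on the training slice and the strategy is scored on the out-of-sample test slice only.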

Strategy | Folds Beating Buy & Hold | OOS Beat Rate
IBS      | 2 / 9                    | 22%
RSI(2)   | 3 / 9                    | 33%

Neither strategy beats buy-and-hold consistently out of sample. Walk-forward optimal parameters are unstable across folds, suggesting that the in-sample edge is partially an artefact of parameter fitting rather than a stable structural signal.

Key Findings

  1. Pagonidis's 75% IBS win rate does not replicate. We observe approximately 50% across all three indices. The discrepancy likely reflects differences in instrument (equities versus CFDs), cost assumptions, and sample period.
  2. RSI(2) shows a genuine but weak signal. Win rates of 55 to 67% are consistent with Connors and Alvarez (2009) but the edge is too thin to overcome buy-and-hold on a trending asset class.
  3. US500 is the worst venue for both strategies. Higher relative spread costs on the S&P 500 CFD eat the thin mean-reversion edge more aggressively than on US30 or NAS100.
  4. Walk-forward parameters are unstable. Optimal IBS and RSI thresholds shift substantially across folds, indicating that the strategies are fitting noise rather than capturing a stable structural signal.
  5. Negative results are informative. These findings confirm that the research agenda should focus on the novel cross-index gaps identified in Section 4 (spread dynamics, cointegration, regime detection) rather than on single-index mean-reversion at daily frequency.
  6. Verdict: FAIL. Daily mean-reversion on MT5 CFDs does not outperform buy-and-hold. IBS replication failed (50% win rate versus Pagonidis's reported 75%). RSI(2) replication is partial (genuine but weak signal, insufficient after costs). Neither strategy passes walk-forward validation.

Charts

Figure 1. Summary comparison of IBS and RSI(2) strategies across US30, US500, and NAS100. Neither strategy matches buy-and-hold returns.
Figure 2. US30 IBS and RSI(2) equity curves and trade distributions.
Figure 3. US500 IBS and RSI(2) equity curves and trade distributions. US500 shows the weakest performance due to higher relative spread costs.
Figure 4. NAS100 IBS and RSI(2) equity curves and trade distributions.

6.2 Gap Study #4: Cross-Index Momentum Rotation

Objective

The second empirical study tests whether cross-index momentum rotation can outperform static buy-and-hold allocation across the three US equity indices. This directly addresses the gap identified in Section 3.2: time-series momentum (Moskowitz, Ooi, and Pedersen, 2012) is one of the most robust findings in quantitative finance, yet its specific application to US30/US500/NAS100 rotation has never been tested. We evaluate four rotation strategies against four buy-and-hold baselines over a common period of August 2020 to March 2026 (approximately 5.5 years).

Simulated Results Disclaimer. All results below are from historical backtests on MT5 CFD daily bars with spread costs deducted on every entry. They do not account for slippage, overnight financing, or execution latency. Past performance does not predict future results.

Strategies and Baselines

Four rotation strategies were tested, all using daily close prices for the three indices:

  • Top-1 Momentum: At each rebalancing date, allocate 100% to the index with the highest trailing return over the lookback window.
  • Top-2 Momentum: Allocate 50% each to the two indices with the highest trailing returns.
  • TSMOM (Time-Series Momentum): For each index independently, go long if its trailing return over the lookback window is positive, otherwise go to cash. Equal-weight across indices with positive momentum. If all three have negative momentum, hold 100% cash.
  • Long-Short: Go long the top-momentum index and short the bottom-momentum index at each rebalancing date.

Lookback periods of 1, 3, 6, and 12 months were tested with both weekly and monthly rebalancing frequencies. The optimal configuration was selected on the full sample and validated via walk-forward out-of-sample testing.
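The TSMOM allocation rule described above can be sketched as follows; a 21-trading-day lookback approximates one month, and Friday rebalancing is an assumption about the implementation.

```python
import numpy as np
import pandas as pd

def tsmom_weights(prices, lookback=21, rebalance="W-FRI"):
    """TSMOM allocation: equal weight across indices whose trailing return
    over `lookback` bars is positive; 100% cash when none are.
    prices: DataFrame of daily closes with a DatetimeIndex."""
    momentum = prices.pct_change(lookback, fill_method=None)
    positive = momentum > 0                      # NaN warm-up counts as flat
    rebal_dates = prices.resample(rebalance).last().index
    pos_at_rebal = positive.reindex(rebal_dates, method="ffill")
    n_pos = pos_at_rebal.sum(axis=1)
    weights = pos_at_rebal.div(n_pos.replace(0, np.nan), axis=0).fillna(0.0)
    # hold weights constant between rebalancing dates
    return weights.reindex(prices.index, method="ffill").fillna(0.0)
```

Rows that sum to zero are the cash periods: all three trailing returns are negative and the strategy is fully out of the market.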

Baseline Performance

Baseline                | Ann. Return | Sharpe Ratio | Max Drawdown
Buy & Hold US30         | 9.9%        | 0.67         | -21.8%
Buy & Hold US500        | 13.1%       | 0.78         | -24.9%
Buy & Hold NAS100       | 15.0%       | 0.69         | -35.4%
Equal Weight (1/3 each) | 12.9%       | 0.75         | -26.4%

NAS100 buy-and-hold delivers the highest annualised return (15.0%) but at the cost of the deepest drawdown (-35.4%). The equal-weight portfolio smooths some of this volatility but does not beat the best single index. US500 has the best risk-adjusted return among the buy-and-hold baselines (Sharpe 0.78).

Full-Sample Results

The table below reports the best configuration for each strategy family (selected by Sharpe ratio). TSMOM with a 1-month lookback and weekly rebalancing is the clear winner.

Strategy       | Lookback | Rebalance | Trades | Ann. Return | Sharpe | Max DD
Top-1 Momentum | 1 month  | Weekly    | 148    | 12.3%       | 0.71   | -28.1%
Top-2 Momentum | 1 month  | Weekly    | 134    | 13.8%       | 0.84   | -22.7%
TSMOM          | 1 month  | Weekly    | 108    | 16.0%       | 1.27   | -9.4%
Long-Short     | 1 month  | Weekly    | 156    | 2.1%        | 0.18   | -31.2%

TSMOM delivers 16.0% annualised with a Sharpe ratio of 1.27, roughly 1.6 times the best buy-and-hold baseline (US500 at 0.78) and 1.5 times the best cross-sectional rotation strategy (Top-2 at 0.84). Its maximum drawdown of -9.4% is less than half of any buy-and-hold baseline and roughly one-quarter of NAS100 buy-and-hold (-35.4%).

The long-short strategy fails decisively, earning only 2.1% annualised with a Sharpe of 0.18 and the worst drawdown in the table. This is consistent with a known property of cross-sectional momentum at small $N$: the bottom-ranked index tends to mean-revert rather than continue declining, making the short leg a drag on performance.

Why TSMOM Works: Crash Protection

TSMOM's edge is not in picking the best index during bull markets. Its edge is almost entirely in crash protection. When trailing returns for all three indices turn negative, TSMOM moves to 100% cash. This mechanism avoided the majority of the 2022 drawdown (when all three indices fell 20 to 35%) and the sharp corrections in late 2023 and early 2025. The allocation timeline chart (Figure 8) shows this clearly: TSMOM spends roughly 15 to 20% of the sample period in cash, and those cash periods coincide with the deepest drawdowns in the buy-and-hold baselines.

Short lookback (1 month) combined with weekly rebalancing is optimal because it detects the onset of drawdowns quickly. Longer lookbacks (3, 6, 12 months) are slower to react and suffer larger drawdowns before switching to cash. Monthly rebalancing underperforms weekly for the same reason: delayed reaction to regime changes.

Walk-Forward Out-of-Sample Validation

The TSMOM strategy (1-month lookback, weekly rebalancing) was validated using a two-fold walk-forward framework. TSMOM beats the equal-weight baseline in both folds (100% beat rate).

| Fold | Period | TSMOM Return | TSMOM Sharpe | Equal-Weight Return | Equal-Weight Sharpe |
| --- | --- | --- | --- | --- | --- |
| Fold 0 | 2020-08 to 2023-05 | +35.0% | 2.35 | +28.7% | 0.91 |
| Fold 1 | 2023-05 to 2026-03 | +2.2% | 0.27 | +1.8% | 0.12 |

Fold 0 covers the post-COVID recovery through mid-2023 and shows strong outperformance (Sharpe 2.35 versus 0.91). Fold 1 covers the more challenging 2023 to 2026 period and shows modest outperformance (Sharpe 0.27 versus 0.12). The strategy beats the baseline in both folds, but the edge is substantially weaker in the more recent period. This is consistent with the observation that TSMOM's primary edge is crash avoidance: Fold 0 contains the 2022 drawdown (where going to cash was highly valuable), while Fold 1 has shallower corrections.
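The fold bookkeeping behind these numbers can be sketched as below. This shows only the two-fold split and the reported metrics; the in-sample parameter-selection step of the walk-forward protocol is omitted, and all names are illustrative:

```python
import pandas as pd

def two_fold_metrics(daily_returns: pd.Series) -> dict:
    """Split a daily strategy-return series into two contiguous folds and
    report annualised return and Sharpe ratio for each."""
    mid = daily_returns.index[len(daily_returns) // 2]
    folds = {"fold0": daily_returns.loc[:mid],
             "fold1": daily_returns.loc[mid:].iloc[1:]}
    out = {}
    for name, r in folds.items():
        ann_return = (1 + r).prod() ** (252 / len(r)) - 1   # geometric annualisation
        sharpe = r.mean() / r.std() * 252 ** 0.5            # daily Sharpe, annualised
        out[name] = {"ann_return": ann_return, "sharpe": sharpe}
    return out
```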

Key Findings

  1. TSMOM is the first strategy to beat all baselines. At 16.0% annualised with Sharpe 1.27 and -9.4% max drawdown, it dominates every buy-and-hold benchmark and the equal-weight portfolio on both absolute and risk-adjusted metrics.
  2. The edge is in crash protection, not stock picking. TSMOM moves to cash when trailing returns are negative, avoiding the bulk of major drawdowns. During bull markets, it performs roughly in line with equal-weight allocation.
  3. Short lookback plus frequent rebalancing is optimal. A 1-month lookback with weekly rebalancing reacts quickly to regime changes. Longer lookbacks and less frequent rebalancing suffer larger drawdowns before adapting.
  4. Long-short fails at small $N$. With only three indices, the bottom-ranked index tends to mean-revert rather than continue falling, making the short leg a consistent drag. This contrasts with the broader TSMOM literature where diversification across dozens of instruments smooths the short leg.
  5. Walk-forward validates the result, with caveats. TSMOM beats equal-weight in 2/2 folds (100%), but the edge is concentrated in the fold containing the 2022 drawdown. In benign markets, the advantage narrows substantially.
  6. This validates pursuing harder cross-index gaps. The positive TSMOM result confirms that cross-index signals contain exploitable structure, motivating the remaining gap studies (spread dynamics, cointegration, regime detection) identified in Section 4.
  7. Verdict: PASS. TSMOM with 1-month lookback and weekly rebalancing delivers Sharpe 1.27 (1.7x the best buy-and-hold) with -9.4% max drawdown. Validated out of sample in both walk-forward folds.

Charts

Full-sample equity curves: TSMOM vs buy-and-hold baselines
Figure 5. Full-sample equity curves: TSMOM vs buy-and-hold baselines. TSMOM (green) delivers the highest terminal value with the shallowest drawdowns, primarily by moving to cash during the 2022 correction.
Sharpe ratio by lookback period and rebalancing frequency
Figure 6. Sharpe ratio by lookback period and rebalancing frequency. Short lookbacks (1 month) dominate across all strategy families, with weekly rebalancing consistently outperforming monthly.
Walk-forward out-of-sample performance by fold
Figure 7. Walk-forward out-of-sample performance by fold. TSMOM beats equal-weight in both folds, with the strongest outperformance in Fold 0 (which contains the 2022 drawdown).
TSMOM allocation timeline showing index rotation
Figure 8. TSMOM allocation timeline showing index rotation over the full sample. Grey bands indicate cash periods where all three indices had negative trailing momentum. These cash periods coincide with the deepest drawdowns in the buy-and-hold baselines.

6.3 Gap Study #2: NAS100/DJIA Risk-On/Risk-Off Indicator

Objective

The NAS100/DJIA price ratio is widely cited as a proxy for risk appetite. When the ratio rises, technology-heavy NAS100 is outperforming value-heavy DJIA, which practitioners interpret as a "risk-on" environment. The hypothesis is that this ratio, smoothed over a trailing window, can serve as an allocation signal: overweight NAS100 during risk-on regimes and rotate into DJIA during risk-off regimes. This study tests whether the RORO ratio adds value beyond the TSMOM strategy established in Gap Study #4.

Ratio Construction and Regime Definition

The RORO ratio is computed as NAS100 daily close divided by US30 daily close. A regime label is assigned at each date: "risk-on" when the ratio is above its N-day simple moving average, and "risk-off" when below. Lookback windows of 5, 10, 21, 42, and 63 trading days were tested.
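As a sketch, the regime labelling reduces to a ratio and a moving-average comparison. The pandas layout below is an assumption; dates where the moving average is still warming up are left unlabelled:

```python
import pandas as pd

def roro_regime(nas100: pd.Series, us30: pd.Series, lookback: int = 21) -> pd.Series:
    """Label each date 'risk-on' when the NAS100/US30 close ratio sits above
    its N-day simple moving average, 'risk-off' when it sits at or below."""
    ratio = nas100 / us30
    sma = ratio.rolling(lookback).mean()
    regime = pd.Series(pd.NA, index=ratio.index, dtype="object")
    regime[ratio > sma] = "risk-on"
    regime[ratio <= sma] = "risk-off"
    return regime
```

Sweeping `lookback` over {5, 10, 21, 42, 63} reproduces the window grid tested in this study.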

Forward Return Predictability

Using a 21-day lookback to define regimes, we measured the hit rate of the ratio as a directional predictor at multiple forward horizons. The results are asymmetric. Risk-on regimes correctly predict NAS100 outperforming US30 with hit rates between 53% and 63%, peaking at 62.7% at the 63-day forward horizon. Risk-off regimes, however, fail to predict US30 outperforming NAS100, with hit rates below 50% at all horizons tested.

This asymmetry means the ratio is better described as a NAS100 momentum signal than as a balanced risk-on/risk-off indicator. When the ratio is rising, NAS100 tends to keep outperforming. When the ratio is falling, there is no reliable tendency for DJIA to take the lead.

Volatility by Regime

The strongest finding from this study is in volatility, not returns. Risk-off regimes (ratio below its moving average) exhibit 20 to 28% higher realised volatility than risk-on regimes, and this holds across all three indices and all lookback windows tested. This is a reliable and economically meaningful regime distinction. Even though the ratio does not reliably predict which index will outperform during risk-off, it does predict that volatility will be elevated regardless of which index you hold.

Allocation Strategy Results

Four families of allocation strategies were tested across all lookback windows. The table below shows the best configuration from each family alongside the TSMOM benchmark from Gap Study #4.

| Strategy | Lookback | Ann. Return | Sharpe | Max DD | Notes |
| --- | --- | --- | --- | --- | --- |
| TSMOM (Study #4) | 1 month | 16.0% | 1.27 | -9.4% | Benchmark |
| Contrarian RORO | 5 days | 15.5% | 0.79 | -22.4% | 393 switches, fragile |
| Follow Blend | 21 days | 12.8% | 0.76 | -27.5% | |
| Follow RORO | 42 days | 12.3% | 0.71 | -26.2% | |
| RORO + TSMOM | 21 days | 8.9% | 0.67 | -18.6% | Combination underperforms pure TSMOM |

No RORO-based strategy beats TSMOM on a risk-adjusted basis. The closest competitor is the contrarian configuration with a 5-day lookback, which achieves a higher raw return than most RORO variants but at the cost of 393 regime switches over the sample, a Sharpe ratio of 0.79 (versus 1.27 for TSMOM), and a maximum drawdown of -22.4% (versus -9.4%). The RORO + TSMOM combination actually underperforms pure TSMOM, suggesting that the RORO signal adds noise rather than complementary information to the momentum signal.

Simulated Results Disclaimer. All backtests use daily OHLCV data from MT5 CFDs over the period 2019 to 2026. Returns are gross of transaction costs beyond the embedded CFD spread. Past performance does not indicate future results.

Walk-Forward Out-of-Sample Validation

The Follow RORO strategy (42-day lookback) was validated using the same two-fold walk-forward framework as Gap Study #4. Follow RORO beats the equal-weight baseline in both folds (Fold 0: Sharpe 1.05, Fold 1: Sharpe 0.47), confirming that the signal contains some genuine information out of sample. However, it still trails TSMOM substantially. For comparison, TSMOM achieved a Sharpe of 2.35 in Fold 0 and 0.27 in Fold 1.

Key Findings

  1. The ratio is asymmetrically predictive. Risk-on regimes correctly predict NAS100 outperformance at 53 to 63% hit rates. Risk-off regimes fail to predict DJIA outperformance at any horizon. The ratio is a NAS100 momentum signal, not a balanced regime indicator.
  2. The strongest use case is volatility forecasting. Risk-off regimes show 20 to 28% higher realised volatility across all instruments and lookback windows. This is consistent, robust, and potentially useful for position sizing and risk management even if the directional signal is weak.
  3. As an allocation signal, RORO underperforms pure TSMOM. The best RORO strategy (Contrarian, 5-day) achieves a Sharpe of 0.79, versus 1.27 for TSMOM. Combining RORO with TSMOM degrades rather than improves performance.
  4. Practical use: supplementary signal, not primary allocator. The RORO ratio has three plausible applications that do not require it to beat TSMOM as a standalone strategy: volatility-based position sizing (reduce size during risk-off), TSMOM tiebreaker (when momentum signals conflict across indices), and drawdown management (tighten stops during risk-off regimes).
  5. Verdict: MIXED. Valid regime indicator (20-28% higher vol in risk-off), but not a superior allocation signal. Every RORO configuration underperforms TSMOM on Sharpe ratio and maximum drawdown. Retained as a supplementary signal.

Charts

NAS100/US30 ratio with risk-on/risk-off regime shading
Figure 9. NAS100/US30 ratio with risk-on/risk-off regime shading. The ratio trends upward over the full sample, reflecting NAS100's structural outperformance of DJIA. Risk-off regimes (shaded) cluster around drawdown periods.
Forward return predictability by lookback and horizon
Figure 10. Forward return predictability by lookback and horizon. The asymmetry is visible: risk-on hit rates (upper rows) reach 63%, while risk-off hit rates (lower rows) remain near or below 50%.
RORO allocation strategy equity curves vs baselines
Figure 11. RORO allocation strategy equity curves vs baselines. All RORO variants trail the TSMOM benchmark (green) established in Gap Study #4.
Walk-forward OOS performance comparison
Figure 12. Walk-forward out-of-sample performance comparison. Follow RORO beats equal-weight in both folds but trails TSMOM in both.

6.4 Gap Study #5: Volatility Regime Strategy Selection

Objective

The three prior gap studies produced a puzzle. Mean-reversion (Study #8) failed outright. TSMOM (Study #4) succeeded with Sharpe 1.27. The RORO ratio (Study #2) reliably identified high-volatility regimes but did not beat TSMOM as an allocation signal. This study asks the natural follow-up question: what if the right strategy is not a single rule applied uniformly, but a different sub-strategy selected by the prevailing volatility regime? The hypothesis is that some strategies that fail in aggregate may work in specific regimes, and that conditioning on volatility state can recover hidden edges.

Methodology

Volatility is measured using the Garman-Klass estimator over a trailing 21-day window. At each date, the current GK volatility is classified into one of three regimes (Low, Medium, High) using expanding-window percentile thresholds. Because the percentiles are computed only on data available up to that date, there is no lookahead bias. The test then evaluates which sub-strategy performs best within each regime. The candidate sub-strategies are: time-series momentum (TSMOM, from Study #4), mean-reversion (IBS-based, from Study #8), buy-and-hold, and cash. Eight meta-strategy combinations were tested, each assigning a different sub-strategy to each of the three volatility buckets.
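A minimal sketch of the regime classifier follows, assuming daily OHLC columns in a pandas DataFrame. The tercile cutoffs are illustrative; the study states only that expanding-window percentile thresholds were used:

```python
import numpy as np
import pandas as pd

def gk_vol_regimes(ohlc: pd.DataFrame, window: int = 21,
                   pcts=(1 / 3, 2 / 3)) -> pd.Series:
    """Garman-Klass volatility with expanding-percentile regime labels.

    ohlc needs open/high/low/close columns. Thresholds at each date use
    only data observed up to that date, so there is no lookahead bias."""
    hl = np.log(ohlc["high"] / ohlc["low"])
    co = np.log(ohlc["close"] / ohlc["open"])
    gk_var = 0.5 * hl**2 - (2 * np.log(2) - 1) * co**2    # daily GK variance
    vol = gk_var.rolling(window).mean().pow(0.5)          # trailing realised vol
    lo = vol.expanding().quantile(pcts[0])                # expanding percentile cuts
    hi = vol.expanding().quantile(pcts[1])
    regime = pd.Series("Medium", index=vol.index, dtype="object")
    regime[vol <= lo] = "Low"
    regime[vol > hi] = "High"
    regime[vol.isna()] = pd.NA
    return regime
```

Each meta-strategy then maps the three labels to sub-strategies, e.g. `{"Low": buy_and_hold, "Medium": buy_and_hold, "High": tsmom}` for the US30 template.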

Simulated Results Disclaimer. All results below are from historical backtests on MT5 CFD daily bars with spread costs deducted on every entry. They do not account for slippage, overnight financing, or execution latency. Past performance does not predict future results.

Strategy Performance by Volatility Regime

The results reveal a clear pattern that differs by instrument. For US30 and US500, the same template holds: buy-and-hold wins in low-volatility regimes (Sharpe 0.67 for US30, 1.85 for US500), while TSMOM wins in high-volatility regimes (Sharpe 1.38 for US30, 1.02 for US500). This is consistent with the TSMOM finding from Study #4, which showed that TSMOM's edge is primarily in crash protection. Low-vol periods are calm trending markets where being long is the right trade; high-vol periods are where momentum's ability to go flat preserves capital.

NAS100 is the outlier. In low-volatility regimes, buy-and-hold dominates (Sharpe 2.14), which is unsurprising given NAS100's strong secular trend. In medium-volatility regimes, however, mean-reversion takes the lead (Sharpe 0.70). And in high-volatility regimes, mean-reversion wins again (Sharpe 0.99). This is a striking rehabilitation of a strategy that failed completely in Study #8 when applied without regime conditioning.

Mean-reversion rehabilitation: the edge was hidden by regime mixing. The IBS mean-reversion strategy that produced negative results in Gap Study #8 delivers a Sharpe of 0.99 when applied specifically to high-volatility NAS100 regimes. The overall failure was not because the signal lacked predictive power, but because applying it uniformly across all volatility states diluted the high-vol edge with noise from low-vol and medium-vol periods where the signal does not work. This validates the RORO finding from Study #2 (volatility regimes matter) and operationalises it as a concrete strategy selection rule.

Best Meta-Strategy by Instrument

The best-performing meta-strategy for each instrument, selected by in-sample Sharpe ratio:

US30 uses the "buy-and-hold in low vol, TSMOM in high vol" template (bh_low_mom_high), returning 5.7% annualised with a Sharpe of 0.58. US500 uses the same template, returning 10.2% annualised with a Sharpe of 0.87. NAS100 uses the opposite pattern (mom_low_mr_high, meaning TSMOM in low vol, mean-reversion in high vol), returning 20.3% annualised with a Sharpe of 0.92 and a maximum drawdown of -18.4%.

The NAS100 result is notable for delivering the highest raw return of any strategy tested in this series. It trails TSMOM on risk-adjusted terms (0.92 vs 1.27 Sharpe) but provides a meaningfully different return profile, concentrating its edge in volatile periods where TSMOM moves to cash.

Walk-Forward Out-of-Sample Validation

Walk-forward testing confirms the same pattern observed in Study #4: the meta-strategies beat buy-and-hold in 100% of bear-market folds but trail in bull-market folds. This is the familiar crash-protection signature. The regime-conditioned approach does not add a new source of edge beyond what TSMOM already captures; rather, it confirms that the volatility dimension is the mechanism through which TSMOM works and shows that mean-reversion can participate in that same mechanism for NAS100.

Updated Strategy Leaderboard

Across all four gap studies, the cumulative ranking by risk-adjusted performance is:

  1. TSMOM (Gap Study #4): Sharpe 1.27, -9.4% max drawdown. Still the best risk-adjusted strategy. Its crash-protection mechanism is now better understood as a volatility regime response.
  2. NAS100 mom_low_mr_high (this study): 20.3% annualised return, Sharpe 0.92, -18.4% max drawdown. The highest raw return of any strategy tested, driven by mean-reversion working in high-vol NAS100 regimes.
  3. US500 bh_low_mom_high (this study): 10.2% annualised return, Sharpe 0.87. A clean implementation of the "be long in calm markets, follow momentum in volatile markets" template.

Key Findings

  1. Strategy failure can be regime-specific, not absolute. Mean-reversion was dismissed after Study #8 as non-viable at daily frequency on MT5 CFDs. That conclusion was correct in aggregate but masked a regime-conditional edge. The signal works in high-volatility NAS100 environments where price overreactions are larger and more likely to revert.
  2. Volatility regime is the common thread. All four studies converge on the same mechanism. TSMOM works because it avoids high-vol drawdowns. The RORO ratio works as a volatility identifier. Mean-reversion works within high-vol regimes. The unifying insight is that strategy selection conditioned on realised volatility captures most of the exploitable structure in daily US index returns.
  3. Instrument-specific behaviour matters. NAS100 responds to mean-reversion in high-vol regimes while US30 and US500 respond to momentum. This likely reflects NAS100's higher beta and more pronounced overreaction-reversal pattern during volatile periods, consistent with its technology-heavy composition and the flow dynamics studied in Gap Study #2.
  4. Risk-return tradeoffs remain. The highest-return strategy (NAS100 mom_low_mr_high at 20.3%) comes with nearly double the drawdown of TSMOM (-18.4% vs -9.4%). There is no free lunch; the regime-conditioned approach trades better returns for larger peak losses.

Charts

Sharpe ratio by strategy and volatility regime across instruments
Figure 13. Sharpe ratio by strategy and volatility regime across instruments. The divergence between NAS100 (where mean-reversion leads in high vol) and US30/US500 (where TSMOM leads in high vol) is clearly visible.
NAS100 volatility regime classification and strategy equity curves
Figure 14. NAS100 volatility regime classification and strategy equity curves. The mean-reversion sub-strategy (orange) gains ground during shaded high-vol periods where buy-and-hold and TSMOM both struggle.
US30 volatility regime analysis
Figure 15. US30 volatility regime analysis. TSMOM dominates high-vol regimes while buy-and-hold leads during calm periods.
US500 volatility regime analysis
Figure 16. US500 volatility regime analysis. The same pattern as US30: buy-and-hold in low vol, TSMOM in high vol.

6.5 Gap Study #1: Price-Weighted vs Cap-Weighted Divergence

Objective

This is the highest-novelty study in the series. The DJIA is price-weighted; the S&P 500 and NAS100 are capitalisation-weighted. When these weighting schemes disagree on direction, the log-ratio spread between them widens. No published academic study has systematically tested whether extreme divergences in this spread are mean-reverting and tradeable. The hypothesis is that the spread reflects transient dislocations rather than permanent structural shifts, and that entering when the spread reaches extreme Z-scores should capture a reversion to the mean.

Spread Construction

The spread is defined as the log-ratio between US30 and a capitalisation-weighted index: log(US30) minus log(US500), and separately log(US30) minus log(NAS100). Taking logs ensures the spread is symmetric and interpretable as a percentage divergence. A rolling Z-score is computed over a configurable lookback window to normalise the spread for time-varying levels. Entry occurs when the Z-score exceeds a threshold (long the lagging index, short the leading index), and exit occurs when the Z-score reverts below a separate exit threshold.
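The entry/exit logic is a small state machine over the rolling Z-score. The sketch below mirrors the thresholds in the text; the sign convention (short the spread when Z is high) and all names are implementation assumptions:

```python
import numpy as np
import pandas as pd

def spread_positions(us30: pd.Series, capw: pd.Series, lookback: int = 126,
                     z_entry: float = 2.5, z_exit: float = 0.0) -> pd.Series:
    """Fade extreme Z-scores of the log-ratio spread log(US30) - log(capw).

    Position +1 = long US30 / short the cap-weighted leg; -1 = the reverse.
    Enter when |Z| exceeds z_entry, exit once Z reverts through z_exit."""
    spread = np.log(us30) - np.log(capw)
    z = (spread - spread.rolling(lookback).mean()) / spread.rolling(lookback).std()
    pos, state = pd.Series(0.0, index=z.index), 0.0
    for t, zt in z.items():
        if np.isnan(zt):
            state = 0.0
        elif state == 0.0 and zt > z_entry:
            state = -1.0                      # spread rich: short the US30 leg
        elif state == 0.0 and zt < -z_entry:
            state = 1.0                       # spread cheap: long the US30 leg
        elif state == -1.0 and zt <= z_exit:
            state = 0.0                       # reverted through exit level: flat
        elif state == 1.0 and zt >= -z_exit:
            state = 0.0
        pos.loc[t] = state
    return pos
```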

Stationarity Testing

The Augmented Dickey-Fuller test on the full-sample spread fails to reject the unit root null hypothesis (p = 0.69 for US30/NAS100). The estimated half-life of mean reversion is approximately 320 to 349 days depending on the pair. This is a critical negative finding: the spread is not stationary over the full sample. It drifts, reflecting genuine structural shifts in the relative performance of price-weighted versus capitalisation-weighted indices (e.g., the technology sector's growing dominance in capitalisation-weighted indices). Any mean-reversion strategy on this spread must contend with the fact that the "mean" itself is non-stationary.
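The half-life figure quoted above comes from a standard AR(1) calculation, sketched here as a textbook reconstruction rather than the study's code: regress the spread's first difference on its lagged level, then convert the slope to a half-life.

```python
import numpy as np
import pandas as pd

def mean_reversion_half_life(spread: pd.Series) -> float:
    """Half-life from the OLS fit of Delta(spread) on the lagged spread.

    Fits d_t = a + b * s_{t-1}, so the AR(1) coefficient is phi = 1 + b and
    the half-life is -ln(2) / ln(phi). A half-life in the hundreds of days,
    as found here, signals a spread too slow to trade as mean-reversion."""
    df = pd.concat([spread.shift(1), spread.diff()], axis=1).dropna()
    x, y = df.iloc[:, 0].to_numpy(), df.iloc[:, 1].to_numpy()
    b = np.polyfit(x, y, 1)[0]          # slope of the AR(1) drift regression
    return float(-np.log(2) / np.log(1.0 + b))
```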

Full-Sample Results

Despite the non-stationarity, extreme Z-score entries do capture short-horizon reversion. The best configuration for US30/NAS100 uses a Z-score entry threshold of 2.5, an exit threshold below 0.0, and a 126-day lookback window. This produces 9 trades with a 100% win rate, a profit factor of 999 (effectively infinite, as there are zero losing trades), a Sharpe ratio of 1.08, and an annualised return of 7.6%. The US30/US500 pair is weaker, with a Sharpe of 0.78 under its best configuration.

The obvious concern is statistical power. Nine trades over a multi-year sample is far too few to draw confident conclusions about the strategy's true edge. A 100% win rate on 9 trades is consistent with genuine edge but also consistent with luck. The result should be read as "promising but unproven" rather than "validated."

Simulated Results Disclaimer. All results below are from historical backtests on MT5 CFD daily bars with spread costs deducted on every entry. They do not account for slippage, partial fills, or margin constraints. Trade counts are very low (9 trades in the best configuration), making all performance statistics statistically fragile. These results should not be interpreted as evidence of a reliable trading edge.

Walk-Forward Out-of-Sample Results

Walk-forward validation reveals regime dependence. Both pairs lose in Fold 0 (covering 2022, a period of strong secular trends driven by the Federal Reserve tightening cycle) and win in Fold 1 (covering 2024, a period of oscillation and rotation). The pattern is consistent with what we would expect from a mean-reversion strategy applied to a non-stationary spread: it works when the spread oscillates around a relatively stable level and fails when the spread trends directionally for extended periods.

Key Findings

  1. The spread is not stationary. The ADF test fails to reject the unit-root null (p = 0.69) and the half-life is 320 to 349 days. This reflects genuine structural shifts in the relative composition of price-weighted and capitalisation-weighted indices, not transient noise.
  2. Short-horizon mean-reversion exists at extreme Z-scores. Win rates of 75% to 100% are observed at Z-score thresholds of 2.0 and above, but the number of trades is very low (single digits), making these statistics unreliable.
  3. US30/NAS100 is the stronger pair. Sharpe 1.08 versus 0.78 for US30/US500. This makes sense: the construction difference between price-weighted and technology-heavy capitalisation-weighted is larger than between price-weighted and broad capitalisation-weighted.
  4. Out-of-sample results are mixed. The strategy is regime-dependent, winning in oscillating markets and losing during secular trends. This is not surprising given the non-stationarity finding, but it limits practical applicability.
  5. Market-neutral with zero beta. Because the strategy is always long one index and short another, it has essentially zero exposure to the broad equity market. This makes it a potential diversifier for portfolios that already hold directional equity exposure.
  6. Does not beat TSMOM. The best spread configuration (Sharpe 1.08) narrowly trails TSMOM (Sharpe 1.27) and does so with far fewer trades and weaker statistical support. TSMOM remains the benchmark to beat in this series.
  7. Academic contribution stands regardless of trading viability. To our knowledge, this is the first systematic empirical test of mean-reversion in the price-weighted versus capitalisation-weighted divergence. The negative stationarity result and the regime-dependent out-of-sample performance are themselves novel findings that fill a gap in the literature.
First Systematic Test. We are not aware of any published academic study that formally tests mean-reversion in the log-ratio spread between price-weighted (DJIA) and capitalisation-weighted (S&P 500, NAS100) indices. The CME Group's "Stock Index Spread Opportunities" whitepaper describes the trade conceptually but provides no backtested results. The stationarity failure (ADF p = 0.69, half-life ~320 days) and regime-dependent OOS performance documented here appear to be new to the literature.

Charts

US30/NAS100 log-ratio spread with Z-score bands
Figure 17. US30/NAS100 log-ratio spread with Z-score bands. The spread drifts over time, consistent with the ADF stationarity failure. Extreme Z-score excursions are rare but tend to revert within weeks.
Sharpe ratio heatmap across Z-entry and lookback parameters
Figure 18. Sharpe ratio heatmap across Z-entry and lookback parameters. The best performance concentrates at high Z-score thresholds (2.0+) with medium lookback windows (63 to 126 days), but the surface is sparse due to low trade counts.
Spread strategy equity curves vs baselines
Figure 19. Spread strategy equity curves versus baselines. The spread strategy's flat periods reflect the long waits between extreme Z-score entries. TSMOM's smoother equity curve reflects its higher trade frequency and directional flexibility.
Walk-forward out-of-sample fold comparison
Figure 20. Walk-forward out-of-sample fold comparison. Fold 0 (2022, trending) produces losses; Fold 1 (2024, oscillating) produces gains. The regime dependence is visually clear.

6.6 Gap Study #3: Trivariate Cointegration Regime Model

Objective

Gap #3 in the literature review (Section 4) asked whether trivariate cointegration testing across US30, US500, and NAS100 would reveal hidden equilibrium relationships that pairwise tests miss. The hypothesis was that the Johansen trace test on the three-index system would uncover a second cointegrating vector invisible to two-variable Engle-Granger tests, and that fading deviations from this vector (the error-correction term, or ECT) would produce a tradeable signal, especially when conditioned on volatility regimes from Gap Study #5.

Methodology

We applied two complementary cointegration frameworks to daily log-price series for US30, US500, and NAS100 over the full sample period (January 2020 to December 2025).

Johansen trace and max-eigenvalue tests were run on the trivariate system with lag order selected by AIC. These test for the number of linearly independent cointegrating relationships (the cointegration rank) in the three-index system.

Pairwise Engle-Granger tests were run on all three index pairs (US30/US500, US30/NAS100, US500/NAS100) as a baseline to determine whether any trivariate structure existed beyond what pairwise tests already capture.

Rolling stability analysis used 252-day rolling windows to track how the cointegration rank evolves over time, testing whether the equilibrium relationship is persistent or transient.

ECT fade strategy: When the Johansen procedure identifies a cointegrating vector, the ECT measures how far the system has drifted from equilibrium. We constructed a trading signal that fades extreme ECT deviations (entering when the Z-scored ECT exceeds a threshold and exiting on mean reversion). We tested this both unfiltered and filtered by the Garman-Klass volatility regimes from Gap Study #5.

Walk-forward validation used the same two-fold expanding-window protocol as the previous studies, with in-sample parameter selection and strictly out-of-sample evaluation.

Simulated Results Disclaimer. All results below are from historical backtests on MT5 CFD daily bars with spread costs deducted on every entry. They do not account for slippage, partial fills, or margin constraints. The cointegrating vectors are estimated in-sample and may not persist out-of-sample, as the walk-forward results confirm. These results should not be interpreted as evidence of a reliable trading edge.

Cointegration Test Results

The Johansen trace test finds rank = 1, with a trace statistic of 31.30 against a 5% critical value of 29.80. This barely rejects the null of rank = 0, meaning there is marginal evidence for one cointegrating relationship in the trivariate system. The max-eigenvalue test, which is more conservative, does not reject rank = 0. The two tests disagree, which is itself a signal that the cointegration is weak and sample-dependent.

Pairwise Engle-Granger tests tell a clearer story. US30/US500 is cointegrated (p = 0.002) and US30/NAS100 is cointegrated (p = 0.031), both at conventional significance levels. US500/NAS100 is not cointegrated (p = 0.203). This means the pairwise tests already identify the two pairs that drive the single Johansen vector. There is no hidden trivariate relationship that pairwise tests miss. The central hypothesis of this study is disproven.

Rolling Stability

Rolling 252-day Johansen tests reveal that even the single cointegrating relationship is highly unstable. Cointegration of rank 1 or higher is present in only 28.6% of rolling windows. In the remaining 71.4% of the sample, the three indices show no cointegrating relationship at all. The cointegration that does appear concentrates in specific regimes (primarily the 2020-2021 recovery period and brief windows in late 2023) and vanishes during trend-dominated periods.

This instability is not surprising in hindsight. The NAS100 experienced a tech-driven boom through late 2021 followed by a sharp correction in 2022, then a second AI-driven surge in 2023-2024. These structural shifts in the NAS100's relationship to the other indices mean that any cointegrating vector estimated in one period is unreliable in the next.

ECT Fade Strategy Results

The ECT fade strategy produces a best unfiltered Sharpe ratio of 0.28 across all parameter combinations. This is well below the TSMOM benchmark of 1.27 from Gap Study #4 and below the meta-strategy Sharpe of 0.92 from Gap Study #5.

Regime filtering, which improved results in Gap Study #5, makes the ECT strategy worse. The best regime-filtered Sharpe ratio is 0.06. The reason is that the ECT signal and the volatility regime are correlated: extreme ECT deviations tend to occur during the same high-volatility periods that the regime filter flags as trading windows. Filtering removes the few trades that had any reversion, leaving only noise.

Walk-Forward Out-of-Sample Results

Walk-forward validation confirms that the in-sample Sharpe of 0.28 does not survive out-of-sample. Fold 1 produces a return of -18.9% unfiltered and -11.6% regime-filtered. Both represent catastrophic losses. The cointegrating vector estimated during the 2020-2022 training window is simply invalid for the 2023-2025 test window, because the structural relationships between the indices shifted.

Key Findings

  1. Trivariate cointegration exists but is marginal. The Johansen trace test barely rejects rank = 0 (31.30 vs 29.80 critical value) and the max-eigenvalue test does not reject at all. The two tests disagree, indicating weak and sample-dependent cointegration.
  2. Pairwise tests were sufficient. The central hypothesis that trivariate testing would reveal hidden equilibrium vectors not visible in pairwise tests is disproven. US30/US500 and US30/NAS100 are individually cointegrated; US500/NAS100 is not. The Johansen vector simply combines these two known pairwise relationships.
  3. Cointegration is unstable. Rolling analysis shows cointegration absent in 71.4% of the sample. The equilibrium relationship is transient, not structural.
  4. The ECT signal is not tradeable. The best unfiltered Sharpe of 0.28 is far below the TSMOM benchmark (1.27) and below every other strategy tested in this series except raw mean-reversion from Gap Study #8.
  5. Regime filtering makes it worse. Unlike Gap Study #5, where volatility conditioning recovered hidden edges, here it degrades the Sharpe from 0.28 to 0.06. The ECT and volatility regime signals are redundant rather than complementary.
  6. Out-of-sample failure is catastrophic. Walk-forward losses of -18.9% confirm that the cointegrating vector is not stable enough to trade. The structural shift driven by NAS100's tech boom and AI surge invalidates vectors estimated in earlier periods.
  7. Verdict: FAIL. Trivariate cointegration does not reveal hidden structure beyond pairwise tests, and the ECT signal is not tradeable. Walk-forward validation produces catastrophic losses.

Charts

Rolling cointegration rank and ECT Z-score over time
Figure 21. Rolling cointegration rank and ECT Z-score over time. The cointegration rank fluctuates between 0 and 1, with rank 1 present in only 28.6% of rolling windows. ECT Z-score excursions are large but occur during periods where the cointegrating vector is itself unstable.
Pairwise vs trivariate cointegration test comparison
Figure 22. Pairwise vs trivariate cointegration test comparison. The pairwise Engle-Granger p-values (US30/US500 at 0.002, US30/NAS100 at 0.031) clearly identify the cointegrated pairs. The trivariate Johansen test adds no information beyond what pairwise tests already reveal.

6.7 Gap Study #10: Granger Causality Feature Validation

Objective

The 45 features specified for the Phase 3 model (Section 7.2) were selected on theoretical grounds and empirical gap-study results. Before passing them to the model, we apply a formal statistical test: does each feature Granger-cause the target variable (forward 60-minute returns) beyond what past returns alone predict? A feature that fails this test may still be useful to a nonlinear model, but one that passes provides independent frequentist evidence of predictive content.

Methodology

For each feature $x_j$ and each lag $\ell \in \{1, 5, 15, 30, 60\}$ minutes, we estimate two OLS regressions on the training period (2021-07 to 2025-06):

Restricted: $r_{t+60} = \alpha + \sum_{k=1}^{\ell} \beta_k\, r_{t-k} + \varepsilon_t$

Unrestricted: $r_{t+60} = \alpha + \sum_{k=1}^{\ell} \beta_k\, r_{t-k} + \sum_{k=1}^{\ell} \gamma_k\, x_{j,t-k} + \varepsilon_t$

The Granger (1969) F-test compares the residual sum of squares of the two models. Under the null $H_0: \gamma_1 = \cdots = \gamma_\ell = 0$, the test statistic follows an $F(\ell,\, T - 2\ell - 1)$ distribution. With 45 features $\times$ 5 lags = 225 tests per index, we apply Bonferroni correction at $\alpha = 0.05 / 225 \approx 2.2 \times 10^{-4}$ to control the family-wise error rate. No validation data is used at any point.
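The restricted/unrestricted comparison above can be sketched in pure numpy. This is an illustrative implementation under one simplification: the forward 60-minute return is taken as the cumulative sum of 1-minute returns over the horizon. The function name `granger_f_test` is ours, not from the paper's codebase.

```python
import numpy as np

def granger_f_test(r, x, ell, horizon=60):
    """F-test: does feature x Granger-cause the forward `horizon`-bar
    return beyond `ell` lags of past returns?  Pure-numpy OLS sketch."""
    r, x = np.asarray(r, float), np.asarray(x, float)
    T = len(r)
    # Target: forward cumulative return at each usable timestep t.
    y = np.array([r[t + 1:t + 1 + horizon].sum()
                  for t in range(ell, T - horizon)])
    idx = np.arange(ell, T - horizon)
    # Lagged regressors aligned with y (lags 1..ell, strictly causal).
    R_lags = np.column_stack([r[idx - k] for k in range(1, ell + 1)])
    X_lags = np.column_stack([x[idx - k] for k in range(1, ell + 1)])
    ones = np.ones((len(y), 1))

    def rss(design):
        beta, *_ = np.linalg.lstsq(design, y, rcond=None)
        resid = y - design @ beta
        return resid @ resid

    rss_r = rss(np.hstack([ones, R_lags]))          # restricted model
    rss_u = rss(np.hstack([ones, R_lags, X_lags]))  # unrestricted model
    n, k_u = len(y), 1 + 2 * ell
    # F((rss_r - rss_u)/ell over rss_u/(n - k_u)) under H0: gammas = 0.
    return ((rss_r - rss_u) / ell) / (rss_u / (n - k_u))
```

A feature that truly leads forward returns produces a large F relative to an unrelated series; the Bonferroni threshold would then be applied to the corresponding p-value.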

Simulated Results Disclaimer. All results below are from statistical tests on historical M1 bar data from MT5 CFDs over the training period (2021-07 to 2025-06). Granger causality is a linear test and does not guarantee nonlinear predictive power or trading profitability.

Results

Summary of results:

| Index | Tests | Significant (Bonferroni) | % |
| --- | --- | --- | --- |
| US30 | 225 | 120 | 53% |
| US500 | 225 | 115 | 51% |
| NAS100 | 225 | 94 | 42% |

Over half the feature–lag combinations are statistically significant for US30 and US500 after conservative multiple-testing correction. NAS100 is slightly lower, consistent with its higher idiosyncratic noise from concentrated technology exposure.

Top features by F-statistic (consistent across all three indices):

| Rank | Feature | F-stat (US30) | F-stat (US500) | F-stat (NAS100) |
| --- | --- | --- | --- | --- |
| 1 | ret_60m | > 2600 | > 2600 | > 2600 |
| 2 | dist_ma_290 | > 1500 | > 1500 | > 1500 |
| 3 | dist_ma120 | > 1450 | > 1450 | > 1450 |
| 4 | trend_strength | ~165 | ~165 | ~165 |
| 5 | ret_120m | ~143 | ~138 | ~130 |

All five are own-instrument features from Group 1 (core price dynamics). The dominance of ret_60m is expected: the target is forward 60-minute returns, and the autoregressive component of returns at this horizon is well-documented. The two moving-average distance features capture trend persistence at different time scales.

Features significant in all three indices (24 of 45):

abs_dist_ma120, brent_ret_60m, channel_width, constituent_dispersion, cross_idx_dispersion, dist_ma120, dist_ma_290, kurt_240m, momentum_regime, msft_ret_60m, ret_120m, ret_60m, roro_ratio, roro_vs_sma21, skew_240m, stdev60, trend_strength, tsmom_idx3_21d, tsmom_self_21d, vol_30m, vol_of_vol_60, vol_regime_ratio, vol_session_ratio, vol_surprise.

This set spans all five feature groups: core price dynamics (Group 1), volatility and higher moments (Group 2), cross-index signals from the gap studies (Group 3), cross-asset features (Group 4), and microstructure proxies (Group 5). The cross-index features (cross_idx_dispersion, roro_ratio, roro_vs_sma21, tsmom signals) all pass, confirming that the Phase 2 gap study findings survive formal causality testing.

Features not significant on any index after Bonferroni correction:

er60, tod_sin, tod_cos, ibs, gk_vol_pctile, session_flag, dxy_corr_30, and several individual constituent returns. The time-of-day features (tod_sin, tod_cos, session_flag) are deterministic functions of the clock and contain no stochastic information about returns. IBS and gk_vol_pctile are bounded indicators that operate conditionally (IBS predicts only within specific volatility regimes, as shown in Gap Study #8). The log-spread features (log_spread_us30_us500, log_spread_us30_nas100) were borderline, consistent with the slow mean-reversion documented in Gap Study #1.

Key Findings

  1. Majority of features pass Granger causality. Over 50% of feature-lag combinations are significant after Bonferroni correction for US30 and US500, and 42% for NAS100. The feature set carries genuine linear predictive content for forward 60-minute returns.
  2. Own-instrument features dominate. The top 5 features by F-statistic are all from Group 1 (core price dynamics), with ret_60m and the moving-average distance features showing the strongest causal signal across all three indices.
  3. Cross-index features validated. All Phase 2 gap-study-derived features (cross_idx_dispersion, roro_ratio, roro_vs_sma21, tsmom signals) pass the Granger test, confirming that the empirical gap study findings survive formal causality testing.
  4. Non-significant features retained as VSN validation. Features that fail Granger causality were deliberately retained as a validation mechanism for the Variable Selection Network. If the VSN works correctly, it should independently learn to downweight these features. The Run 1 training results (Section 7.5) confirm this: log_spread_us30_us500 (not Granger-causal) received the lowest VSN attention, while the top Granger-causal features received the highest. This correspondence provides independent validation that the VSN is working as intended.

Charts

US30 Granger F-statistic vs VSN attention weight scatter plot
Figure 23. US30: Granger F-statistic vs VSN attention weight. Features with stronger causal signal receive higher learned attention.
US500 Granger F-statistic vs VSN attention weight scatter plot
Figure 24. US500: Granger F-statistic vs VSN attention weight. The same pattern holds — VSN attention tracks Granger causality.
NAS100 Granger F-statistic vs VSN attention weight scatter plot
Figure 25. NAS100: Granger F-statistic vs VSN attention weight correspondence.

7. Phase 3: Neural Net Model Development

7.1 Data Inventory

This section documents the data available for model development. All three index models share a common training window, cross-asset feature set, and chronological train/validation split. The binding constraint on the common window is META, whose M1 data begins on 2021-06-30.

Common Training Window

| Parameter | Value |
| --- | --- |
| Window | 2021-07-01 to 2026-03-17 (~4.7 years) |
| Binding constraint | META (starts 2021-06-30) |
| Bar frequency | M1 (1-minute OHLCV) |
| Source | MT5 CFD data + Databento XNAS backfill (TLT, META) |

Target Indexes

Each model predicts the forward 60-minute return using a double-barrier label (up/down/hold).

| Instrument | Full Span | M1 Rows |
| --- | --- | --- |
| US30 (DJIA) | 2020-08 to 2026-03 | 1,982,699 |
| US500 (S&P 500) | 2018-05 to 2026-03 | 2,743,872 |
| NAS100 (Nasdaq 100) | 2018-05 to 2026-03 | 2,792,656 |

Cross-Asset Instruments

The following instruments provide cross-asset features for all three models.

| Instrument | Full Span | M1 Rows | Feature Use |
| --- | --- | --- | --- |
| VIX | 2018-05 to 2026-03 | 760,033 | Fear gauge, vol regime |
| DXY (Dollar Index) | 2018-12 to 2026-03 | 2,194,608 | Dollar strength |
| USDJPY | 2008-09 to 2026-03 | 2,133,765 | Carry trade / risk proxy |
| BTCUSD | 2017-06 to 2026-03 | 2,325,662 | Risk appetite proxy |
| XAUUSD (Gold) | 2018-05 to 2026-03 | 2,802,955 | Safe haven flow |
| BRENT (Crude Oil) | 2016-01 to 2026-03 | 1,839,566 | Energy / inflation proxy |
| TLT (20Y+ Treasury Bond ETF) | 2018-05 to 2026-02 | 971,662 | Bond proxy, equity/bond rotation |

Constituent Stocks

The top 5 constituents per index provide 60-minute returns as features and intra-index dispersion measures. Several stocks appear in multiple index models.

| Index | Top 5 Constituents |
| --- | --- |
| US30 | GS, MSFT, HD, CAT, V |
| NAS100 | AAPL, MSFT, NVDA, AMZN, GOOG |
| US500 | AAPL, MSFT, NVDA, AMZN, META (binding constraint) |

AAPL, MSFT, NVDA, and AMZN appear in both the NAS100 and US500 constituent sets. MSFT also appears in the US30 set, making it the only stock present across all three models.

Train / Validation Split

All splits are strictly chronological with no overlap. No data from the validation set is used during training or hyperparameter selection.

| Split | Period | Duration | Share |
| --- | --- | --- | --- |
| Train | 2021-07-01 to 2025-06-30 | 4.0 years | 83% |
| Validation | 2025-07-01 to 2026-03-17 | ~8.5 months | 17% |

The validation set includes the 2025 tariff volatility regime; the real out-of-sample test is live execution on MT5.

Data Quality Notes

  • All files are clean M1 bars, verified via interval analysis (no duplicate timestamps, no gaps exceeding expected market closures).
  • Missing minutes in lower-volume stocks reflect thin liquidity during off-peak hours, not data errors. These gaps are expected and handled during feature construction.
  • Stock constituents only trade 13:30 to 20:00 UTC (US cash session). Outside these hours, constituent features are forward-filled from the last available bar.
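The forward-fill described in the last bullet amounts to carrying the last observed value across gaps. A minimal causal sketch (`ffill` is an illustrative helper, not the project's code):

```python
import numpy as np

def ffill(values):
    """Forward-fill NaNs with the last observed value.  Causal: only
    past bars are used, matching the constituent-feature handling."""
    v = np.asarray(values, dtype=float).copy()
    last = np.nan
    for i in range(len(v)):
        if np.isnan(v[i]):
            v[i] = last          # no observation yet: leading NaNs stay NaN
        else:
            last = v[i]
    return v
```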

7.2 Feature Specification

Each model receives approximately 45 features per M1 bar, organised into five groups. Every feature is justified either by Phase 1 literature or by Phase 2 empirical results. The prediction target is the forward 60-minute return, encoded via double-barrier labelling (up / down / hold).

Group 1: Own-Instrument Core (18 features)

These features are proven predictors from the XAUUSD base model, adapted for equity indices. They capture returns, volatility structure, trend quality, distribution shape, and time-of-day cyclicality.

| Feature | Formula / Definition | Rationale |
| --- | --- | --- |
| ret_60m | $\ln(p_t / p_{t-60})$ | Recent return momentum |
| ret_120m | $\ln(p_t / p_{t-120})$ | Medium-horizon return |
| dist_ma120 | $(p_t - \text{MA}_{120}) / \text{MA}_{120}$ | Signed distance from 2h MA |
| dist_ma290 | $(p_t - \text{MA}_{290}) / \text{MA}_{290}$ | Signed distance from session MA |
| stdev60 | $\sigma(\text{ret}_{1m}, w{=}60)$ | Realised volatility (1h) |
| vol_30m | $\sigma(\text{ret}_{1m}, w{=}30)$ | Short-window volatility |
| vol_session_ratio | $\sigma_{30m} / \sigma_{\text{session}}$ | Intraday vol regime |
| vol_of_vol_60 | $\sigma(\sigma_{30m}, w{=}60)$ | Volatility clustering intensity |
| vol_regime_ratio | $\sigma_{60m} / \sigma_{240m}$ | Short vs long vol ratio |
| vol_surprise | $(\sigma_{30m} - \mu_{\sigma,240}) / \sigma_{\sigma,240}$ | Vol Z-score (surprise detection) |
| channel_width | $Q_{0.95} - Q_{0.05}$ (rolling 120 bars) | Quantile regression channel |
| skew_240m | Rolling skewness, $w{=}240$ | Return distribution asymmetry |
| kurt_240m | Rolling kurtosis, $w{=}240$ | Tail heaviness |
| er60 | $|\Delta p_{60}| / \sum_{i=1}^{60}|\Delta p_i|$ | Kaufman efficiency ratio $[0,1]$ |
| momentum_regime | Binary: MA crossover aligned with return sign | Trend alignment indicator |
| trend_strength | $\text{sign}(\text{ret}_{60m}) \times \text{ER}_{60} \times |\text{ret}_{60m}| / \sigma_{60m}$ | Signed ER × normalised magnitude |
| tod_sin | $\sin(2\pi \cdot \text{minute} / 1440)$ | Cyclical time-of-day encoding |
| tod_cos | $\cos(2\pi \cdot \text{minute} / 1440)$ | Cyclical time-of-day encoding |
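A few of the table's features can be sketched directly from an array of M1 closes. This is an illustrative numpy implementation (the helper name `core_features` and single-bar evaluation are ours); windows follow the table's definitions.

```python
import numpy as np

def core_features(close, minute_of_day):
    """Compute a handful of Group 1 features for the latest M1 bar."""
    close = np.asarray(close, float)
    logp = np.log(close)
    ret_1m = np.diff(logp, prepend=logp[0])     # 1-minute log returns
    t = len(close) - 1                          # latest bar only

    ret_60m = logp[t] - logp[t - 60]            # ln(p_t / p_{t-60})
    ma120 = close[t - 119:t + 1].mean()
    dist_ma120 = (close[t] - ma120) / ma120     # signed distance from 2h MA
    stdev60 = ret_1m[t - 59:t + 1].std()        # realised vol, 1h window
    # Kaufman efficiency ratio: net move over sum of absolute moves.
    dp = np.diff(close[t - 60:t + 1])
    er60 = abs(dp.sum()) / (np.abs(dp).sum() + 1e-12)
    # Cyclical time-of-day encoding (minute 0 and 1440 map together).
    tod_sin = np.sin(2 * np.pi * minute_of_day[t] / 1440)
    tod_cos = np.cos(2 * np.pi * minute_of_day[t] / 1440)
    return dict(ret_60m=ret_60m, dist_ma120=dist_ma120, stdev60=stdev60,
                er60=er60, tod_sin=tod_sin, tod_cos=tod_cos)
```

On a monotonically rising series the efficiency ratio approaches 1 and the MA distance is positive, matching the intended trend-quality interpretation.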

Group 2: Cross-Index Features (11 features)

Every feature in this group traces directly to a specific Phase 2 gap study. These encode cross-index momentum, risk regime, volatility state, and structural spread dynamics.

| Feature | Formula / Definition | Source |
| --- | --- | --- |
| tsmom_self_21d | $\text{sgn}\bigl(\sum_{i=1}^{21} r_i\bigr)$, trailing monthly return | Study #4 (TSMOM) |
| tsmom_idx2_21d | Same, for second index | Study #4 |
| tsmom_idx3_21d | Same, for third index | Study #4 |
| roro_ratio | $\ln(\text{NAS100} / \text{US30})$ | Study #2 (RORO) |
| roro_vs_sma21 | Binary: RORO ratio above/below 21d SMA | Study #2 |
| gk_vol_21d | Garman-Klass volatility, 21-day rolling | Study #5 (Vol regime) |
| gk_vol_pctile | Expanding percentile rank of GK vol | Study #5 |
| ibs | $(\text{close} - \text{low}) / (\text{high} - \text{low})$, daily | Study #8 (conditional on vol regime) |
| cross_idx_dispersion | $\sigma(\text{ret}_{60m}^{(i)})$ across all 3 indices | Study #4 (rotation signal) |
| log_spread_us30_us500 | $\ln(\text{US30}) - \ln(\text{US500})$ | Study #1 (novel) |
| log_spread_us30_nas100 | $\ln(\text{US30}) - \ln(\text{NAS100})$ | Study #1 (novel) |

Provenance. Every cross-index feature traces to a specific Phase 2 empirical result. The two novel features (log_spread_us30_us500, log_spread_us30_nas100) have no academic precedent; they were first tested in Gap Study #1 and are included on the basis of the extreme Z-score reversion effect documented there.
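The cross-index features follow directly from the table's formulas. A minimal numpy sketch (the helper name and input conventions are ours; inputs are aligned M1 closes plus completed daily returns for the TSMOM signal):

```python
import numpy as np

def cross_index_features(us30_close, us500_close, nas100_close,
                         nas100_daily_ret):
    """Sketch of the Study #1/#2/#4 features for the latest bar."""
    # Study #1: log price spread between indices (drifts by construction).
    log_spread_us30_us500 = np.log(us30_close[-1]) - np.log(us500_close[-1])
    # Study #2: risk-on/risk-off ratio.
    roro_ratio = np.log(nas100_close[-1] / us30_close[-1])
    # Study #4: TSMOM = sign of the trailing 21-day return, completed days only.
    tsmom_self_21d = np.sign(np.sum(nas100_daily_ret[-21:]))
    # Study #4: dispersion of 60-minute returns across the three indices.
    r60 = np.array([np.log(p[-1] / p[-61])
                    for p in (us30_close, us500_close, nas100_close)])
    cross_idx_dispersion = r60.std()
    return dict(log_spread_us30_us500=log_spread_us30_us500,
                roro_ratio=roro_ratio, tsmom_self_21d=tsmom_self_21d,
                cross_idx_dispersion=cross_idx_dispersion)
```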

Group 3: Cross-Asset Macro (7 features)

Macro features capture risk appetite, dollar strength, carry dynamics, and energy/inflation pressure. Three candidates were dropped due to insufficient history in the common training window.

| Feature | Formula / Definition | Rationale |
| --- | --- | --- |
| vix_level | VIX spot value | Fear gauge level |
| vix_chg_60m | $\Delta\text{VIX}_{60m}$ | VIX momentum (shock detection) |
| dxy_ret_60m | $\ln(\text{DXY}_t / \text{DXY}_{t-60})$ | Dollar strength |
| dxy_corr_30 | Rolling 30-bar correlation(index, DXY) | Dollar correlation regime |
| usdjpy_ret_60m | $\ln(\text{USDJPY}_t / \text{USDJPY}_{t-60})$ | Yen carry proxy |
| btcusd_ret_60m | $\ln(\text{BTCUSD}_t / \text{BTCUSD}_{t-60})$ | Crypto risk appetite |
| brent_ret_60m | $\ln(\text{BRENT}_t / \text{BRENT}_{t-60})$ | Energy / inflation proxy |

Dropped instruments: TLT (only 3 months of M1 data in common window), LQD (3 months), USOIL (4 months; replaced by BRENT which has full coverage from 2016).

Group 4: Constituent Returns (6 features per model)

The top 5 constituents by index weight provide 60-minute returns as individual features. A sixth feature, constituent_dispersion, measures intra-index disagreement. The constituent set differs per model.

| Model | Top-5 Constituents | Dispersion Feature |
| --- | --- | --- |
| US30 | GS, MSFT, HD, CAT, V | $\sigma(\text{ret}_{60m}^{(k)})$, $k \in \{1..5\}$ |
| NAS100 | AAPL, MSFT, NVDA, AMZN, GOOG | $\sigma(\text{ret}_{60m}^{(k)})$, $k \in \{1..5\}$ |
| US500 | AAPL, MSFT, NVDA, AMZN, JPM | $\sigma(\text{ret}_{60m}^{(k)})$, $k \in \{1..5\}$ |

Group 5: Intraday Seasonality (2 features)

| Feature | Definition | Rationale |
| --- | --- | --- |
| session_flag | Asia = 0, London = 1, US = 2 | Session regime (liquidity + volatility differ by session) |
| minutes_since_us_open | Minutes elapsed since 13:30 UTC | Distance from highest-activity period |
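The two seasonality features are simple deterministic encodings of the clock. A sketch under assumed session boundaries (the paper fixes only the 13:30 UTC US open; the Asia/London cutoffs below are illustrative):

```python
def session_features(minute_utc):
    """Session flag and minutes-since-US-open from UTC minute-of-day.
    Asia/London boundaries are assumptions, not the paper's values."""
    us_open = 13 * 60 + 30              # 13:30 UTC
    if minute_utc < 7 * 60:             # assumed Asia window
        session_flag = 0
    elif minute_utc < us_open:          # assumed London window
        session_flag = 1
    else:
        session_flag = 2                # US session
    minutes_since_us_open = max(0, minute_utc - us_open)
    return session_flag, minutes_since_us_open
```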

Feature Count Summary

| Group | Features |
| --- | --- |
| Own-Instrument Core | 18 |
| Cross-Index | 11 |
| Cross-Asset Macro | 7 |
| Constituent Returns | 6 |
| Intraday Seasonality | 2 |
| Total | 44 |

Normalisation

| Method | Applied To | Window |
| --- | --- | --- |
| rolling_z | Continuous non-stationary features (returns, distances, vol levels) | $w = 1440$ (24 hours) |
| zscore (expanding) | Stable distributions (GK vol percentile, kurtosis) | Expanding from start of training set |
| passthrough | Bounded or naturally scaled features (ER, IBS, session_flag, tod_sin/cos) | None |

Lookahead Prevention

All features are strictly causal. Daily IBS uses the previous completed day only. TSMOM signals use completed daily returns only. No feature reads future prices. Rolling windows use only data available at time $t$, with no forward-looking statistics.

Feature Provenance

The following table summarises the link between cross-index / cross-asset features and the Phase 2 gap studies that justified their inclusion.

Feature(s)Phase 2 StudyKey Finding
tsmom_self_21d, tsmom_idx2_21d, tsmom_idx3_21d, cross_idx_dispersionStudy #4 (Cross-index momentum)TSMOM rotation: Sharpe 1.27
roro_ratio, roro_vs_sma21Study #2 (RORO ratio)Valid vol regime indicator; 20-28% higher vol in risk-off
gk_vol_21d, gk_vol_pctileStudy #5 (Vol regime selection)MR works in high-vol NAS100 (Sharpe 0.99)
ibsStudy #8 (IBS/RSI replication)Conditional on vol regime only; fails in aggregate
log_spread_us30_us500, log_spread_us30_nas100Study #1 (PW vs CW divergence)Novel; extreme Z-score reversion observed
session_flag, minutes_since_us_openStudy #9 (Intraday seasonality)Vol and momentum differ by session

7.3 Normaliser Selection

Why Normalisation Matters

Raw features can drift across regimes — VIX level, channel width, and kurtosis all exhibit non-stationary behaviour over months-long windows. Without normalisation, drifting features dominate the neural net's gradient updates, causing training instability or leading the model to learn spurious regime-dependent patterns. But normalisation can also destroy information, particularly in features where the raw scale is the signal. Absolute volatility levels, dispersion magnitudes, and vol ratios all carry meaning in their raw units that z-scoring can erase.

Methodology

Each of the 36 continuous features was tested under three normalisation strategies on the validation set (2025-07 to 2026-03):

| Strategy | Description |
| --- | --- |
| raw | No normalisation (baseline) |
| rolling_z | Causal 30-day rolling $3\sigma$ clip + z-score |
| rolling_winsor_z | Causal 30-day rolling 1st–99th percentile clip + z-score |

Static normalisation (global mean/std computed over the full dataset) was excluded because it leaks regime information and fails on drifting features — a model trained during a low-VIX period would see systematically biased inputs during a high-VIX regime.
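The rolling_z strategy can be sketched as follows. This is one plausible reading of "3σ clip + z-score" (z-score against the trailing window, then clip the result at ±3); the loop form is for clarity, not speed, and the function name is ours.

```python
import numpy as np

def rolling_z(x, window=43200, clip_sigma=3.0):
    """Causal rolling z-score with a 3-sigma clip.  Default window is
    ~30 days of M1 bars (30 * 1440).  Bars before the first full
    window are left as NaN rather than normalised with partial data."""
    x = np.asarray(x, dtype=float)
    out = np.full_like(x, np.nan)
    for t in range(window, len(x)):
        hist = x[t - window:t]            # strictly past bars: causal
        mu, sd = hist.mean(), hist.std()
        if sd == 0:
            out[t] = 0.0
            continue
        out[t] = np.clip((x[t] - mu) / sd, -clip_sigma, clip_sigma)
    return out
```

Because the window ends at `t` (exclusive), the statistics never see the bar being normalised — the same lookahead discipline stated in Section 7.2.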

Decision rule:

  1. Compute gain = AUC(rolling_z) $-$ AUC(raw) for each feature on each index.
  2. Average across all 3 indices.
  3. If avg gain $< -0.001$ AND rolling_z hurts on at least 2/3 indices → passthrough.
  4. If already bounded/binary → passthrough.
  5. Otherwise → rolling_z (safe default for drift protection).
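The decision rule above is small enough to state as code. A direct transcription (helper name ours; `gains` holds the per-index AUC(rolling_z) − AUC(raw) values):

```python
def choose_normaliser(gains, bounded):
    """Per-feature normaliser choice following the five-step rule."""
    if bounded:                                # rule 4: bounded/binary
        return "passthrough"
    avg_gain = sum(gains) / len(gains)         # rules 1-2: average across indices
    hurts_on = sum(1 for g in gains if g < 0)
    if avg_gain < -0.001 and hurts_on >= 2:    # rule 3
        return "passthrough"
    return "rolling_z"                         # rule 5: safe default
```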

The rolling_winsor_z strategy (percentile clip instead of $\sigma$-clip) was never chosen. Gains over rolling_z were marginal and inconsistent across the three indices.

Final Split: 17 Passthrough / 28 Rolling Z-Score

The per-feature decision rule produces a clear split: 17 features are passed through without normalisation, and 28 features use rolling_z.

Passthrough Features (17)

These fall into two categories:

Bounded/binary (9):

  • er60 $[0,1]$
  • momentum_regime $\{0,1\}$
  • tod_sin $[-1,1]$, tod_cos $[-1,1]$
  • roro_vs_sma21 $\{0,1\}$
  • gk_vol_pctile $[0,1]$
  • ibs $[0,1]$
  • dxy_corr_30 $[-1,1]$
  • session_flag $\{0,1,2\}$
  • minutes_since_us_open $[0,1]$

Scale-is-signal (8):

  • ret_60m — naturally mean-zero and stationary
  • stdev60 and vol_30m — realised volatility is stationary; raw level encodes regime
  • vol_session_ratio and vol_surprise — self-normalising ratios
  • gk_vol_21d — daily Garman-Klass vol, naturally bounded (avg gain $-0.0022$)
  • cross_idx_dispersion — strongest negative (avg gain $-0.0037$)
  • vix_level — highest drift (2.72) but rolling_z kills regime signal (avg gain $-0.0037$)

Rolling Z-Score Features (28)

All other continuous features use rolling_z. Key beneficiaries:

| Feature | Avg $\Delta$AUC | Notes |
| --- | --- | --- |
| kurt_240m | +0.0020 | High drift 1.67–1.75 |
| skew_240m | +0.0020 | |
| channel_width | +0.0013 | High drift 4.5–4.8 |
| tsmom_idx3_21d | +0.0013 | Consistently positive all 3 indices |
| log_spread_us30_us500 | +0.0013 | Drifts by construction |
| abs_dist_ma120 | +0.0009 | Consistently positive all 3 indices |
| dxy_ret_60m | +0.0006 | Consistently positive all 3 indices |

All constituent stock returns: rolling_z protects against earnings/split outliers.

Cross-Instrument Results

The following tables summarise AUC gains from rolling_z versus raw on each index. A positive value means normalisation helped; a negative value means the raw scale carried predictive information that z-scoring destroyed.

Features where rolling_z helps most (AUC gain $> 0.002$ on at least one index):

| Feature | Reported $\Delta$AUC | Drift Score |
| --- | --- | --- |
| kurt_240m | +0.0074, +0.0018 | 1.67 / 1.75 |
| log_spread_us30_us500 | +0.0063 | |
| skew_240m | +0.0045 | |
| aapl_ret_60m | +0.0043 | |
| constituent_dispersion | +0.0042 | |
| vix_chg_60m | +0.0036 | |
| tsmom_self_21d | +0.0026 | |
| amzn_ret_60m | +0.0025 | |

Features where rolling_z hurts most (raw scale carries predictive information):

| Feature | Reported $\Delta$AUC |
| --- | --- |
| cross_idx_dispersion | -0.0061, -0.0032 |
| vix_level | -0.0059, -0.0063 |
| vol_session_ratio | -0.0045 |
| vol_surprise | -0.0045 |
| vol_30m | -0.0033 |
| stdev60 | -0.0033 |

Final Decision

Per-feature normaliser selection: The decision rule reveals two distinct feature populations. Features where the raw level encodes regime information (volatility, VIX, dispersion) lose predictive power when z-scored because the model needs to distinguish “VIX at 12 vs VIX at 30”, not “VIX is 1 standard deviation above recent mean.” Features with heavy tails or structural drift (kurtosis, channel width, log spreads) benefit because clipping removes outliers and z-scoring stabilises the input distribution. Final split: 17 passthrough (9 bounded + 8 scale-dependent) / 28 rolling_z.
| Normaliser | Count |
| --- | --- |
| passthrough | 17 (9 bounded + 8 scale-dependent) |
| rolling_z | 28 |
| Total | 45 |

VIX note: VIX has the highest drift (2.72) but is passthrough. If training instability is observed, $\log(\text{VIX})$ is a fallback that is more stationary while preserving regime information.

Normaliser AUC Heatmaps

The following heatmaps show directional AUC (one-vs-rest classifier on the double-barrier label) for each feature under each normalisation strategy. Green cells indicate AUC above baseline (0.5); darker shading indicates stronger signal.

US30 normaliser AUC heatmap across features and strategies
US500 normaliser AUC heatmap
NAS100 normaliser AUC heatmap

AUC Improvement from Rolling Z-Score

Bar charts showing the per-feature AUC change when switching from raw to rolling_z. Positive bars (green) indicate features that benefit from normalisation; negative bars (red) indicate features where the raw scale carries signal.

US30 AUC improvement from rolling_z vs raw
US500 AUC improvement from rolling_z vs raw
NAS100 AUC improvement from rolling_z vs raw

Drift Score vs. Normalisation AUC Gain

Scatter plots of feature drift score (x-axis, measured as the ratio of inter-month variance to intra-month variance) against AUC gain from rolling_z (y-axis). Features in the upper-right quadrant are high-drift features that benefit from normalisation. Features in the lower-right are high-drift features where normalisation hurts — these are the scale-dependent features (VIX level, dispersion) where drift is real but informative.

US30 drift score vs normalisation AUC gain
US500 drift score vs normalisation AUC gain
NAS100 drift score vs normalisation AUC gain

7.4 Model Configuration

Target Variable

The target is the forward 60-minute return, labelled via symmetric double-barrier classification. Every bar receives a directional prediction — there is no trade/no-trade gate at the model level. The barrier is set per-index to account for different price levels:

| Index | Barrier | Approx % | Rationale |
| --- | --- | --- | --- |
| US30 | \$100 | ~0.24% | DJIA ~42,000 |
| US500 | \$30 | ~0.52% | S&P 500 ~5,800 |
| NAS100 | \$200 | ~1.0% | NASDAQ-100 ~20,000 |

Bars where price stays within the barrier for the full 60-minute horizon are labelled "hold."
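The labelling logic is first-touch: whichever barrier is hit first within the horizon determines the class, and no touch yields "hold". A minimal sketch (function name ours):

```python
def barrier_label(close, t, barrier, horizon=60):
    """Symmetric double-barrier label for bar t: first touch of
    entry +/- barrier within `horizon` bars wins; no touch -> hold."""
    entry = close[t]
    for p in close[t + 1:t + 1 + horizon]:
        if p >= entry + barrier:
            return "up"
        if p <= entry - barrier:
            return "down"
    return "hold"
```

With the per-index barriers above, a US30 bar at 42,000 is labelled "up" only if price trades through 42,100 before 41,900 within the hour.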

Trading Costs

| Index | Spread |
| --- | --- |
| US30 | \$1.20 |
| US500 | \$0.50 |
| NAS100 | \$2.00 |

Architecture: VSN + TCN + Transformer

The model pipeline is: Features → Variable Selection Network (VSN) → Temporal Convolutional Network (TCN) → Transformer encoder → prediction heads. The VSN produces a dense embedding from the raw feature vector at each timestep; the TCN extracts local temporal patterns from the embedding sequence; the Transformer captures global dependencies across the full window. The adaptive denoise filter (used in the XAUUSD base model) is disabled here because index composites already smooth microstructure noise inherent in single-instrument tick data.

Four parallel ContextTCNTransformer modules operate at different temporal scales:

| Stream | Bars | Duration | Purpose |
| --- | --- | --- | --- |
| Short | 60 | 1 hour | Immediate momentum |
| Mid | 120 | 2 hours | Medium-term trend |
| Long | 240 | 4 hours | Full session context |
| Slow | 720 | 30 days | Macro regime (H1 resampled) |

The slow stream resamples to H1 bars (720 H1 bars = 30 trading days) for long-range regime context without inflating sequence length.

Variable Selection Network (VSN)

The VSN is a learned, per-timestep soft feature gate based on the Variable Selection Network introduced by Lim et al. (2021) in the Temporal Fusion Transformer. Given $F$ input features at each timestep, the VSN produces softmax-normalised importance weights via a selector MLP, then projects the weighted features into a dense embedding of dimension $E$. This allows the model to suppress noisy or irrelevant features on a bar-by-bar basis rather than treating all 44 inputs equally.

The VSN computes two complementary paths and combines them via element-wise addition:

| Path | Computation | What It Captures |
| --- | --- | --- |
| Value path | $x \odot w \rightarrow \text{Linear}(F, E)$ | How much each feature contributes (magnitude-aware) |
| Prototype path | $w^\top \cdot \text{Prototypes}(F, E)$ | Which features are active (identity-aware) |

The value path multiplies each raw feature by its importance weight and projects the result to the embedding dimension. The prototype path takes the dot product of the weight vector with a learnable prototype matrix, producing an embedding that reflects which features are selected regardless of their magnitude. The element-wise sum passes through LayerNorm to produce the final embedding fed to the TCN.
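The two-path combination can be sketched for a single timestep in numpy. This is an illustrative forward pass only: the selector is reduced to a single linear layer (the paper's VSN uses a selector MLP), the weights are random stand-ins for learned parameters, and the LayerNorm omits the affine terms.

```python
import numpy as np

rng = np.random.default_rng(0)
F, E = 45, 64                           # features in, embedding dim out

W_sel = rng.normal(0, 0.1, (F, F))      # selector (single layer here)
W_val = rng.normal(0, 0.1, (F, E))      # value-path projection
P     = rng.normal(0, 0.1, (F, E))      # learnable prototype matrix

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

def vsn_forward(x):
    """One timestep of the VSN: importance weights, value path plus
    prototype path, element-wise sum, LayerNorm."""
    w = softmax(x @ W_sel)              # importance weights, sum to 1
    value = (x * w) @ W_val             # magnitude-aware path
    proto = w @ P                       # identity-aware path
    h = value + proto
    return (h - h.mean()) / (h.std() + 1e-6), w
```

The weights `w` are exactly the per-timestep attention values analysed against the Granger F-statistics in Section 6.7.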

Why VSN before TCN. Each layer in the pipeline operates on a different axis and is blind to what the others handle:

| Component | Operates Across | Learns | Blind To |
| --- | --- | --- | --- |
| VSN | Features ($F$ axis) | Which features matter at this timestep | Temporal patterns |
| TCN | Time ($T$ axis) | Local temporal patterns (15-bar kernel) | Feature quality |
| Transformer | Time ($T$ axis) | Global dependencies across full window | Local patterns |

By composing VSN → TCN → Transformer, each layer handles what it does best. The VSN says “at this bar, dxy_ret_60m and vol_surprise are the key inputs; suppress noisy constituents.” The TCN says “over the last 15 bars of those selected features, there is momentum acceleration.” The Transformer says “across the full window, trend context supports this direction.”

Why not feed raw features directly to the TCN. The 44 features range from near-random (AUC 0.5004) to meaningfully predictive (AUC 0.5367). Without the VSN, the TCN treats every feature channel equally, wasting capacity on noise. Furthermore, feature importance is regime-dependent: momentum features matter during trends, while volatility features matter in mean-reverting markets. The VSN adapts per-timestep, allowing the downstream TCN to operate on a cleaned, regime-appropriate representation.

VSN hyperparameters:

| Parameter | Value | Notes |
| --- | --- | --- |
| Hidden dim | 64 | Selector MLP hidden size |
| Dropout | 0.15 | Matches model-wide dropout |
| Context dim | 0 | No regime context ($K = 1$) |

Parameter cost: approximately 11,648 parameters total (selector MLP ~5,760, value projection ~2,880, prototypes ~2,880, LayerNorm ~128). This is negligible relative to the Transformer encoder and does not meaningfully increase training time or memory.

VSN Entropy Regularisation

Without regularisation, the VSN softmax gate can collapse, concentrating all attention on one or two features and ignoring the rest. This wastes the 45-feature design, overfits to a narrow signal, and suppresses jointly informative but individually weak features.

We add the Shannon entropy of the VSN weights to the loss as a regularisation term:

$$H(\mathbf{w}_t) = -\sum_{i=1}^{F} w_{t,i} \log(w_{t,i})$$

where $\mathbf{w}_t$ is the $F$-dimensional softmax weight vector at timestep $t$. Maximum entropy ($\log F \approx 3.8$ for 45 features) corresponds to uniform attention; minimum entropy (0) corresponds to complete collapse onto a single feature.

The entropy is averaged across all timesteps, batch samples, and all four streams, then subtracted from the loss. Higher entropy (more diverse feature usage) reduces the loss, nudging the model toward balanced attention.
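The entropy term is a one-liner; the sketch below reproduces the two reference points quoted in the scenario table (uniform attention over 45 features gives $\ln 45 \approx 3.81$, full collapse gives $\approx 0$). Variable names are illustrative.

```python
import numpy as np

def vsn_entropy(w, eps=1e-12):
    """Shannon entropy of a VSN softmax weight vector (natural log)."""
    w = np.asarray(w, dtype=float)
    return float(-(w * np.log(w + eps)).sum())

uniform = np.full(45, 1 / 45)            # all features equally weighted
spiky = np.zeros(45); spiky[0] = 1.0     # collapsed onto one feature
# vsn_entropy(uniform) ~ log(45) ~ 3.81; vsn_entropy(spiky) ~ 0
```

Multiplying by $\lambda_{\text{vsn}} = 0.002$ and subtracting from the loss yields the scenario table's numbers (e.g. $0.002 \times 3.8 \approx 0.0076$).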

| Parameter | Value | Notes |
| --- | --- | --- |
| $\lambda_{\text{vsn}}$ | 0.002 | Deliberately small: direction loss (~1.0) dominates; entropy term (~0.006) acts as a gentle nudge |

| Scenario | Entropy | Effect on Loss |
| --- | --- | --- |
| Uniform attention (all 45 features) | ~3.8 | Loss reduced by ~0.0076 |
| Concentrated on 5 features | ~1.6 | Loss reduced by ~0.0032 |
| Collapsed to 1 feature | ~0.0 | No entropy benefit |

The model learns to balance concentrating on the most predictive features (to minimise direction loss) against maintaining enough diversity to earn the entropy bonus. If entropy drops below ~1.0 during training, the VSN is collapsing and $\lambda_{\text{vsn}}$ should be increased.

TCN + Transformer Hyperparameters

| Parameter | Value |
| --- | --- |
| Embedding dimension | 128 |
| Layers | 1 |
| Attention heads | 4 (32 per head) |
| Dropout | 0.15 |
| TCN channels | 64 |
| TCN kernel | 15 (15-min receptive field) |

Training Configuration

| Parameter | Value | Notes |
| --- | --- | --- |
| Epochs | 50 | With warmup + cosine schedule |
| Batch size | 512 | Fits GPU with 4 streams |
| Learning rate | $3 \times 10^{-4}$ | Standard Transformer LR |
| Weight decay | 0.005 | Regularisation |
| Expected PnL loss | Disabled | Use supervised BCE/CE for direction |
| Regime clusters | $K = 1$ | No clustering; learn direction first |

Design Decisions

$K = 1$ regime clustering. A single prediction head is used. Regime clustering with $K > 1$ fragments the already limited data across multiple heads, each seeing a fraction of the training samples. The model learns direction first; regime specialisation can be added once the base model demonstrates signal.

No trade gate. Every bar receives an up/down/hold prediction. The trade/no-trade decision is made by the executor based on confidence thresholds, not by the model. This keeps the model focused on directional classification and avoids conflating two separate objectives in a single output.

Dropout 0.15. Higher than the typical 0.05–0.10 used in NLP Transformers, because financial features are substantially noisier than language tokens. This value was validated on the XAUUSD base model, where lower dropout (0.05) led to overfitting on training data.

Learning rate $3 \times 10^{-4}$. Standard for Transformer architectures. Higher rates (e.g., $10^{-2}$) cause catastrophic early updates that destroy the attention mechanism before it can learn meaningful patterns. Lower rates (e.g., $10^{-5}$) converge too slowly within 50 epochs.
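The "warmup + cosine" schedule referenced in the training configuration can be sketched as follows. The warmup length is an assumption (the paper states only the peak LR and the cosine decay); the function name is ours.

```python
import math

def lr_schedule(epoch, total_epochs=50, warmup_epochs=5, peak_lr=3e-4):
    """Linear warmup to peak_lr, then cosine decay toward zero.
    warmup_epochs is an assumed value, not from the paper."""
    if epoch < warmup_epochs:
        return peak_lr * (epoch + 1) / warmup_epochs      # linear ramp
    progress = (epoch - warmup_epochs) / (total_epochs - warmup_epochs)
    return 0.5 * peak_lr * (1 + math.cos(math.pi * progress))
```

Under this shape the effective LR during early warmup epochs sits well below the peak, consistent with the observation in Section 7.5 that the best validation epoch falls inside the warmup phase.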

Data Pipeline

M1 OHLCV → Feature builder (45 features) → Normaliser (17 passthrough / 28 rolling_z) → Double-barrier labels → Sequence dataset (4 windows) → VSN (soft feature gate) → TCN + Transformer → $p_{\text{up}}, p_{\text{down}}, p_{\text{hold}}$

7.5 Training Results

Cross-Index Summary

The table below consolidates all training runs across the three indices, highlighting the Run 2 improvements.

| Index | Run | Best Epoch | Val Acc | Val Loss | Class Gap | VSN Ratio | Status |
| --- | --- | --- | --- | --- | --- | --- | --- |
| US30 | Run 1 | 3 | 67.8% | 0.933 | 6.0pp | 3.1x | Superseded |
| US30 | Run 2 | 4 | 68.4% | 0.891 | 1.6pp | 2.0x | Deploy candidate |
| US30 | Run 3a | 3 | 55.7% | 1.562 | 16.7pp | | Failed — aux loss dominance |
| US30 | Run 3b | | 55.4% | | | | Failed — capacity bottleneck |
| US30 | Run 3c | 8 | 55.3% | 2.964 | | | Failed — position-agnostic VSN |
| US30 | Run 3d | 5 | 70.5% | 1.029 | 4.7pp | | New best |
| US500 | Run 1 | 7 | 63.1% | 1.649 | 15.5pp | 3.8x | Superseded |
| US500 | Run 2 | 5 | 62.0% | 1.349 | 4.9pp | 2.0x | Superseded |
| US500 | Run 3d | 2 | 68.1% | | 18.3pp | | New best (+6.1pp) |
| NAS100 | Run 1 | 5 | 68.9% | 0.792 | 0.6pp | 2.2x | Superseded |
| NAS100 | Run 2 | 3 | 68.9% | 0.783 | 20.2pp* | 1.8x | Superseded |
| NAS100 | Run 3d | 2 | 68.7% | | 11.2pp | | No improvement; 4-stream preferred |
| US30 | Run 3e | | | | | | Failed — weighted fallback poisoned training |
| US500 | Run 3e | | | | | | Failed — weighted fallback poisoned training |
| US30 | Run 3f | 1 | 67.6% | | | | First profitable backtest: +$82,843 |
| US500 | Run 3f | | | | | | Unprofitable — spread cost prohibitive |
| US30 | Run 3h | 1 | 64.7% (tradeable) | | | | +$37,266; 3-class HOLD; edge in 00-06 UTC |
| US30 | Run 3i | 1 | | | | | +$66,370; asymmetric barriers; UP 44.7% / DOWN 29.6% |
| US30 | Run 3j | 3 | | | | | +$79,938; MAE smoothing eliminated epoch cliff; short WR 59.3% |
| US30 | Run 3k | 1 | | | | | +$28,931; symmetric barriers hurt shorts; softmax zero-sum confirmed |
| US30 | Run 3L Short | 7 | | | | | +$127,633; PF 1.90; short specialist; best result in study |
| US30 | Run 3L Long | 1 | | | | | -$13,690; barrier labels fundamentally wrong for longs |
| US30 | Run 3M | 33 | | | | | -$47,413; return labels; dip-buy signal discovered |
| US30 | Run 3N | 30 | | | | | +$4,683; first profitable longs; dip-buy model |
| US30 | Run 3O | 43 | | | | | +$4,157; wider TP/SL; similar PnL, fewer trades |

*NAS100 Run 2 epoch 3 has a transient bullish bias (20.2pp gap) that resolves to 0.7pp by epoch 5. For balanced deployment, use epoch 5 (68.3% accuracy).

Note: US500 and NAS100 results are invalidated by the barrier calibration flaw discovered in Section 7.11. Their barriers (US500 $90, NAS100 $200) were 27-29x the median hourly move, producing 0% real barrier hits. 100% of training labels were fallback close-to-close direction, not barrier-based signal. US30 ($100 barrier, 3.7x ratio, 21% hit rate) was partially valid but suboptimal. Retraining with corrected barriers is required.

US30 — Run 1 & Run 2 Detail

US30 — Run 1 (Diagnostic)

Simulated Results — All results in this section are from simulated training and validation on historical data. They do not represent live trading performance. Validation accuracy measures directional prediction on held-out bars (2025-07 to 2026-03) that were not seen during training.

This is the first training run for the US30 model. The purpose is diagnostic: confirm the architecture can learn directional signal, identify failure modes, and calibrate regularisation for subsequent runs. The results reveal severe overfitting but also genuine directional signal in the validation set.

Configuration

| Parameter | Value |
|---|---|
| Target | US30 |
| Barrier | $100 |
| Spread | $1.20 |
| Batch size | 512 |
| Learning rate | $3 \times 10^{-4}$ (warmup + cosine) |
| Epochs | 18 / 50 (early termination) |
| VSN entropy $\lambda$ | 0.001 (later increased to 0.002) |
| Train period | 2021-07 to 2025-06 |
| Validation period | 2025-07 to 2026-03 |

Headline Results

| Metric | Value |
|---|---|
| Best validation loss | 0.933 (Epoch 3) |
| Best validation direction accuracy | 67.8% (Epoch 3) |
| Final validation direction accuracy | 64.9% (Epoch 18) |
| Final train direction accuracy | 92.0% (Epoch 18) |
| Coverage | 95.7% |
| $p_{\text{up}}$ std | 0.438 (healthy, no hedging) |
| VSN entropy | 3.635 (max 3.81) |

Key Observations

Epoch 3 is the sweet spot. Validation loss hits its minimum (0.933) and validation accuracy peaks (67.8%) at epoch 3, during the warmup phase when the effective learning rate is approximately $1.4 \times 10^{-4}$. Everything after epoch 3 is overfitting. This pattern is consistent with the XAUUSD base model experience: Transformers on noisy financial data find their best generalisation early, before the optimiser has enough capacity to memorise training noise.

Severe overfitting from epoch 4 onwards. Validation loss increased 143% from epoch 3 to epoch 18 (0.93 to 2.27). The train–validation accuracy gap grew from 6.7 percentage points (epoch 3: 74.5% train, 67.8% val) to 27.1 percentage points (epoch 18: 92.0% train, 64.9% val). The model memorised the training set.

Directional signal is real. A validation accuracy of 67.8% is well above the 50% random baseline and above the ~55% threshold typically required for profitability after transaction costs. DOWN accuracy (70.5%) exceeds UP accuracy (64.5%), indicating a slight bearish bias in the model's learned representations. This asymmetry may reflect the validation period (2025-07 to 2026-03) containing more volatile down-moves that are easier to predict.

VSN is healthy. Entropy decreased from 3.78 to 3.64 (theoretical maximum 3.81 for 45 features), meaning the Variable Selection Network learned to differentiate feature importance without collapsing to a small subset. The entropy regularisation term ($\lambda = 0.001$) served its purpose.
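These entropy figures can be grounded with a small calculation: uniform softmax attention over 45 features gives the theoretical maximum of $\ln 45 \approx 3.81$ nats, and any concentration pulls the value down. A minimal numpy sketch with illustrative weight vectors:

```python
import numpy as np

def vsn_entropy(weights: np.ndarray) -> float:
    """Shannon entropy (in nats) of a VSN softmax weight vector."""
    w = weights / weights.sum()              # defensive renormalisation
    return float(-(w * np.log(w + 1e-12)).sum())

# Uniform attention over 45 features gives the theoretical maximum:
uniform = np.full(45, 1 / 45)
print(round(vsn_entropy(uniform), 2))        # 3.81 (= ln 45)

# A concentrated vector scores lower, as in the observed 3.78 -> 3.64 decline:
peaked = np.full(45, 0.2 / 44)
peaked[0] = 0.8                              # 80% of attention on one feature
print(round(vsn_entropy(peaked), 2))
```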

No gradient issues. Gradient norms remained stable throughout all 18 epochs. No exploding or vanishing gradients were observed, confirming the warmup + cosine annealing schedule is appropriate for this architecture.

Coverage ramped quickly. Coverage (fraction of bars where the model produces a non-hold prediction with sufficient confidence) increased from 60% at epoch 1 to 96% by epoch 6. The model became confident on nearly all directional bars early in training.
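Coverage as defined here reduces to a confidence gate over the predicted probabilities. A sketch (the 0.6 threshold below is illustrative; the gate actually used in these runs is not specified):

```python
import numpy as np

def coverage(p_up: np.ndarray, threshold: float = 0.6) -> float:
    """Fraction of bars where the model commits to a direction.

    A bar counts as covered when either class probability clears
    the confidence threshold.
    """
    confident = np.maximum(p_up, 1 - p_up) >= threshold
    return float(confident.mean())

p = np.array([0.91, 0.55, 0.12, 0.48, 0.83])
print(coverage(p))   # 0.6 — three of five bars clear the gate
```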

Charts

US30 Run 1 loss curves
US30 Run 1: train and validation loss curves. Validation loss minimises at epoch 3 then diverges sharply, reaching 2.27 by epoch 18 while train loss continues to decline.
US30 Run 1 direction accuracy
Direction accuracy: 67.8% validation peak at epoch 3, then plateau around 64–65% while train accuracy climbs to 92%. The widening gap is the signature of overfitting.
US30 Run 1 VSN entropy
VSN entropy: healthy decline from 3.78 to 3.64 without collapse. The network learned to weight features differentially while maintaining broad attention.
US30 Run 1 per-class accuracy
Per-class accuracy: DOWN (70.5%) consistently beats UP (64.5%), suggesting the model captures bearish patterns more reliably in the validation window.

VSN Per-Stream Feature Preferences

Each of the four temporal streams learned to attend to different features, validating the multi-scale architecture. The VSN softmax weights started near-uniform (max/min ratio ~1.2x) and gradually differentiated to a 3.1x ratio by the final epoch.

| Stream | Duration | Top Features | Interpretation |
|---|---|---|---|
| Short (60 bars) | 1 hour | dist_ma120, dist_ma_290, tod_cos | Price distance from MAs and time-of-day: short-term mean-reversion signals |
| Mid (120 bars) | 2 hours | vix_chg_60m, cross_idx_dispersion, cat_ret_60m | Volatility changes and cross-index dynamics: risk sentiment |
| Long (240 bars) | 4 hours | roro_ratio, log_spread_us30_nas100, cross_idx_dispersion | Risk-on/risk-off and cross-index spreads: regime-level signals |
| Slow (720 bars) | 12 hours | ret_60m, dist_ma120, abs_dist_ma120 | Recent returns and MA distance: daily trend context |

This specialisation is exactly what the VSN was designed to produce. Short-term streams focus on price action and intraday timing; longer streams focus on cross-index regime signals from Phase 2 studies. The RORO ratio and log spreads (novel features from Gap Studies #1 and #2) appear prominently in the long stream, confirming they carry regime-level information.

Consistently neglected features: log_spread_us30_us500 (lowest in 3/4 streams), er60 (efficiency ratio), vol_30m (redundant with stdev60), and individual constituent returns gs_ret_60m and hd_ret_60m. These are candidates for removal in future feature pruning.

Label Distribution

The $100 symmetric barrier produced 45.2% UP and 54.8% DOWN labels with 0% HOLD. Every single bar hit the barrier within 60 minutes, meaning the barrier is too narrow relative to US30's intraday volatility. A wider barrier would create HOLD labels for ambiguous bars, potentially improving signal quality by excluding noise. This is a candidate change for future runs.
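The labelling rule described above is a first-touch barrier over M1 prices. A minimal sketch (function and price path are illustrative, not the training pipeline's implementation):

```python
import numpy as np

def barrier_label(prices: np.ndarray, t: int, barrier: float = 100.0,
                  horizon: int = 60) -> str:
    """First-touch label for the bar at index t on M1 prices.

    UP/DOWN if price moves +/- `barrier` from the entry price within
    `horizon` minutes; HOLD if neither barrier is touched — the case
    that never occurred at the $100 barrier in Run 1.
    """
    entry = prices[t]
    for px in prices[t + 1 : t + 1 + horizon]:
        if px - entry >= barrier:
            return "UP"
        if entry - px >= barrier:
            return "DOWN"
    return "HOLD"

# A steady +$2/min drift touches the +$100 barrier within the hour:
path = np.concatenate([[44000.0], 44000 + np.cumsum(np.full(60, 2.0))])
print(barrier_label(path, 0))   # "UP"
```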

Diagnosis

Severe overfitting: the model learns genuine directional signal (67.8% validation accuracy at epoch 3) but memorises training data within 5 epochs. The best checkpoint would use epoch 3 weights. Run 2 will address this with stronger regularisation, earlier stopping, and a shorter warmup schedule.

Recommendations for Run 2

| Change | Run 1 | Run 2 | Rationale |
|---|---|---|---|
| Early stopping | None | 5-epoch patience | Stop when validation loss stalls |
| Dropout | 0.15 | 0.25 | Stronger regularisation against memorisation |
| Weight decay | 0.005 | 0.01 | Stronger L2 penalty on weights |
| VSN entropy $\lambda$ | 0.001 | 0.002 | Prevent late-stage attention collapse |
| Max epochs | 50 | 20 | No value past epoch 10–15 |
| LR warmup | 5 epochs | 3 epochs | Best validation at epoch 3; warmup should end sooner |
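The 5-epoch patience rule amounts to a standard early-stopping loop. A schematic sketch (the `run_epoch` callback is a stand-in for one training pass, and the loss sequence is shaped like Run 1's, with its minimum at epoch 3):

```python
def train_with_patience(epochs: int, patience: int, run_epoch) -> int:
    """Stop when validation loss fails to improve for `patience` epochs.

    `run_epoch(epoch)` performs one training pass and returns the
    validation loss; returns the best epoch (1-indexed).
    """
    best_loss, best_epoch, stale = float("inf"), 0, 0
    for epoch in range(1, epochs + 1):
        val_loss = run_epoch(epoch)
        if val_loss < best_loss:
            best_loss, best_epoch, stale = val_loss, epoch, 0
        else:
            stale += 1
            if stale >= patience:
                break
    return best_epoch

# Minimum at epoch 3, divergence afterwards — training halts at epoch 8.
losses = [1.4, 1.05, 0.93, 0.98, 1.1, 1.3, 1.5, 1.7, 1.9, 2.1]
print(train_with_patience(10, 5, lambda e: losses[e - 1]))   # 3
```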
US500 — Run 1 & Run 2 Detail

US500 — Run 1 (Diagnostic)

Simulated Results — All results in this section are from simulated training and validation on historical data. They do not represent live trading performance. Validation accuracy measures directional prediction on held-out bars (2025-07 to 2026-03) that were not seen during training.

Configuration

| Parameter | Value |
|---|---|
| Target | US500.f |
| Barrier | $30 |
| Spread | $0.50 |
| Batch size | 512 |
| Learning rate | $3 \times 10^{-4}$ |
| Epochs | 9 / 50 |
| VSN entropy $\lambda$ | 0.001 |

Headline Results

| Metric | Value |
|---|---|
| Best val loss | 1.143 (Epoch 1) |
| Best val direction accuracy | 63.1% (Epoch 7) |
| Final val accuracy | 62.0% (Epoch 9) |
| Final train accuracy | 89.0% |
| Coverage | 95.7% |
| $p_{\text{up}}$ std | 0.418 (no hedging) |
| VSN entropy | 3.687 (max 3.81) |

Key Observations

Lower accuracy ceiling than US30. Best validation accuracy reached 63.1% versus US30's 67.8% — a 4.7 percentage-point gap. The accuracy plateau at 62–63% from epoch 3 onwards suggests a structural ceiling for this feature set on US500. The S&P 500's higher diversification (500 constituents vs 30) may dilute the signal carried by individual-stock features in the feature set.

Overfitting even faster than US30. Validation loss was best at epoch 1 (after only a single pass through the data) and never improved. The generalisation gap grew 21% faster than US30's at the same stage, reaching a train–validation accuracy spread of 27 percentage points by epoch 9 (a level US30 did not reach until epoch 13). This accelerated memorisation is consistent with a noisier label set from the too-tight barrier.

Strong bullish bias. The predicted $p_{\text{up}}$ mean stayed at 0.55–0.64 throughout training. UP accuracy (70–77%) far exceeded DOWN accuracy (32–55%). This is the mirror image of US30's bearish bias. Label distribution is nearly balanced (UP 51.2%, DOWN 48.8%), so the bias is learned, not inherited from the data. The model finds it easier to predict upward moves in the validation window — consistent with the post-2024 bull trend in large-cap equities.

VSN feature preferences consistent with US30. Top features across both indices: cross_idx_dispersion (#1 in both), ret_60m (#2), dist_ma120 (#3). Bottom in both: log_spread_us30_us500. This consistency suggests genuine signal rather than noise fitting. The cross-index dispersion feature — designed from Gap Study #2 — is the most informative single feature for both indices, validating the Phase 2 empirical work.

MID stream over-concentrated. The MID stream (120-bar, 2-hour context) has an 18.8x max/min attention ratio — nearly ignoring most features in favour of cross_idx_dispersion and ret_60m. While some specialisation is desirable, this level of concentration risks fragility. This is a candidate for higher per-stream entropy regularisation in Run 2.

$30 barrier too tight. The barrier produced 0% HOLD labels — every single bar hit the $30 barrier within 60 minutes. US500's typical hourly range is $15–$25, so $30 is only 1.2–2x the typical move. A wider barrier ($50) would create HOLD labels for ambiguous bars, improving label quality by excluding noise periods.

Charts

US500 Run 1 loss curves
US500 Run 1: val loss minimises at epoch 1 and never recovers. The model begins memorising from the first gradient update.
US500 Run 1 direction accuracy
Direction accuracy: 63.1% validation peak at epoch 7, 4.7pp below US30's 67.8%. Train accuracy climbs to 89% while validation plateaus at 62–63%.
US500 Run 1 per-class accuracy
Per-class accuracy: extreme UP/DOWN asymmetry. UP accuracy reaches 77% at epoch 1 while DOWN accuracy starts at 32%, revealing a strong bullish bias throughout training.
US500 Run 1 VSN entropy
VSN entropy: healthy at 96.7% of theoretical maximum (3.687 / 3.81). The network maintains broad attention without collapse.

VSN Per-Stream Feature Preferences

Each of the four temporal streams learned distinct feature preferences, consistent with the multi-scale architecture design. The MID stream shows the highest concentration (18.8x max/min ratio), focusing almost exclusively on cross-index dynamics.

| Stream | Duration | Top Features | Focus |
|---|---|---|---|
| Short (60 bars) | 1 hour | dist_ma120, trend_strength, tod_cos | Mean reversion + intraday timing |
| Mid (120 bars) | 2 hours | cross_idx_dispersion, ret_60m, trend_strength | Cross-index dynamics (18.8x concentration) |
| Long (240 bars) | 4 hours | roro_ratio, cross_idx_dispersion, ret_60m | Regime context |
| Slow (720 bars) | 12 hours | dist_ma120, ret_60m, dist_ma_290 | Daily trend context |

Cross-Index Comparison: US30 vs US500

| Metric | US30 | US500 |
|---|---|---|
| Best val accuracy | 67.8% | 63.1% |
| Best val loss epoch | 3 | 1 |
| Overfit gap (epoch 9) | 1.53 | 1.87 |
| Class balance bias | DOWN > UP by 8pp | UP > DOWN by 15pp |
| VSN concentration | 3.1x | 3.8x |

Diagnosis

Weaker generalisation than US30. US500 shows 63.1% vs 67.8% validation accuracy with faster overfitting (val loss never improved past epoch 1). The $30 barrier produces noisier labels (0% HOLD), and the model develops a strong bullish bias. The consistent feature preferences across both indices validate the feature set, but US500 likely needs a wider barrier and stronger regularisation to close the accuracy gap.

Recommendations for Run 2

| Change | Run 1 | Run 2 | Rationale |
|---|---|---|---|
| Barrier | $30 | $50 | 0% HOLD rate; barrier too tight for US500 volatility |
| Early stopping | None | 5-epoch patience | Val loss never improved past epoch 1 |
| Dropout | 0.15 | 0.25 | Reduce memorisation; overfitting faster than US30 |
| Weight decay | 0.005 | 0.01 | Stronger L2 regularisation |
| VSN entropy $\lambda$ | 0.001 | 0.002 | MID stream 18.8x too concentrated |
| Max epochs | 50 | 15 | No improvement after epoch 7 |

US500 — Run 2

Simulated Results — All results in this section are from simulated training and validation on historical data. They do not represent live trading performance. Validation accuracy measures directional prediction on held-out bars (2025-07 to 2026-03) that were not seen during training.

US500 Run 2 applies the same configuration template as US30 Run 2: max LR halved to $1.5 \times 10^{-4}$, VSN entropy $\lambda$ doubled to 0.004, two noise features pruned (45 → 43). The decisive additional change is the barrier: widened from $30 to $90, a 3x increase, to address Run 1's 0% HOLD rate and extreme bullish bias.

Configuration Changes from Run 1

| Parameter | Run 1 | Run 2 |
|---|---|---|
| Max LR | $3 \times 10^{-4}$ | $1.5 \times 10^{-4}$ |
| VSN entropy $\lambda$ | 0.002 | 0.004 |
| Barrier | $30 | $90 |
| Features | 45 | 43 |

Run 1 vs Run 2 Comparison

| Metric | Run 1 (Ep 7) | Run 2 (Ep 5) | Change |
|---|---|---|---|
| Val Accuracy | 63.1% | 62.0% | −1.1pp |
| Val Loss | 1.649 | 1.349 | −18% |
| Class Acc Gap | 15.5pp | 4.9pp | −68% |
| UP/DOWN Acc | 70.6 / 55.1 | 64.4 / 59.5 | Balanced |
| $p_{\text{up}}$ Mean | 0.573 | 0.523 | Centred |
| VSN Mean Ratio | 3.8x | 2.0x | −47% |
| VSN MID Ratio | 18.8x | 4.3x | −77% |

Key Findings

1. Class balance is the headline improvement. The per-class accuracy gap shrank from 15.5pp to 4.9pp, a 68% reduction. Run 1's strong bullish bias (UP 70.6%, DOWN 55.1%) is replaced by balanced predictions (UP 64.4%, DOWN 59.5%). The $90 barrier was the decisive fix: it produced cleaner labels by excluding bars where price moved less than $90 in 60 minutes, forcing the model to distinguish genuine directional moves from noise.

2. Val loss improved 18% despite lower accuracy. Val loss dropped from 1.649 to 1.349. The apparent contradiction with the −1.1pp accuracy drop reflects cleaner labels: a wider barrier makes each prediction harder (price must move further to count as correct), but the model's probability outputs are better calibrated. Lower loss with slightly lower accuracy is the expected signature of improved label quality.
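This decoupling of loss and accuracy is a general property of cross-entropy: overconfident mistakes are punished far more heavily than calibrated ones. A toy numeric illustration (invented numbers, not from these runs):

```python
import numpy as np

def log_loss(p_up: np.ndarray, y: np.ndarray) -> float:
    """Binary cross-entropy for predicted P(up) against 0/1 labels."""
    p = np.clip(p_up, 1e-12, 1 - 1e-12)
    return float(-(y * np.log(p) + (1 - y) * np.log(1 - p)).mean())

y = np.array([1, 1, 0, 0])
overconfident = np.array([0.97, 0.97, 0.03, 0.97])  # near-certain everywhere
calibrated    = np.array([0.70, 0.45, 0.30, 0.55])  # hedged, modest confidence

acc = lambda p: float(((p >= 0.5) == y).mean())
print(acc(overconfident), round(log_loss(overconfident, y), 2))  # 0.75 0.9
print(acc(calibrated),    round(log_loss(calibrated, y), 2))     # 0.5 0.58
```

The calibrated model scores worse on accuracy but much better on loss, because its one confident-and-wrong counterpart (the 0.97 on a DOWN bar) dominates the overconfident model's average.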

3. VSN MID stream concentration fixed. The MID stream's max/min attention ratio dropped from 18.8x to 4.3x, a 77% reduction. Run 1's MID stream was nearly ignoring most features in favour of cross_idx_dispersion and ret_60m. The doubled entropy regularisation ($\lambda$ 0.002 → 0.004) forced broader attention without distorting the overall feature ranking.

4. $p_{\text{up}}$ centred. The mean predicted probability of UP moved from 0.573 (bullish bias) to 0.523 (near-centred). The model no longer defaults to predicting UP when uncertain.

5. Val accuracy slightly lower. 62.0% vs 63.1% (−1.1pp). This is expected: the wider $90 barrier means the model must predict larger moves correctly, which is inherently harder. The accuracy drop is small relative to the class balance improvement.

6. Still 0% HOLD even at $90. US500 moves more than $90 in virtually every 60-minute window. This is consistent with US500's typical hourly range. A barrier wide enough to generate HOLD labels would likely be so wide as to reduce the number of actionable predictions below a useful threshold.

Top Features (Mean Across Streams)

| Rank | Feature | Mean Weight |
|---|---|---|
| 1 | dist_ma120 | 0.0356 |
| 2 | cross_idx_dispersion | 0.0356 |
| 3 | ret_60m | 0.0304 |
| 4 | vol_session_ratio | 0.0276 |
| 5 | roro_ratio | 0.0264 |

Diagnosis

The $90 barrier was the decisive fix. Class balance improved dramatically (15.5pp → 4.9pp gap, 68% reduction), VSN concentration is controlled (MID stream 18.8x → 4.3x), and $p_{\text{up}}$ is centred at 0.523. Val accuracy is marginally lower (−1.1pp) because the wider barrier makes predictions harder, but val loss improved 18%, indicating better-calibrated outputs. US500 Run 2 is ready for deployment consideration alongside US30. The top features (dist_ma120, cross_idx_dispersion, ret_60m) remain consistent with US30, further validating the shared feature set.

Charts

US500 Run 2: training and validation loss curves
US500 Run 2: training and validation loss curves. Val loss improved 18% vs Run 1 despite slightly lower accuracy.
US500 Run 2: direction accuracy by epoch
US500 Run 2: direction accuracy by epoch. Peak at epoch 5 (62.0%) vs Run 1's epoch 7 (63.1%).
US500 Run 2: per-class accuracy showing balanced predictions
US500 Run 2: per-class accuracy showing balanced predictions. UP/DOWN gap reduced from 15.5pp to 4.9pp.
US500 Run 2: VSN entropy showing controlled feature concentration
US500 Run 2: VSN entropy showing controlled feature concentration. MID stream ratio dropped from 18.8x to 4.3x.
NAS100 — Run 1 & Run 2 Detail

NAS100 — Run 1 (Diagnostic)

Simulated Results — All results in this section are from simulated training and validation on historical data. They do not represent live trading performance. Validation accuracy measures directional prediction on held-out bars (2025-07 to 2026-03) that were not seen during training.

Configuration

| Parameter | Value |
|---|---|
| Target | NAS100 |
| Barrier | $200 |
| Spread | $2.00 |
| Batch size | 512 |
| Learning rate | $3 \times 10^{-4}$ |
| Epochs | 8 / 50 |
| VSN entropy $\lambda$ | 0.001 |

Headline Results

| Metric | Value |
|---|---|
| Best val loss | 0.792 (Epoch 2) |
| Best val direction accuracy | 68.9% (Epoch 3) |
| Final val accuracy | 64.2% (Epoch 8) |
| Final train accuracy | 82.5% |
| $p_{\text{up}}$ std | 0.409 (no hedging) |
| VSN entropy | 3.724 (97.6% of max) |

Key Observations

Best model of the three indices. 68.9% validation accuracy (vs US30's 67.8%, US500's 63.1%). The only model to achieve a negative generalisation gap: at epoch 2, validation loss (0.792) was lower than training loss (0.850). This is rare and indicates genuine out-of-sample signal.

Near-perfect class balance at peak. At epoch 3, UP accuracy was 69.1% and DOWN accuracy was 68.5%, a gap of only 0.6 percentage points. This contrasts sharply with US30's bearish bias (8pp gap) and US500's extreme bullish bias (15–20pp gap). After epoch 3, the model oscillated between bullish and bearish bias each epoch, a sign of instability.

No persistent directional bias. $p_{\text{up}}$ mean oscillated around 0.50 without trending. US30 was persistently bearish (~0.45), US500 persistently bullish (~0.60). NAS100 stayed centred.

Rapid learning. Validation accuracy jumped from 52.1% to 68.8% in a single epoch (epoch 1 to 2), the largest single-epoch gain across all indices. This suggests NAS100's features carry stronger initial signal.

VSN discovered unique features. Top features include momentum_regime and brent_ret_60m, which were NOT top-ranked in US30 or US500. NAS100 is more sensitive to oil prices (energy cost for tech) and momentum regime (tech has stronger momentum).

Consistent feature ranking across indices. dist_ma120 (#1 in NAS100, #3 in US30/US500), ret_60m (#2 in all three), log_spread_us30_us500 (last in all three). This cross-index consistency validates the feature set.

VSN Per-Stream Feature Preferences

| Stream | Duration | Top Features | Max/Min Ratio |
|---|---|---|---|
| Short (60 bars) | 1 hour | dist_ma120, trend_strength, momentum_regime | 9.2x |
| Mid (120 bars) | 2 hours | brent_ret_60m, dist_ma_290, trend_strength | 3.0x (most balanced) |
| Long (240 bars) | 4 hours | tod_cos, roro_ratio, brent_ret_60m | 3.2x |
| Slow (720 bars) | 12 hours | ret_60m, dist_ma120, abs_dist_ma120 | 6.1x |

Three-Index Comparison

| Metric | NAS100 | US30 | US500 |
|---|---|---|---|
| Best val accuracy | 68.9% | 67.8% | 63.1% |
| Best val loss | 0.792 | 0.933 | 1.143 |
| Negative gap achieved? | Yes (Ep 2) | No | No |
| Class balance at peak | 0.6pp | 6.0pp | 20.6pp |
| Direction bias | None | Bearish | Bullish |
| VSN diversity (entropy) | 97.6% | 95.3% | 96.7% |

Diagnosis

NAS100 produced the strongest Run 1 model: 68.9% directional accuracy with near-perfect class balance (0.6pp gap), no directional bias, and the only negative generalisation gap in the series. The $200 barrier is the best calibrated of the three indices. All three models share the same top features (dist_ma120, ret_60m, trend_strength) and bottom features (log_spread_us30_us500), validating the feature set and the VSN's ability to discriminate signal from noise across different instruments.

Recommendations for Run 2

| Change | Run 1 | Run 2 | Rationale |
|---|---|---|---|
| Early stopping | None | 3-epoch patience | Val loss never improved past epoch 2 |
| Dropout | 0.15 | 0.25 | Reduce memorisation |
| Weight decay | 0.005 | 0.01 | Stronger regularisation |
| VSN entropy $\lambda$ | 0.001 | 0.002 | Already set |
| Max LR | $3 \times 10^{-4}$ | $1.5 \times 10^{-4}$ | Best results at LR ~$10^{-4}$ |
| Max epochs | 50 | 10 | No improvement after epoch 3 |
| Barrier | $200 | $250–300 | Test wider barrier for HOLD labels |

Charts

NAS100 Run 1 loss curves
NAS100 Run 1: val loss drops below train loss at epoch 2 (negative generalisation gap), the only index to achieve this.
NAS100 Run 1 direction accuracy
Direction accuracy: 68.9% val peak, the highest of all three indices.
NAS100 Run 1 per-class accuracy
Per-class: near-perfect balance at epoch 3 (69.1% UP vs 68.5% DOWN), then oscillation.
NAS100 Run 1 VSN entropy
VSN entropy: highest diversity of all three indices at 97.6% of maximum.

NAS100 — Run 2 (Diagnostic)

Simulated Results — All results in this section are from simulated training and validation on historical data. They do not represent live trading performance. Validation accuracy measures directional prediction on held-out bars (2025-07 to 2026-03) that were not seen during training.

Run 1 vs Run 2 Comparison

| Metric | Run 1 (Ep 3) | Run 2 (Ep 3) |
|---|---|---|
| Val Accuracy | 68.8% | 68.9% (+0.1pp, identical) |
| Val Loss | 0.822 | 0.783 (−5%) |
| Class Gap | 0.6pp | 20.2pp (worse at peak) |
| UP/DOWN Acc | 69.1 / 68.5 | 78.5 / 58.3 (bullish bias) |
| $p_{\text{up}}$ Mean | 0.505 | 0.568 (shifted) |
| VSN Mean Ratio | 4.0x | 1.8x (−55%) |

Epoch 5 Comparison (Best Class Balance)

| Metric | Run 1 (Ep 3) | Run 2 (Ep 5) |
|---|---|---|
| Val Accuracy | 68.8% | 68.3% (−0.5pp) |
| Class Gap | 0.6pp | 0.7pp (near-identical) |

Key Findings

  1. Peak accuracy identical (68.9%) across both runs. NAS100 learns the same signal regardless of LR/entropy.
  2. Val loss improved 5% (0.783 vs 0.822). Better calibration.
  3. Bullish bias at peak epoch (20.2pp gap) because lower LR learns UP before DOWN. This resolves by epoch 5.
  4. VSN concentration halved (4.0x to 1.8x). The entropy lambda change worked.
  5. Run 1's configuration was already near-optimal for NAS100. Run 2 confirms this.

Diagnosis

NAS100 Run 1 was already the strongest model. Run 2 confirms the signal is robust to hyperparameter changes. Recommended deployment: use Run 1 epoch 3 for balanced predictions, or Run 2 epoch 5 for equivalent balance with better-calibrated probabilities.

Charts

NAS100 Run 2 loss curves
NAS100 Run 2: val loss 0.783, a 5% improvement over Run 1.
NAS100 Run 2 direction accuracy
Direction accuracy: 68.9% val peak, identical to Run 1.
NAS100 Run 2 per-class accuracy
Per-class: transient bullish bias at epoch 3 (78.5% UP vs 58.3% DOWN) resolves to 0.7pp gap by epoch 5.
NAS100 Run 2 VSN entropy
VSN concentration halved from 4.0x to 1.8x, confirming the entropy lambda increase worked.

Run 1 → Run 2: Configuration Changes

Based on the Run 1 diagnostics across all three indices, four targeted changes were made for Run 2. Each change addresses a specific finding from Run 1 and is backed by empirical evidence.

Change 1: Learning Rate $3 \times 10^{-4} \rightarrow 1.5 \times 10^{-4}$

Run 1 used a 5-epoch linear warmup from $3 \times 10^{-5}$ to $3 \times 10^{-4}$. The per-epoch LR and corresponding validation accuracy reveal that the optimal LR lies near $1.4 \times 10^{-4}$:

| Epoch | LR | US30 Val Acc | NAS100 Val Acc |
|---|---|---|---|
| 1 | $3.0 \times 10^{-5}$ | 54.7% | 52.1% |
| 2 | $8.4 \times 10^{-5}$ | 66.2% | 68.8% |
| 3 | $1.4 \times 10^{-4}$ | 67.8% | 68.9% |
| 4 | $1.9 \times 10^{-4}$ | 67.7% | 67.3% |
| 5 | $2.5 \times 10^{-4}$ | 65.5% | 66.6% |
| 6 | $3.0 \times 10^{-4}$ | 66.1% | 64.9% |

Once LR exceeded $\sim 1.5 \times 10^{-4}$, validation accuracy declined in both indices. The higher LR drove predictions toward extreme confidence ($p_{\text{up}}$ std rose from 0.17 to 0.44), inflating cross-entropy loss without improving directional signal. Halving the maximum LR to $1.5 \times 10^{-4}$ means the model reaches the empirically optimal LR at the end of warmup rather than overshooting it.
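The warmup-then-decay shape can be sketched at epoch granularity (the actual runs step the schedule per batch, so the per-epoch LRs in the table above differ slightly from this coarse approximation):

```python
import math

def lr_at(epoch: int, max_lr: float = 3e-4, start_lr: float = 3e-5,
          warmup: int = 5, total: int = 50) -> float:
    """Linear warmup from start_lr to max_lr over `warmup` epochs,
    then cosine decay to zero over the remaining epochs."""
    if epoch <= warmup:
        return start_lr + (max_lr - start_lr) * (epoch - 1) / (warmup - 1)
    progress = (epoch - warmup) / (total - warmup)
    return max_lr * 0.5 * (1 + math.cos(math.pi * progress))

print(f"{lr_at(1):.1e}")   # 3.0e-05 (warmup start)
print(f"{lr_at(5):.1e}")   # 3.0e-04 (warmup end)
```

With Run 2's halved `max_lr` of 1.5e-4, the same schedule ends warmup at the empirically optimal rate instead of passing through it.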

Change 2: VSN Entropy $\lambda$ from 0.002 to 0.004

The VSN entropy regulariser penalises concentrated attention weights to prevent the model from ignoring most features. Run 1 used $\lambda = 0.001$. The per-stream concentration ratios (max weight / min weight) reveal that this was insufficient:

| Stream | US30 | US500 | NAS100 |
|---|---|---|---|
| Short | 6.3x | 10.1x | 9.2x |
| Mid | 7.0x | 18.8x | 3.0x |
| Long | 3.1x | 3.3x | 3.2x |
| Slow | 5.7x | 5.7x | 6.1x |

The US500 MID stream had an 18.8x concentration ratio, effectively ignoring most features in that temporal window. At $\lambda = 0.001$, the regularisation was too weak to prevent this collapse. Setting $\lambda = 0.004$ (2x stronger) should keep the max/min ratio below 5x. The entropy loss acts on the softmax attention weights only and does not interfere with the direction loss.
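One common form of this regulariser, consistent with the description above, adds $\lambda$ times the entropy shortfall of the attention weights to the direction loss, so the penalty is zero for uniform attention and grows as attention concentrates. A numpy sketch (illustrative, not the training code):

```python
import numpy as np

def entropy(w: np.ndarray) -> float:
    return float(-(w * np.log(w + 1e-12)).sum())

def total_loss(direction_ce: float, vsn_weights: np.ndarray,
               lam: float = 0.004) -> float:
    """Direction cross-entropy plus an entropy penalty on VSN attention.

    Penalty = lam * (max_entropy - entropy): zero for uniform weights,
    increasing as the softmax concentrates on few features.
    """
    max_entropy = np.log(len(vsn_weights))
    return direction_ce + lam * (max_entropy - entropy(vsn_weights))

uniform = np.full(43, 1 / 43)
peaked = np.full(43, 0.1 / 42)
peaked[0] = 0.9                    # 90% of attention on one feature
print(total_loss(0.70, uniform) < total_loss(0.70, peaked))   # True
```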

Change 3: Feature Pruning — 45 to 43

Two features were removed: log_spread_us30_us500 and log_spread_us30_nas100. Two independent methods confirmed these are noise:

  • Granger causality: F-stat = 0.00 in all three indices (literally zero linear predictive power for 60-minute returns).
  • VSN attention: bottom-ranked in all three indices (weight $\sim 0.010$ vs uniform baseline $0.022$).

These features measure cumulative log price divergence between index pairs, which is dominated by long-term drift and is uninformative for 60-minute directional prediction. The roro_ratio captures the same cross-index relationship more effectively through relative returns.

Other low-Granger features (er60, tod_cos, session_flag) were retained because they showed non-zero VSN attention, suggesting non-linear signal that the Granger test (a linear method) cannot detect.
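The Granger test used here compares a restricted autoregression of the target return on its own lags against an unrestricted one that also includes lags of the candidate feature. A self-contained numpy sketch of the F-statistic on synthetic data (not the study's pipeline; lag count is illustrative):

```python
import numpy as np

def granger_f(y: np.ndarray, x: np.ndarray, lags: int = 3) -> float:
    """F-statistic: do lags of x help predict y beyond y's own lags?

    Restricted OLS: y_t on its own lags. Unrestricted: adds lags of x.
    Linear only — it cannot detect the non-linear signal that kept
    er60, tod_cos, and session_flag in the feature set.
    """
    n = len(y) - lags
    Y = y[lags:]
    ylags = np.column_stack([y[lags - i - 1 : -i - 1] for i in range(lags)])
    xlags = np.column_stack([x[lags - i - 1 : -i - 1] for i in range(lags)])
    Xr = np.hstack([np.ones((n, 1)), ylags])     # restricted design
    Xu = np.hstack([Xr, xlags])                  # unrestricted design
    rss = lambda X: float(((Y - X @ np.linalg.lstsq(X, Y, rcond=None)[0]) ** 2).sum())
    rss_r, rss_u = rss(Xr), rss(Xu)
    return ((rss_r - rss_u) / lags) / (rss_u / (n - Xu.shape[1]))

# Synthetic check: y is driven by lagged x; z is unrelated noise.
rng = np.random.default_rng(0)
x, z, noise = (rng.normal(size=500) for _ in range(3))
y = np.empty(500)
y[0] = noise[0]
for t in range(1, 500):
    y[t] = 0.5 * x[t - 1] + noise[t]
f_signal, f_noise = granger_f(y, x), granger_f(y, z)
print(f_signal > f_noise)   # True: the real driver scores a much larger F
```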

Change 4: US500 Barrier $30 → $90

US500 had the worst class balance in Run 1 (15.5pp gap between UP and DOWN accuracy) despite balanced training labels. The $30 barrier was too tight relative to the index's hourly range, causing the model to overfit to one direction. Applying NAS100's successful barrier-to-range ratio (approximately 1.5 times the average hourly range) to US500's $60 average hourly range yields $90. US30 ($100) and NAS100 ($200) barriers are unchanged — both were already well-calibrated in Run 1.
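The stated rule reproduces the $90 figure directly. A trivial sketch (hypothetical helper; the range value is the $60 quoted above):

```python
def calibrate_barrier(avg_hourly_range: float, ratio: float = 1.5,
                      step: float = 10.0) -> float:
    """Barrier = ratio x average hourly range, rounded to a round step."""
    return round(avg_hourly_range * ratio / step) * step

# US500: ~$60 average hourly range -> $90 barrier.
print(calibrate_barrier(60.0))   # 90.0
```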

What Stayed the Same

Dropout (0.15), weight decay (0.005), embed dim (128), layers (1), and warmup epochs (5) are all unchanged. The overfitting observed in Run 1 is in calibration (overconfident predictions), not capacity. Train accuracy at the best validation epoch was only 71–74%, not 99%, confirming that the model has not exhausted its capacity. The lower learning rate is the correct lever — not stronger regularisation.

Run 2 Configuration Summary

| Parameter | US30 | US500 | NAS100 |
|---|---|---|---|
| Learning rate | $1.5 \times 10^{-4}$ | $1.5 \times 10^{-4}$ | $1.5 \times 10^{-4}$ |
| VSN entropy $\lambda$ | 0.004 | 0.004 | 0.004 |
| Features | 43 | 43 | 43 |
| Barrier | $100 | $90 | $200 |
| Spread | $1.20 | $0.50 | $2.00 |
US30 — Run 2 (Latest)

US30 — Run 2

Simulated Results — All results in this section are from simulated training and validation on historical data. They do not represent live trading performance. Validation accuracy measures directional prediction on held-out bars (2025-07 to 2026-03) that were not seen during training.

US30 Run 2 applies the four configuration changes described above: max LR halved to $1.5 \times 10^{-4}$, VSN entropy $\lambda$ doubled to 0.004, two noise features pruned (45 → 43), and all other hyperparameters unchanged. The goal is to eliminate Run 1's bearish bias and improve class balance without sacrificing directional accuracy.

Run 1 vs Run 2 Comparison

| Metric | Run 1 | Run 2 | Change |
|---|---|---|---|
| Best val accuracy | 67.8% (Ep 3) | 68.4% (Ep 5) | +0.6pp |
| Best val loss | 0.933 | 0.981 | +0.048 |
| Class acc gap at peak | 6.0pp | 1.6pp | −73% |
| UP/DN acc at peak | 64.5 / 70.5 | 69.3 / 67.7 | Near-equal |
| Direction bias | Bearish | None | Eliminated |
| VSN max/min ratio | 3.1x | 2.0x | More distributed |
| VSN MID concentration | 7.0x | 3.1x | Fixed |
| Best epoch | 3 | 5 | Shifted later (lower LR) |

Key Findings

1. Class balance is the headline improvement. The per-class accuracy gap shrank from 6.0pp to 1.6pp. UP accuracy rose from 64.5% to 69.3% while DOWN accuracy eased from 70.5% to 67.7%. The bearish bias from Run 1 is eliminated — $p_{\text{up}}$ mean now centres around 0.49–0.50 instead of drifting to 0.45.

2. Accuracy improved marginally. 68.4% vs 67.8% (+0.6pp). The model finds the same directional signal but distributes it more evenly across classes.

3. Overfitting rate is unchanged. The lower LR delayed the peak by 2 epochs but post-peak degradation is identical (~0.18–0.20 loss/epoch). This confirms overfitting is driven by data diversity (6,600 effective independent samples vs 2M parameters), not learning rate.

4. Optimal LR confirmed at ~$1.5 \times 10^{-4}$. Both runs peaked when the effective LR reached $1.4$–$1.5 \times 10^{-4}$. Run 1 hit this during warmup at epoch 3; Run 2 reached it at end of warmup at epoch 5. The model achieves peak generalisation at this specific LR regardless of schedule.

5. VSN entropy regularisation works without distorting rankings. MID stream concentration dropped from 7.0x to 3.1x. Top features are unchanged (dist_ma120, ret_60m, cross_idx_dispersion). The regularisation redistributed weight without changing relative importance.

6. Feature pruning had minimal impact. Removing 2 noise features (log_spread pair) reduced inputs from 45 to 43, but these were already receiving near-zero VSN attention.

VSN Per-Stream Feature Preferences (Run 2)

| Stream | Ratio | Top 3 |
|---|---|---|
| Short | 2.9x | dist_ma120, trend_strength, abs_dist_ma120 |
| Mid | 3.1x | tod_cos, dist_ma120, ret_120m |
| Long | 2.6x | roro_ratio, cross_idx_dispersion, vix_chg_60m |
| Slow | 2.2x | dist_ma120, ret_60m, skew_240m |

Diagnosis

Strictly better for live deployment: Run 2 achieves higher accuracy (68.4% vs 67.8%), near-perfect class balance (1.6pp vs 6.0pp gap), no directional bias, and healthier VSN diversity (max/min 2.0x vs 3.1x). The slightly higher validation loss (0.981 vs 0.933) reflects less extreme confidence, not worse direction prediction. The optimal checkpoint is epoch 5.

Charts

US30 Run 2 vs Run 1 direction accuracy
US30 Run 2 vs Run 1: direction accuracy. Run 2 peaks 2 epochs later but 0.6pp higher, with much better class balance.
US30 Run 2 per-class accuracy
Per-class accuracy: Run 2 achieves near-equal UP/DOWN (69.3/67.7) vs Run 1's bearish skew (64.5/70.5).
US30 Run 2 VSN entropy
VSN entropy: Run 2 maintains 98.3% of max vs Run 1's 95.3%. Feature concentration reduced across all streams.
US30 Run 2 generalisation gap
Generalisation gap: identical overfitting rate in both runs — lower LR delays but does not prevent memorisation.

7.6 Run 3: Single-Stream Architecture Redesign

Complete — Run 3 implemented four structural changes based on Run 1 and Run 2 findings. Results are a significant regression (55.7% val accuracy). See Sections 7.7-7.9 for failure analysis, root cause diagnosis, and proposed fixes. See Section 7.10 for the 7-stream resolution.

Run 1 and Run 2 established a signal ceiling: approximately 69% for NAS100, 68% for US30, and 62% for US500. Hyperparameter tuning in Run 2 improved class balance and probability calibration but did not push accuracy meaningfully higher. The bottleneck is architectural, not configurational. Run 3 implements four structural changes designed to address the specific limitations identified in the Run 1 and Run 2 diagnostics.

Change 1: Single-Stream Transformer (660 M1 Bars)

The current 4-stream design splits 660 M1 bars into SHORT (60 bars), MID (120 bars), LONG (240 bars), and SLOW (720 M1 bars downsampled to 12 H1 bars). Each stream passes through its own Variable Selection Network, Temporal Convolutional Network, and Transformer encoder before the four outputs are concatenated for the classification heads. Run 3 replaces this with a single stream that processes all 660 M1 bars through one unified pipeline.

The rationale has six components:

  • Full trading day context. 660 M1 bars equals 11 hours, covering one complete US equity trading session (pre-market through close). No information is discarded or downsampled.
  • Uniform resolution. The current SLOW stream downsamples M1 to H1 bars, creating a resolution boundary that the TCN kernel cannot bridge cleanly. A single M1 stream preserves sequence continuity throughout.
  • Transformers do not need stream splitting. The 4-stream design was an LSTM-era workaround for limited context windows. Transformers with self-attention can directly attend from bar 5 to bar 630 without any architectural intermediary.
  • Current streams are redundant. SHORT (bars 601 to 660) is a strict subset of MID (bars 541 to 660), which is a strict subset of LONG (bars 421 to 660). The model processes overlapping data through separate parameter sets, wasting capacity.
  • Cross-scale interactions are impossible in the current design. The four streams only merge at the final concatenation layer. A pattern visible at the 30-minute scale cannot interact with a pattern at the 4-hour scale until after all temporal processing is complete.
  • SLOW stream adds minimal unique signal. Across the Run 1 and Run 2 VSN analyses, 3 of SLOW's top 5 features overlap with other streams' top 10 for US30 and US500. For NAS100, all 5 overlap. The SLOW stream's unique contribution is negligible.
Old (4 streams): 660 M1 bars split into SHORT (60), MID (120), LONG (240), and SLOW (12 H1); each stream runs its own VSN + TCN + Transformer before the four outputs are concatenated into the heads.
New (single stream): 660 M1 bars → VSN ("which features matter now?") → TCN, kernel 15 ("local 15-min patterns") → Transformer, 2 layers, 8 heads ("full-day attention") → TAP ("which bars matter most?") → heads.

The parameter and compute tradeoffs are shown below.

| Design | Attention cost | Parameters |
|---|---|---|
| 4 streams (current) | 75,744 | ~2.0M |
| Single stream (660) | 435,600 | ~0.7M |

The single-stream design increases attention cost by approximately 5.7x (435,600 vs 75,744) because the Transformer must attend across all 660 positions rather than four shorter subsequences. However, it reduces total parameters by 65% (from ~2.0M to ~0.7M) because the four redundant VSN, TCN, and Transformer modules are replaced by one of each. The net effect is higher compute per forward pass but substantially less memorisation capacity, which directly addresses the overfitting observed in Runs 1 and 2.
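The 5.7x attention figure can be reproduced directly from the stream lengths (a quick sanity check, taking self-attention cost as $T^2$ per stream, summed across streams):

```python
# Self-attention cost grows with T^2 per stream; total cost sums across streams.
four_stream_T = [60, 120, 240, 12]     # SHORT, MID, LONG, SLOW (12 H1 bars)
single_stream_T = [660]                # one unified M1 stream

cost_4 = sum(t * t for t in four_stream_T)
cost_1 = sum(t * t for t in single_stream_T)
print(cost_4, cost_1)                        # 75744 435600
print(f"{cost_1 / cost_4:.2f}x")             # 5.75x, quoted as ~5.7x above
print(f"{1 - 0.7 / 2.0:.0%} fewer params")   # 65% fewer params
```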

Change 2: Multi-Horizon Targets (30m / 60m / 120m)

Runs 1 and 2 train on a single target: the 60-minute double-barrier label. Run 3 trains on three horizons simultaneously. The 60-minute horizon remains primary (loss weight 1.0). The 30-minute and 120-minute horizons are auxiliary (loss weight 0.3 each). All three heads share the same backbone (VSN, TCN, Transformer, TAP); only the final classification layers are horizon-specific.

shared_embedding feeds three heads: head_30m (auxiliary, weight 0.3), head_60m (primary, weight 1.0), head_120m (auxiliary, weight 0.3).

The purpose is structural regularisation. The shared backbone must learn feature representations that predict direction at 30, 60, and 120 minutes simultaneously. Features that predict only the 60-minute horizon (but not the others) are more likely to reflect noise or overfitting than genuine signal. Multi-task learning forces the model to learn more general temporal patterns. This principle was established by Collobert and Weston (2008), who showed that auxiliary tasks improve primary-task generalisation in NLP, and it applies directly here: the auxiliary horizons act as a form of implicit regularisation that is more informative than dropout or weight decay because it encodes domain knowledge about temporal consistency.
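The fixed-weight objective can be sketched as a plain weighted sum (framework-agnostic; in the actual training loop each term would be that head's cross-entropy on its own barrier label, and the per-head values below are hypothetical):

```python
HORIZON_WEIGHTS = {"30m": 0.3, "60m": 1.0, "120m": 0.3}   # aux / primary / aux

def multi_horizon_loss(head_losses):
    """Combine per-horizon head losses with the fixed Run 3 weights."""
    return sum(HORIZON_WEIGHTS[h] * loss for h, loss in head_losses.items())

# Hypothetical per-head values: the 60m term dominates only while it is large.
total = multi_horizon_loss({"30m": 1.2, "60m": 0.7, "120m": 1.3})
print(round(total, 2))   # 1.45
```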

Change 3: Cross-Asset Features at Lag 15 (43 to 45 features)

Run 1 and Run 2 use DXY and USDJPY returns at lag 60 (the 60-minute lagged return). Granger causality testing reveals that DXY also has significant predictive power at lag 15, but zero predictive power at lags 1 through 5. The lag-15 and lag-60 returns capture different phenomena: the lag-15 return measures the recent 15-minute dollar move, while the lag-60 return measures the hour-long dollar trend. Run 3 adds dxy_ret_15m and usdjpy_ret_15m as two additional features, bringing the total from 43 to 45.

| Lag (min) | DXY F-stat | Significant? |
|---|---|---|
| 1 | 3.4 | No |
| 5 | 1.1 | No |
| 15 | 6.4 | Yes |
| 30 | 6.7 | Yes |
| 60 | 22.1 | Yes |

The Granger test results confirm that the dollar index has no short-term predictive power for US equity indices at the 1-minute or 5-minute horizon, but becomes significant at 15 minutes and strengthens monotonically out to 60 minutes. The lag-15 feature is not redundant with lag-60: it captures faster-moving dollar dynamics (e.g., intraday Fed commentary, Treasury auction results) that dissipate before the 60-minute window.
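The F-statistics above come from standard Granger tests. A minimal numpy reconstruction of the test (an illustrative restricted-vs-unrestricted lagged OLS, not the project's actual pipeline) behaves as expected on synthetic data with a pure lag-15 relationship:

```python
import numpy as np

def granger_f(y, x, lag):
    """F-statistic: do lags 1..lag of x improve an AR(lag) model of y?
    (Minimal illustrative reconstruction, not the project's test code.)"""
    n = len(y)
    Y = y[lag:]
    ylags = np.column_stack([y[lag - i : n - i] for i in range(1, lag + 1)])
    xlags = np.column_stack([x[lag - i : n - i] for i in range(1, lag + 1)])
    const = np.ones((n - lag, 1))
    Xr = np.hstack([const, ylags])           # restricted: own lags only
    Xu = np.hstack([const, ylags, xlags])    # unrestricted: + cross-asset lags
    ssr = lambda X: float(np.sum((Y - X @ np.linalg.lstsq(X, Y, rcond=None)[0]) ** 2))
    df_num, df_den = lag, (n - lag) - Xu.shape[1]
    return ((ssr(Xr) - ssr(Xu)) / df_num) / (ssr(Xu) / df_den)

# Synthetic check: y is driven by x at lag 15 only.
rng = np.random.default_rng(0)
n = 1500
x = rng.standard_normal(n)
y = np.zeros(n)
y[15:] = 0.8 * x[:-15] + 0.3 * rng.standard_normal(n - 15)
print(granger_f(y, x, 5), granger_f(y, x, 15))   # small at lag 5, very large at lag 15
```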

Change 4: Two Transformer Layers

Runs 1 and 2 use a single Transformer encoder layer. The train-validation accuracy gap at best epoch shows unused capacity: NAS100 has only a 2.4pp gap, US30 8.2pp, and US500 12.2pp. A second Transformer layer learns second-order temporal interactions: patterns of patterns. Where the first layer identifies individual temporal features (e.g., a momentum reversal at bar 400, a volatility spike at bar 580), the second layer can learn relationships between those features (e.g., momentum reversals that follow volatility spikes have different directional implications than isolated momentum reversals).

The cost is approximately 197K additional parameters. Combined with the single-stream redesign, the total model size is approximately 0.9M parameters, still less than half the current 2.0M. The additional compute is roughly 2x in the Transformer portion of the forward pass, which is modest given that the TCN and VSN components (unchanged) account for the majority of wall-clock time.

Run 3 Summary

| Change | Parameters | Compute | Expected Benefit |
|---|---|---|---|
| Single-stream 660 bars | -1.3M | +5.7x attention, -65% params | Cross-scale attention, less memorisation |
| Multi-horizon targets | +260 | +60% labels | Structural regularisation |
| Lag-15 cross-asset features | +4.7% input | Negligible | Granger-validated signal |
| Two Transformer layers | +197K | +100% Transformer | Higher-order temporal interactions |

Net result: approximately 0.9M parameters (down from 2.0M), full trading-day context in a single stream, and multi-horizon regularisation. The expected benefit is not higher peak accuracy on a single run, but better generalisation and more stable out-of-sample performance due to reduced memorisation capacity and structurally enforced temporal consistency.

Run 3 Pipeline

660 M1 bars × 45 features → VSN (single; "which features matter now?") → TCN (kernel 15; "local 15-min patterns") → Transformer (2 layers, 8 heads; "full-day cross-scale attention") → TAP ("which bars matter most?") → head_30m (aux, weight 0.3) / head_60m (primary, weight 1.0) / head_120m (aux, weight 0.3)

7.7 Run 3a: Failure Analysis

Run 3 regressed from 68.4% to 55.7% val accuracy. Root cause: auxiliary loss (30m+120m targets) dominated 71% of the gradient by epoch 23. The 60m direction signal was diluted. Fix: dynamic auxiliary scaling capping non-direction loss at 20% of the primary loss.

Run 3 is a negative result. The four architectural changes described in Section 7.6 were implemented and trained on US30. Rather than improving on the Run 2 ceiling of 68.4%, the model regressed to 55.7% validation accuracy, barely above random. This section documents the regression, the five diagnostic investigations performed, the root cause identified, and the proposed fix. Negative results are valuable when they isolate the failure mechanism precisely enough to guide the next iteration.

See the Cross-Index Summary table in Section 7.5 for the full comparison across all runs.

The regression is severe across every metric. Validation accuracy dropped 12.7 percentage points from Run 2. Validation loss nearly doubled. The class gap widened from 1.6pp (near-perfect balance in Run 2) to 16.7pp, indicating the model reverted to a strong directional bias. Five diagnostic investigations were performed to isolate the cause.

Diagnostic 1: Training Accuracy Comparison

Training accuracy comparison across Runs 1, 2, and 3. Run 3 learns slower on the training set itself, ruling out pure overfitting as the explanation.

Run 3 learns slower on the training data (72.9% vs 78.9% at epoch 6) and generalises worse (55.4% vs 67.2%). This rules out the standard overfitting narrative where the model memorises training data at the expense of validation. Run 3 is failing to learn the training signal in the first place. Something in the architecture is preventing the model from fitting the 60-minute direction target.

Diagnostic 2: Generalisation Gap

Generalisation gap (train accuracy minus validation accuracy) over training. Run 3's gap widens fastest despite lower absolute training accuracy.

The generalisation gap grows much faster in Run 3: 29.7 percentage points at epoch 10 versus 21.4pp for Run 2. Combined with Diagnostic 1, this means Run 3 is simultaneously learning less on training data and generalising worse. The model is wasting capacity on something other than the primary 60-minute direction signal.

Diagnostic 3: Loss Component Breakdown (Root Cause)

Loss component breakdown for Run 3. The non-direction (auxiliary) loss increasingly dominates the total gradient as training progresses.
Auxiliary loss as a percentage of total loss over training. By epoch 23, 71% of the gradient comes from non-direction targets.

This is the root cause. By epoch 12, 65% of the gradient comes from non-direction losses (the auxiliary 30-minute and 120-minute target heads). By epoch 23, this rises to 71%. The model optimises for auxiliary targets, not the 60-minute direction that is actually traded.

The mechanism is straightforward. The 60-minute direction loss (primary) drops faster than the auxiliary losses because the 60-minute horizon is the easiest to fit (it has the most training signal per label). As the primary loss shrinks, the auxiliary losses, which carry a fixed weight of 0.3 each, occupy a growing share of the total gradient. The backbone parameters are updated primarily to improve 30-minute and 120-minute predictions, which are not aligned with the 60-minute direction the model is evaluated on.

| Epoch | Total Loss | Direction (60m) | Non-direction (30m+120m) | % Non-direction |
|---|---|---|---|---|
| 1 | 1.443 | 0.695 | 0.748 | 51.8% |
| 6 | 1.024 | 0.462 | 0.562 | 54.9% |
| 12 | 0.573 | 0.199 | 0.374 | 65.2% |
| 23 | 0.427 | 0.124 | 0.303 | 71.0% |

The loss breakdown makes the failure mechanism explicit. At epoch 1, the split is roughly even (51.8% non-direction). By epoch 12, the primary direction loss has dropped to 0.199 while the auxiliary losses remain at 0.374, giving non-direction losses a 65.2% share of the gradient. By epoch 23, the imbalance reaches 71%. The shared backbone is being trained predominantly to predict 30-minute and 120-minute horizons, diluting the 60-minute signal that determines validation accuracy.
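The shares follow directly from the loss components and can be rechecked in a few lines (note that 0.374/0.573 is 65.3% to one decimal; the logged 65.2% appears to truncate):

```python
# Non-direction share of total loss, per epoch (values from the Run 3 log).
epochs = {1: (1.443, 0.695, 0.748), 6: (1.024, 0.462, 0.562),
          12: (0.573, 0.199, 0.374), 23: (0.427, 0.124, 0.303)}
for e, (total, direction, nondir) in epochs.items():
    assert abs(total - (direction + nondir)) < 1e-9   # components sum to total
    print(e, f"{100 * nondir / total:.1f}%")          # 51.8%, 54.9%, 65.3%, 71.0%
```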

Diagnostic 4: VSN Feature Selection

The Variable Selection Network was examined to determine whether it had been corrupted by the architectural changes. It had not. The top feature remains dist_ma120, consistent with Runs 1 and 2. The overall ranking of the top 10 features is stable. The two new lag-15 cross-asset features (dxy_ret_15m and usdjpy_ret_15m) rank in the bottom 10, indicating minimal additional signal but also no disruption. The VSN is not the source of the regression.

Diagnostic 5: Confounded Changes

Run 3 made four simultaneous changes (single-stream architecture, multi-horizon targets, lag-15 features, two Transformer layers). The loss component breakdown in Diagnostic 3 confirms that auxiliary loss dominance is the root cause of the regression. However, because all four changes were applied together, the three remaining changes (single-stream, lag-15 features, two Transformer layers) remain possible contributors that require individual ablation to clear. The auxiliary loss fix is necessary; whether it is sufficient will be determined by Run 3b.

Why US500 and NAS100 Were Not Run

All three indices showed identical dynamics in Runs 1 and 2: the same overfitting timing, the same VSN feature rankings, the same learning rate sensitivity. The regression observed in Run 3 is architecture-level, not data-level. The auxiliary loss dominance mechanism applies equally to all three indices because it stems from the fixed 0.3 weight assigned to each auxiliary head, which is independent of the underlying data. Running US500 and NAS100 with the same broken loss weighting would produce the same failure mode and waste compute without generating new information.

Learning Rate Schedules

Learning rate schedules across Runs 1, 2, and 3. Run 3 uses the same warmup + cosine schedule as Run 2.
Validation accuracy across all three runs. Run 3 regresses sharply from the 68% ceiling established by Runs 1 and 2.

Proposed Fix: Dynamic Auxiliary Loss Scaling

The fix replaces the fixed auxiliary weight of 0.3 with a dynamic cap: auxiliary loss is scaled so that the total non-direction loss never exceeds 20% of the primary direction loss. In early training, the auxiliary losses are naturally within this budget because all three losses are large and roughly comparable. The model benefits from the regularisation effect of multi-task learning. In late training, as the primary loss drops faster, the auxiliary losses would normally dominate (as observed in Run 3). The dynamic cap prevents this by scaling down the auxiliary gradients, ensuring that the backbone remains dominated by the 60-minute direction signal throughout training.

Concretely, at each training step the total auxiliary loss (30m head loss times 0.3 plus 120m head loss times 0.3) is computed. If this total exceeds 0.2 times the primary 60m direction loss, a scaling factor is applied to bring it back to the 20% cap. The scaling is applied to the loss values before backpropagation, so the gradient magnitudes respect the cap automatically. The 20% threshold was chosen as a conservative starting point: enough auxiliary signal to provide regularisation, but low enough to prevent the gradient takeover observed in Run 3.
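A minimal sketch of the cap (pure Python; in a PyTorch loop the scale factor would be computed from detached loss values so it behaves as a constant under backprop):

```python
def scale_aux(primary_loss, aux_losses, aux_weight=0.3, max_ratio=0.2):
    """Cap the weighted auxiliary total at max_ratio * primary (the 20% budget)."""
    aux_total = aux_weight * sum(aux_losses)
    cap = max_ratio * primary_loss
    if aux_total <= cap:
        return primary_loss + aux_total, 1.0   # within budget: no scaling
    scale = cap / aux_total                    # shrink to exactly the cap
    return primary_loss + scale * aux_total, scale

# Late-training shape from Run 3: primary 0.124, weighted aux 0.303 (71% share).
total, s = scale_aux(0.124, [0.50, 0.51])
print(round(total, 4), round(s, 3))   # 0.1488 0.082
```

With the cap active, the auxiliary contribution is pinned at exactly 20% of the primary loss regardless of how far the primary loss falls.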

Complete — Run 3b confirmed that dynamic auxiliary scaling fixes the loss balance problem (43% non-direction vs 71% in Run 3a) but does not recover accuracy. The failure is architectural, not loss-related. See Sections 7.8-7.9.

7.8 Run 3b: Dynamic Auxiliary Scaling

Run 3b confirms the single-stream architecture fails due to insufficient capacity (562K vs 1,451K params), not loss balance. The 4-stream design is more parameter-efficient within the 18GB VRAM budget. Dynamic auxiliary scaling is validated and retained.

Run 3b applies the dynamic auxiliary scaling fix proposed in Section 7.7. The non-direction loss is capped at 20% of the primary 60-minute direction loss at each training step. The fix worked exactly as designed: auxiliary losses stayed at 43% of the total gradient, down from 71% in Run 3a. But validation accuracy was 55.4%, nearly identical to Run 3a's 55.7%. The problem is not the loss function.

See the Cross-Index Summary table in Section 7.5 for the full comparison across all runs.

Dynamic scaling kept the gradient balanced but did not recover accuracy. The 0.3pp difference between Run 3a and Run 3b is within noise. Both single-stream runs are 12-13pp below Run 2. The root cause is the single-stream design itself: it has 2.6x fewer parameters and a 4x representation bottleneck.

Parameter Breakdown

| Component | Run 2 (4-stream) | Run 3b (1-stream) |
|---|---|---|
| VSN | 4 × 16.7K = 66.9K | 1 × 17.1K |
| TCN | 4 × 122.9K = 491.8K | 1 × 122.9K |
| Transformer | 4 × 198.3K = 793.1K | 1 × 396.5K |
| Total | 1,451K | 562K |

Representation Bottleneck

Run 2 concatenates four 128-dim embeddings into a 512-dim vector before the classification heads. Run 3b compresses everything into one 128-dim vector. That is a 4x information bottleneck. The temporal structure that Run 2 preserves across four separate streams (SHORT, MID, LONG, SLOW) is lost when forced through a single 128-dim representation.

The params-per-position ratio makes the capacity gap concrete. Run 3b has only 601 params per position (660 positions, 396K transformer params). Run 2's SHORT stream has 3,305 params per position (60 positions, 198K params). With 660 positions and only 396K transformer parameters, the attention mechanism dilutes rather than enriches. Each position gets too little dedicated capacity to learn meaningful temporal patterns.
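The dilution claim is simple arithmetic (transformer parameters divided by attention positions, using the values from the breakdown above):

```python
# Dedicated transformer capacity per attention position.
params_per_pos_run3b = 396_500 / 660    # single stream, 660 positions
params_per_pos_short = 198_300 / 60     # Run 2 SHORT stream, 60 positions
print(round(params_per_pos_run3b), round(params_per_pos_short))   # 601 3305
```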

VRAM Prevents Scaling Up

Matching Run 2's 1.45M params in single-stream would need EMBED=256 with 3 layers, estimated at 32GB VRAM. That barely fits an A100 and exceeds our 18GB budget. The 4-stream design is actually more VRAM-efficient because each stream has lower $T^2$ cost in attention. Four streams of 60, 120, 240, and 12 positions cost far less than one stream of 660 positions.

Longer sequences do not automatically help Transformers. That claim assumes sufficient model capacity. NLP Transformers that benefit from long context have hundreds of millions of parameters. Ours has 562K. At that scale, the quadratic attention cost of long sequences is a liability, not an advantage.

What Is Retained for Run 4

Dynamic auxiliary scaling is validated and retained. It kept auxiliary losses at 43% (vs 71% in Run 3a), confirming the gradient balance mechanism works as designed. The VSN entropy penalty (λ = 0.004) is also retained, validated across both Run 2 and Run 3b.

What Is Reverted for Run 4

The single-stream architecture reverts to 4-stream. Two Transformer layers revert to one. Eight attention heads revert to four. The two lag-15 cross-asset features (dxy_ret_15m, usdjpy_ret_15m) are removed as the VSN ranked them in the bottom 10 with no measurable signal.

7.9 Run 3c: Scaled Single-Stream and Position-Agnostic VSN

Testing the Capacity Hypothesis

Before reverting to 4-stream, we ran one final test. The Run 3a/3b failure was diagnosed as a parameter and representation bottleneck (562K params, 128-dim embedding), not necessarily an inherent flaw of the single-stream design. Run 3c scaled the single-stream model to 4,155K params (7.4x Run 3b, 2.9x Run 2) to determine whether capacity alone explains the failure.

| Parameter | Run 3b | Run 3c | Reasoning |
|---|---|---|---|
| EMBED_DIM | 128 | 320 | 2.5x increase eliminates 128-dim bottleneck |
| LAYERS | 2 | 3 | More depth for 660 positions |
| NHEAD | 8 | 8 | Unchanged (head_dim = 40) |
| BATCH_SIZE | 512 | 192 | Reduced from initial 384 after OOM at 47GB; 192 estimated at ~22.5GB |
| SEQ_LEN | 660 | 660 | Unchanged |
| AUX_MAX_RATIO | 0.20 | 0.20 | Dynamic scaling retained |
| LEARNING_RATE | 1.5e-4 | 1.5e-4 | Kept unchanged; noisier gradients from smaller batch may help regularise |

| Metric | Run 2 (4-stream) | Run 3b (1-stream) | Run 3c (scaled) |
|---|---|---|---|
| Total params | 1,451K | 562K | 4,155K |
| Representation dim | 512 (4×128) | 128 | 320 |
| Params/position | 826–16,523 | 601 | 6,295 |
| VRAM | 18 GB | 18 GB | ~22.5 GB |

Result: 55.3% validation accuracy. Identical to Run 3b. Scaling 7.4x made zero difference.

| Epoch | Train Acc | Val Acc | Train Loss | Val Loss |
|---|---|---|---|---|
| 1 | 59.4% | 54.1% | 1.133 | 1.309 |
| 3 | 72.1% | 55.1% | 0.835 | 2.140 |
| 8 (best) | 90.0% | 55.3% | 0.225 | 2.964 |
| 10 | 91.2% | 55.1% | 0.184 | 3.171 |

See the Cross-Index Summary table in Section 7.5 for the full comparison across all runs.

Root Cause: Position-Agnostic VSN

The VSN computes feature weights as softmax(gate_net(features)) at each position. The gate network sees only feature values, with no position information. It does not know whether it is processing position 50 (10 hours ago) or position 650 (10 minutes ago).

In the 4-stream design, each stream's VSN specialises. The SHORT stream focuses on price structure (dist_ma120, trend_strength). The LONG stream focuses on macro context (roro_ratio, VIX, cross-index dispersion). SHORT and LONG have zero top-5 overlap. The single-stream VSN must pick one weight for roro_ratio across all 660 positions. But roro_ratio is informative at LONG timescales and uninformative at SHORT. The VSN picks a compromised average that works for neither.

| Feature | 4-stream avg | Single-stream | Difference |
|---|---|---|---|
| dist_ma120 | 0.0334 | 0.0332 | -0.0002 |
| trend_strength | 0.0256 | 0.0208 | -0.0048 |
| tod_cos | 0.0259 | 0.0177 | -0.0082 |

Correlation between 4-stream average and single-stream weights: 0.651 (would be 0.95+ if equivalent).

Why more parameters cannot fix this: the VSN is the first layer. If it suppresses roro_ratio at position 650 (noise there), the downstream Transformer never sees roro_ratio at position 50 (signal there). No amount of Transformer capacity recovers information the VSN already discarded.

Run 3c confirms the single-stream failure is structural, not capacity-related. The position-agnostic VSN cannot assign different feature weights to different timescales within a single sequence. The 4-stream design solves this by giving each timescale its own VSN. Reverting to 4-stream for Run 4 with dynamic auxiliary scaling retained.

Decision: Revert to 4-Stream for Run 4

Retained from Run 3 series: dynamic auxiliary scaling, multi-horizon targets, VSN entropy penalty (λ = 0.004). Removed: single-stream architecture, 3 Transformer layers, 8 attention heads, E=320, B=192, lag-15 features.

7.10 Run 3d: 7-Stream Architecture

Expanded Multi-Stream Design

The Run 3 series proved two things: (1) the multi-stream VSN specialisation is essential, and (2) each stream's VSN learns genuinely distinct feature weightings. Run 3d builds on this by asking: if 4 specialised streams give 68.4%, can more streams give more?

A gap analysis of the current 4-stream design identified three coverage holes:

  1. Below SHORT (nothing under 1 hour): Granger testing showed DXY strongest at lag 15, not lag 60. No stream captures fast FX lead-lag.
  2. Between LONG and SLOW (4h to 12h): The US equity regular session is 6.5 hours. No stream aligns to this natural rhythm.
  3. Beyond SLOW (multi-day): Features like tsmom_self_21d compress 21 days into a single number. A weekly stream preserves the shape.

The 7-stream design fills each gap with a dedicated stream:

| Stream | Raw M1 bars | Resampled | Effective T | What it captures |
|---|---|---|---|---|
| MICRO (NEW) | 30 | M1 | 30 | Last 30 min, fast FX lead-lag |
| SHORT | 60 | M1 | 60 | Last 1 hour, price structure |
| MID | 120 | M1 | 120 | Last 2 hours, medium momentum |
| LONG | 240 | M1 | 240 | Last 4 hours, regime context |
| SESSION (NEW) | 390 | M5 | 78 | Last 6.5 hours, full regular session |
| SLOW | 720 | H1 | 12 | Last 12 hours, daily macro |
| WEEKLY (NEW) | 3600 | H4 | 15 | Last ~1 week, multi-day shape |

The two resampled streams (SESSION at M5, WEEKLY at H4) add minimal attention cost because their effective sequence lengths are short (78 and 15). The cost analysis:

| Metric | 4-stream (Run 2) | 7-stream (Run 3d) | Change |
|---|---|---|---|
| Total T² | 75,744 | 82,953 | +9.5% |
| Total params | ~1.45M | ~2.53M | +74% |
| Representation dim | 512 (4×128) | 896 (7×128) | +75% |
| VRAM | ~18 GB | ~19 GB | +1 GB |

Total T-squared only increases 9.5%. The representation dimension goes from 512 to 896, giving prediction heads 75% more information.
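The +9.5% figure is just the ratio of summed squared effective sequence lengths, which can be rechecked from the stream table:

```python
streams_4 = {"SHORT": 60, "MID": 120, "LONG": 240, "SLOW": 12}
streams_7 = {"MICRO": 30, "SHORT": 60, "MID": 120, "LONG": 240,
             "SESSION": 78, "SLOW": 12, "WEEKLY": 15}

def total_t2(streams):
    """Summed per-stream attention cost, T^2 per stream."""
    return sum(t * t for t in streams.values())

print(total_t2(streams_4), total_t2(streams_7))   # 75744 82953
print(f"+{100 * (total_t2(streams_7) / total_t2(streams_4) - 1):.1f}%")   # +9.5%
```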

Expected VSN specialisation for each stream:

| Stream | Expected VSN focus |
|---|---|
| MICRO | dxy_ret_60m, usdjpy_ret_60m, ret_60m (fast FX) |
| SHORT | dist_ma120, trend_strength (confirmed Run 2) |
| MID | ret_120m, dist_ma_290 (confirmed Run 2) |
| LONG | roro_ratio, cross_idx_dispersion, VIX (confirmed Run 2) |
| SESSION | vol_session_ratio, ibs, gk_vol_21d (session regime) |
| SLOW | dist_ma120, skew_240m (confirmed Run 2) |
| WEEKLY | tsmom_self_21d, kurt_240m, channel_width (multi-day shape) |

Note: if a new stream's top-5 matches an existing stream's, it is redundant and will be removed.

Run 3d configuration vs Run 3c:

| Parameter | Run 3c | Run 3d |
|---|---|---|
| Architecture | 1-stream, E=320, 3L | 7-stream, E=128, 1L |
| Streams | 1 × 660 M1 | MICRO(30) + SHORT(60) + MID(120) + LONG(240) + SESSION(78 M5) + SLOW(12 H1) + WEEKLY(15 H4) |
| LAYERS | 3 | 1 |
| NHEAD | 8 | 4 |
| EMBED_DIM | 320 | 128 |
| BATCH_SIZE | 192 | 512 (reverted; 7-stream uses ~19GB) |
| LEARNING_RATE | 1.5e-4 | 1.5e-4 (unchanged) |
| Features | 45 (incl. dxy_ret_15m, usdjpy_ret_15m) | 43 (15m features removed, no signal) |
| USE_SLOW_STREAM | False | True |
| AUX_MAX_RATIO | 0.20 | 0.20 (dynamic scaling retained) |
| LAMBDA_VSN_ENTROPY | 0.004 | 0.004 |
| Params | 4,155K | ~2,530K |

Run 3d Results

70.5% peak validation accuracy at epoch 5. New best, +2.1pp over Run 2.

| Epoch | Train Acc | Val Acc | Val Loss | UP Acc | DOWN Acc |
|---|---|---|---|---|---|
| 1 | 62.8% | 67.9% | 1.048 | 66.9% | 68.6% |
| 2 | 70.5% | 70.0% | 1.012 | 56.7% | 80.9% |
| 5 (best) | 75.7% | 70.5% | 1.029 | 67.9% | 72.6% |
| 10 | 87.6% | 66.2% | 2.001 | 66.8% | 65.6% |
| 25 | 92.8% | 66.2% | 2.762 | 63.1% | 68.7% |

See the Cross-Index Summary table in Section 7.5 for the full comparison across all runs.

Four key observations:

  1. Epoch 1 negative generalisation gap (val loss 1.048 < train loss 1.063). The 7-stream inductive bias suits the data structure before significant training.
  2. Fast learning: 70.0% at epoch 2, vs Run 2 needing 5 epochs for its (lower) 68.4%.
  3. Slower degradation: val acc at epoch 25 is 66.2% vs Run 2's 63.6% at epoch 20.
  4. Class gap 4.7pp (bearish bias), wider than Run 2's 1.6pp but reflects label distribution (45.2%/54.8%).

VSN Specialisation Analysis

The key validation for the 7-stream hypothesis: does each stream learn distinct feature weightings, or do the new streams duplicate existing ones? Per-stream top-5 features by VSN attention weight:

| Stream | Top 5 (bold = unique to this stream) |
|---|---|
| MICRO | dist_ma120, abs_dist_ma120, trend_strength, dxy_ret_60m, dist_ma_290 |
| SHORT | dist_ma120, ret_60m, vol_of_vol_60, dist_ma_290, momentum_regime |
| MID | vix_chg_60m, cross_idx_dispersion, ret_60m, momentum_regime, dist_ma120 |
| LONG | roro_ratio, vix_chg_60m, cat_ret_60m, tod_sin, ret_60m |
| SESSION | dist_ma_290, vix_chg_60m, ret_60m, momentum_regime, tsmom_idx2_21d |
| SLOW | ret_60m, dist_ma120, trend_strength, btcusd_ret_60m, cross_idx_dispersion |
| WEEKLY | dxy_corr_30, brent_ret_60m, cat_ret_60m, vix_chg_60m, msft_ret_60m |

Functional roles:

  • MICRO + SHORT: price structure (what is price doing now?)
  • MID: cross-market confirmation
  • LONG: macro regime (roro_ratio, tod_sin)
  • SESSION: session momentum (tsmom_idx2_21d)
  • SLOW: crypto/safe-haven (btcusd_ret_60m)
  • WEEKLY: external drivers (dxy_corr_30, brent, msft)

Pairwise overlap: MICRO vs LONG = 0, WEEKLY vs MICRO/SHORT/SLOW = 0. Every new stream adds distinct information.
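The zero-overlap claims can be checked mechanically against the per-stream top-5 lists above:

```python
# Per-stream top-5 VSN features, transcribed from the Run 3d analysis.
top5 = {
    "MICRO":  ["dist_ma120", "abs_dist_ma120", "trend_strength", "dxy_ret_60m", "dist_ma_290"],
    "SHORT":  ["dist_ma120", "ret_60m", "vol_of_vol_60", "dist_ma_290", "momentum_regime"],
    "LONG":   ["roro_ratio", "vix_chg_60m", "cat_ret_60m", "tod_sin", "ret_60m"],
    "SLOW":   ["ret_60m", "dist_ma120", "trend_strength", "btcusd_ret_60m", "cross_idx_dispersion"],
    "WEEKLY": ["dxy_corr_30", "brent_ret_60m", "cat_ret_60m", "vix_chg_60m", "msft_ret_60m"],
}
overlap = lambda a, b: len(set(top5[a]) & set(top5[b]))
print(overlap("MICRO", "LONG"))                                    # 0
print([overlap("WEEKLY", s) for s in ("MICRO", "SHORT", "SLOW")])  # [0, 0, 0]
```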

US30 Run 3d: validation accuracy across epochs. Peak 70.5% at epoch 5.
US30 Run 3d: training and validation loss curves.
US30 Run 3d: per-class accuracy. 4.7pp bearish bias at peak.
US30 Run 3d: VSN feature attention heatmap across all 7 streams.
US30 Run 3d: prediction statistics over training.
US30 Run 3d: generalisation gap showing slower degradation than Run 2.

Run 3d achieves 70.5% val accuracy, the best result across all runs (+2.1pp over Run 2). The 7-stream architecture validates the multi-scale VSN specialisation hypothesis: each stream learned distinct feature weightings, with the 3 new streams adding genuinely unique information. The negative generalisation gap at epoch 1 confirms the architecture's inductive bias suits this problem structure.

US500 Run 3d Results

68.1% peak val accuracy at epoch 2. +6.1pp over Run 2's 62.0%. Largest improvement of any index.

| Epoch | Val Acc | UP/DOWN Acc | Train Acc |
|---|---|---|---|
| 1 | 66.6% | 69.9 / 63.1 | 63.2% |
| 2 (best) | 68.1% | 77.0 / 58.7 | 70.3% |
| 3 | 67.8% | 79.9 / 55.1 | 69.0% |
| 5 | 65.5% | 73.3 / 57.3 | 65.5% |

VSN new stream uniqueness: 8/15 unique features (highest of all three indices). The SESSION stream found 3 unique features (brent_ret_60m, tsmom_idx3_21d, momentum_regime). US500's broad sectoral diversity creates timescale-dependent relationships the 4-stream design could not capture.

US500 Run 3d: validation accuracy across epochs. Peak 68.1% at epoch 2.
US500 Run 3d: training and validation loss curves.
US500 Run 3d: per-class accuracy. 18.3pp bullish bias at peak.
US500 Run 3d: VSN feature attention heatmap across all 7 streams.
US500 Run 3d: prediction statistics over training.
US500 Run 3d: VSN stream detail heatmap.

NAS100 Run 3d Results

68.7% peak val accuracy at epoch 2. -0.2pp vs Run 2's 68.9%. The 7-stream design did NOT improve NAS100.

| Epoch | Val Acc | UP/DOWN Acc | Train Acc |
|---|---|---|---|
| 1 | 66.4% | 70.9 / 61.3 | 62.4% |
| 2 (best) | 68.7% | 74.1 / 62.9 | 68.7% |
| 3 | 67.8% | 86.3 / 47.6 | 80.4% |
| 5 | 67.6% | 70.2 / 64.7 | 67.6% |

Why NAS100 did not improve:

  • MICRO stream had 0/5 unique features. Every feature was already prioritised by original streams.
  • Only 3/15 total unique features (vs US30's 6/15, US500's 8/15).
  • MID and SESSION have lowest concentration ratios (2.0x each), nearly uniform attention.
  • Root cause: NAS100 is dominated by mega-cap tech (AAPL, MSFT, NVDA) moving in lockstep. The signal is captured by dist_ma120, ret_60m, and trend_strength regardless of timescale.
  • Granger: cross-asset features (DXY, USDJPY, BTC) have F<1.0 for NAS100 at all lags. No timescale-specific signals to discover.
  • Recommendation: use 4-stream Run 2 config for NAS100 deployment.

NAS100 Run 3d: validation accuracy across epochs. Peak 68.7% at epoch 2.
NAS100 Run 3d: training and validation loss curves.
NAS100 Run 3d: per-class accuracy. 11.2pp bullish bias at peak.
NAS100 Run 3d: VSN feature attention heatmap across all 7 streams.
NAS100 Run 3d: prediction statistics over training.
NAS100 Run 3d: VSN stream detail heatmap.

Run 3d Cross-Index Summary

| Index | Run 2 Val Acc | Run 3d Val Acc | Change | New Stream Uniqueness | Verdict |
|---|---|---|---|---|---|
| US30 | 68.4% | 70.5% | +2.1pp | 6/15 | 7-stream is better |
| US500 | 62.0% | 68.1% | +6.1pp | 8/15 | 7-stream is much better |
| NAS100 | 68.9% | 68.7% | -0.2pp | 3/15 | 4-stream is sufficient |

The benefit of additional streams correlates with cross-asset signal diversity. Indices with rich cross-asset Granger relationships (US30, US500) benefit from the 7-stream design. Indices with simpler, uniform signal structure (NAS100) do not.

Run 3d validation accuracy trajectories for all three indices.

The 7-stream architecture improves US30 (+2.1pp to 70.5%) and US500 (+6.1pp to 68.1%) but not NAS100 (-0.2pp). The improvement correlates with new-stream feature uniqueness: 8/15 for US500, 6/15 for US30, only 3/15 for NAS100. For deployment: US30 and US500 use 7-stream, NAS100 uses 4-stream.

7.11 Barrier Calibration: A Critical Label Flaw

After completing Run 3d across all three indices, a post-hoc analysis of the labelling pipeline revealed a fundamental calibration error. The double-barrier labels used for training depend on a barrier distance parameter that determines when a directional move is "significant enough" to count as a label. This barrier must be calibrated to the volatility of each instrument. It was not.

The Problem

US500 uses a $90 barrier and NAS100 uses a $200 barrier. These were set without reference to the actual hourly price displacement of each index. When measured against the median absolute 60-minute move, both barriers are impossibly large. US500 moves a median of $2.00 per hour, making the $90 barrier 27.6 times the typical hourly move. NAS100 moves a median of $6.80 per hour, making the $200 barrier 29.4 times the typical hourly move. Neither barrier is ever hit within the 60-minute labelling horizon.

| Index | Barrier | Median Hourly Move | Ratio | Hit Rate |
|---|---|---|---|---|
| US30 | $100 | $26 | 3.7x | 21.1% |
| US500 | $90 | $2.0 | 27.6x | 0.0% |
| NAS100 | $200 | $6.80 | 29.4x | 0.0% |

Barrier hit rate curves showing US500 and NAS100 at 0% hit rate.
60-minute displacement distributions with barrier distances marked.

The Fallback Bug

The labelling code assigns a direction based on whichever barrier price hits first within the horizon window. When neither barrier is hit, it silently falls back to close-to-close direction: if the close price at the end of the horizon is above the entry, the label is UP; if below, DOWN. Because the US500 and NAS100 barriers are never hit, 100% of their training labels are this weak fallback. The model was trained on "did the close move up or down by a few dollars" rather than "which barrier did price hit first." This is a fundamentally different and much weaker signal.
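An illustrative reconstruction of the labelling logic (hypothetical code, not the project's pipeline) shows how the fallback fires silently:

```python
def double_barrier_label(prices, entry_idx, barrier, horizon=60):
    """Return (label, source). Label is +1/-1 for whichever barrier is hit
    first within `horizon` bars; if neither is hit, silently fall back to
    close-to-close direction — the bug described above."""
    entry = prices[entry_idx]
    window = prices[entry_idx + 1 : entry_idx + 1 + horizon]
    for p in window:
        if p >= entry + barrier:
            return +1, "barrier"
        if p <= entry - barrier:
            return -1, "barrier"
    # With a barrier ~27x the median hourly move, this branch fires every time.
    return (+1 if window[-1] > entry else -1), "fallback"

prices = [100.0, 100.6, 101.3, 100.9, 100.4, 100.1]
print(double_barrier_label(prices, 0, barrier=1.0))    # (1, 'barrier')  hit at 101.3
print(double_barrier_label(prices, 0, barrier=90.0))   # (1, 'fallback') close 100.1 > 100.0
```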

Figure: Label quality: percentage of real barrier hits vs close-to-close fallback.

Why Validation Accuracy Was Misleading

The 68-70% validation accuracy reported for US500 and NAS100 is real, but it measures close-to-close direction prediction, not barrier-based signal quality. A model that correctly predicts "price will be $3 higher in one hour" scores as correct during validation. But the backtest places a take-profit at the barrier distance ($90 for US500, $200 for NAS100). Price goes up $3 as predicted, but the TP at +$90 is never reached. The trade sits open until the 60-minute timeout, at which point it closes at whatever price happens to be current. 93% of US500 and NAS100 trades exit on timeout rather than hitting TP or SL.

Backtest Results With Symmetric SL

A backtest using symmetric stop-loss (SL at the same distance as TP) confirms the problem. US30, with its partially valid 21.1% barrier hit rate, produces a profitable result. US500 and NAS100 hover around breakeven, consistent with random timeout exits.

| Index | Backtest WR | Net PnL | PF |
|---|---|---|---|
| US30 | 56.5% | +$64,722 | 1.47 |
| US500 | 50.7% | -$4,290 | 0.83 |
| NAS100 | 50.2% | +$2,364 | 1.04 |
Figure: Backtest results showing only US30 is profitable.

Correct Barriers

The target is approximately 30% barrier hit rate within the 60-minute horizon, which balances label quality (enough real barrier hits to train on) against label quantity (not so easy that every bar hits the barrier). The corrected barriers bring all three indices into the 2.8-3.2x range relative to the median hourly move.

| Index | Current Barrier | Correct Barrier | Current Ratio | Correct Ratio |
|---|---|---|---|---|
| US30 | $100 | $75 | 3.7x | 2.8x |
| US500 | $90 | $10 | 27.6x | 3.1x |
| NAS100 | $200 | $40 | 29.4x | 3.2x |
Figure: Barrier-to-hourly-move ratios. US30 is 3.7x; US500 and NAS100 exceed 27x.
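The calibration sanity check implied above is simple to state in code: express each barrier as a multiple of the median absolute 60-minute move and flag anything far outside the target band (the 2-5x band and function names are illustrative, not the project's actual thresholds):

```python
def barrier_to_move_ratio(barrier, median_hourly_move):
    # How many "typical hourly moves" the barrier represents.
    return barrier / median_hourly_move

def is_reasonable(barrier, median_hourly_move, lo=2.0, hi=5.0):
    # Illustrative band around the ~3x target discussed in the text.
    return lo <= barrier_to_move_ratio(barrier, median_hourly_move) <= hi

# NAS100's $200 barrier against a $6.80 median move is ~29x: never hit.
assert round(barrier_to_move_ratio(200.0, 6.80), 1) == 29.4
assert not is_reasonable(200.0, 6.80)
# The corrected US30 barrier ($75 against ~$26) sits near 2.9x.
assert is_reasonable(75.0, 26.0)
```

Running this check before training would have caught the miscalibration immediately.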

Impact on Prior Results

  • All Run 1, Run 2, and Run 3 results for US500 and NAS100 were trained on incorrect labels. The reported validation accuracy measures close-to-close direction prediction, not the intended barrier-based signal.
  • US30 was partially valid (21.1% real barrier hits) but suboptimal. The $100 barrier is larger than necessary; $75 would produce a higher proportion of real barrier labels.
  • The 7-stream architecture findings remain valid. The architecture improved direction prediction regardless of label quality. The relative ranking (7-stream better for US30 and US500, 4-stream sufficient for NAS100) is expected to hold with corrected labels.
  • Retraining with corrected barriers is the immediate next step.
US500 and NAS100 barriers were 27-29x the median hourly move, producing 0% real barrier hits. 100% of training labels were fallback close-to-close direction, not the intended barrier-based signal. This invalidates the reported backtest profitability for these two indices. US30 (3.7x ratio, 21% hit rate) was partially valid. Corrected barriers: US30 $75, US500 $10, NAS100 $40.

Adaptive Barrier: Same-Hour ATR

Fixed barriers are suboptimal because volatility varies by time of day and market regime. A $75 barrier that is reasonable during the US open is too large for the Asian session and too small around FOMC releases. The solution is to compute the barrier dynamically using the ATR of the same hour from recent history.

Method: For each bar, find the last 20 occurrences of the same hour-of-day (requiring at least 1 day apart to avoid clustering), average their 60-minute ATR values, and multiply by a fixed scalar. This produces a barrier calibrated to the typical move at that specific time of day, without any lookahead.
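The method above can be sketched as follows, assuming a DataFrame with a proper DatetimeIndex and a precomputed 60-minute ATR column; the column name, the deduplication-by-calendar-day choice, and the loop structure are illustrative, not the project's actual code:

```python
import pandas as pd

def same_hour_atr_barrier(df, multiplier=5.0, lookback=20):
    """Barrier = multiplier x mean 60-min ATR of the same hour-of-day over the
    last `lookback` distinct days, using only data at least one day old."""
    barriers = pd.Series(index=df.index, dtype=float)
    for ts in df.index:
        # Strictly-past data, at least one day old: no lookahead.
        past = df.loc[: ts - pd.Timedelta(days=1)]
        same_hour = past[past.index.hour == ts.hour]
        if len(same_hour) == 0:
            continue
        # One observation per calendar day, so occurrences are >= 1 day apart.
        per_day = same_hour.groupby(same_hour.index.date)["atr_60"].last()
        barriers[ts] = multiplier * per_day.tail(lookback).mean()
    return barriers
```

The first day of data yields no barrier (NaN) because there is no same-hour history at least one day old; every later bar gets a barrier built only from the past.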

The multiplier controls the trade-off between hit rate and label quality. Higher multipliers produce harder barriers (fewer hits, but each hit represents a larger move). The following table compares multipliers across all three indices:

| Multiplier | US30 Hit Rate | US500 Hit Rate | NAS100 Hit Rate | Std Across Hours |
|---|---|---|---|---|
| x3 | 78% | 79% | 77% | 5-8pp |
| x5 | 61% | 58% | 60% | 7-10pp |
| x8 | 32% | 32% | 30% | 7-14pp |
| x12 | 11% | 10% | 8% | 7-20pp |

At x5, hit rates are 58-61% across all three indices. One universal multiplier works for all instruments with no per-index tuning required. The standard deviation across hours is 7-10 percentage points, meaning the barrier adapts to session volatility naturally.

Session stability: Hit rate ranges from 46% to 80% across trading sessions because the same-hour ATR adapts to each session's characteristic volatility. No session-specific calibration is needed.

Train vs validation stability (no lookahead): The multiplier is structural, not fitted. It remains stable across time periods:

| Index | Train Hit Rate | Val Hit Rate | Difference |
|---|---|---|---|
| US30 | 32.1% (at x8) | 38.0% | +5.9pp |
| US500 | 32.3% (at x8) | 30.4% | -1.9pp |
| NAS100 | 30.3% (at x8) | 32.1% | +1.8pp |

Why x5: 60% real barrier hits (up from 0-21% with fixed barriers), best cross-hour consistency, one multiplier for all instruments, no lookahead, and reasonable barrier sizes (US30 average $73, US500 average $8.3, NAS100 average $41).

Continuous Label Weighting

Bars where the barrier is not hit receive a weight based on how close price got to the barrier. A bar where price moved 99% of the barrier distance is almost as informative as one that hit it. A bar where price barely moved is nearly uninformative.

  • Barrier hit: weight = 1.0
  • Near miss (99% of barrier distance): weight approximately 0.99
  • Barely moved: weight approximately 0.20

This replaces the binary hit/miss classification with a continuous quality signal. The training loss for each bar is scaled by its weight, so the model focuses on bars with clear directional resolution while still learning from weaker signals rather than discarding them entirely.
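The weighting can be written down directly. The 0.2 floor and the near-miss behaviour match the bullets above; the specific mapping (closest-approach fraction with a floor) is one simple choice consistent with them, not necessarily the author's exact formula:

```python
def label_weight(max_favourable_excursion, barrier, floor=0.2):
    """Per-bar training weight from how close price got to the barrier."""
    if max_favourable_excursion >= barrier:
        return 1.0                      # real barrier hit
    frac = max_favourable_excursion / barrier
    return max(floor, frac)            # near miss ~0.99, barely moved -> 0.2

assert label_weight(90.0, 90.0) == 1.0          # hit
assert abs(label_weight(89.1, 90.0) - 0.99) < 1e-9   # near miss
assert label_weight(1.0, 90.0) == 0.2           # barely moved
```

In training, each bar's loss term is multiplied by its weight, which is the scaling described in the paragraph above.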

Run 3e Plan

Retrain all three indices with the following changes:

  1. Same-hour ATR x5 adaptive barriers computed per bar with no lookahead, replacing the fixed barriers.
  2. Continuous label weighting from 0.2 to 1.0, replacing binary hit/miss labels.
  3. Backtest TP/SL set at the same adaptive barrier distance per trade, ensuring the training labels and execution are aligned.

Architecture: 7-stream for US30 and US500, 4-stream for NAS100 (since the 7-stream design did not improve NAS100 in Run 3d).

Expected outcomes:

  • Approximately 60% barrier hits across all indices (up from 0-21%).
  • Validation accuracy may decrease because the task is harder (predicting a real barrier hit, not just close-to-close direction), but correct predictions are now profitable by construction.
  • Break-even accuracy is approximately 51% with symmetric SL/TP. Even 55% directional accuracy on barrier-hit bars is consistently profitable.

7.12 Run 3e/3f: Adaptive ATR Barriers

Run 3e: Weighted Fallback (weight 0.2 for timeout bars)

ATR x5 barriers with timeout bars weighted at 0.2. Result: both US30 and US500 lost money.

| Index | Trades | Win Rate | Net PnL | PF |
|---|---|---|---|---|
| US30 | 11,303 | 48.7% | -$21,301 | 0.94 |
| US500 | 15,063 | 48.5% | -$7,600 | 0.87 |

The 40% fallback labels (even at weight 0.2) still poisoned training. The model learned close-to-close direction, not barrier-hit direction.

Run 3f: HOLD Exclusion (mask=0 for timeout bars)

Complete exclusion of timeout bars from training. Only the approximately 60% of bars where the barrier actually gets hit are used. This is the cleanest possible label set: every training example is a real barrier hit with a known direction.

Run 3e vs Run 3f: The Single Change

The only difference between Run 3e and Run 3f is the treatment of timeout bars. Run 3e kept them in training with a reduced loss weight of 0.2. Run 3f excluded them entirely (mask=0). That single change turned a $21K loss into an $83K gain on the same data, same model, same hyperparameters.

| Metric | Run 3e (weight 0.2) | Run 3f Epoch 1 (weight 0.0) |
|---|---|---|
| Trades | 11,303 | 11,303 |
| Win Rate | 48.7% | 54.2% |
| Net PnL | -$21,301 | +$82,843 |
| Profit Factor | 0.94 | 1.29 |
| Max Drawdown | $24,282 | $6,242 |

Even a small weight on timeout labels is enough to poison the gradient signal. The model learns to predict close-to-close direction (what timeout bars encode) instead of barrier-hit direction (what profitable trading requires). There is no safe non-zero weight for timeout bars.

US30 Run 3f Epoch 1: Profitable

| Metric | Value |
|---|---|
| Trades | 11,303 |
| Win Rate | 54.2% |
| Net PnL | +$82,843 |
| Profit Factor | 1.29 |
| Max Drawdown | $6,242 |
| TP hit rate | 41.9% |
| SL hit rate | 31.7% |
| Avg barrier | $72.71 |

Confidence bucket breakdown:

| Confidence | Trades | WR | Net PnL |
|---|---|---|---|
| 0.50-0.55 | 680 | 51.0% | +$111 |
| 0.55-0.60 | 668 | 51.0% | +$308 |
| 0.60-0.70 | 1,712 | 50.5% | +$3,455 |
| 0.70+ | 8,243 | 55.4% | +$78,968 |

Every confidence bucket is profitable. The 0.70+ bucket dominates with 95% of total PnL.

US30 Run 3f Epoch 3: Also Profitable

| Metric | Epoch 1 | Epoch 3 |
|---|---|---|
| Trades | 11,303 | 10,847 |
| Win Rate | 54.2% | 54.2% |
| Net PnL | +$82,843 | +$77,277 |
| Profit Factor | 1.29 | 1.27 |
| Max Drawdown | $6,242 | $5,280 |

Epoch 3 is also profitable with slightly fewer trades and a tighter max drawdown. Both epoch 1 and epoch 3 are viable deployment candidates.

Figure: US30 Run 3f: OOS equity curve showing +$82,843 over 9 months.
Figure: US30 Run 3f: PnL by model confidence bucket. 0.70+ dominates.
Figure: US30 Run 3f: PnL by hour of day.

The Epoch Contradiction

| Metric | Epoch 1 | Epoch 4 |
|---|---|---|
| Val Accuracy | 67.6% | 70.4% |
| Net PnL | +$82,843 | -$41,635 |
| Win Rate | 54.2% | 48.3% |

Epoch 4 achieves 70.4% validation accuracy but loses $41K in backtesting. Epoch 1 achieves only 67.6% accuracy but makes +$82K. Three hypotheses explain this:

  1. Calibration overfit. Later epochs become more confident but wrong. The model's predicted probabilities drift away from true hit rates, so it takes trades with high confidence that are actually coin flips.
  2. Timeout bar exposure. The backtest trades on every bar, including the 40% that are timeout bars (mask=0 during training). The model never trained on these bars, but it still has to predict on them in live trading. Later epochs may overfit to the distributional properties of barrier-hit bars and perform worse on the unseen timeout bars.
  3. Val accuracy measures the wrong thing. Validation accuracy only measures performance on barrier-hit bars (where mask=1). The backtest includes all bars. An epoch that is better at predicting barrier-hit bars may be worse at predicting the full bar distribution.

Practical recommendation: use epoch 1 or epoch 3 for deployment. Do not chase validation accuracy.

Known Bug: Same-Hour ATR Was Not Hour-Stratified

The ATR barrier calculation was intended to be hour-adaptive (wider barriers during US open, tighter during Asian session). However, the timestamps variable used a RangeIndex (0, 1, 2, ...) instead of actual datetime values. As a result, all bars received the same global ATR regardless of hour. The +$82K results were achieved despite this bug. The fix is applied for Run 3h.
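A minimal illustration of the bug class (not the project's code): with a RangeIndex there is no hour-of-day to stratify on, so any same-hour grouping built on it degenerates to a single global value.

```python
import pandas as pd

prices = pd.Series(range(48), dtype=float)
prices.index = pd.RangeIndex(len(prices))        # 0, 1, 2, ... not timestamps
assert not hasattr(prices.index, "hour")         # hour stratification impossible

# The fix: a real DatetimeIndex exposes hour-of-day for grouping.
prices.index = pd.date_range("2024-01-01", periods=48, freq="h")
assert sorted(set(prices.index.hour)) == list(range(24))   # real hour buckets
```

An `assert isinstance(df.index, pd.DatetimeIndex)` at the top of the barrier function would have surfaced this immediately.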

US500

US500 remains unprofitable under Run 3f. The ATR x5 barrier averages $9.32, but the spread is $0.70, giving a spread-to-barrier ratio of 7.5%. This means the model must overcome a 7.5% cost on every trade just to break even. For comparison, US30 has a $72.71 average barrier with a $1.20 spread (1.7% cost). A longer horizon with ATR x50 barriers is being explored for US500.

US30 Run 3f Epoch 1 is the first profitable backtest in the study: +$82,843 over 9 months OOS, PF 1.29, 54.2% win rate. The HOLD exclusion (mask=0 for timeout bars) was the critical fix. Every confidence bucket is profitable. Epoch 3 is also profitable (+$77,277, PF 1.27). Do not select epochs by validation accuracy; epoch 4 (70.4% acc) loses money.
US500 remains unprofitable. The ATR x5 barrier ($9.32 avg) is only about 13x the spread ($0.70), making the cost-to-barrier ratio prohibitive. A longer horizon (22h with ATR x50) is being explored.

US500: The 4-Hour Horizon Solution

US500 has been the hardest index. History of failed approaches:

| Approach | Horizon | Hit Rate | Avg Barrier | Spread Cost | Result |
|---|---|---|---|---|---|
| Fixed $90 | 1h | 0% | $90 | 0.8% | No labels (100% fallback) |
| ATR x5 | 1h | 59% | $8.30 | 8.4% | Spread eats edge |
| ATR x50 | 22h | 60% | $33 | 2.1% | Features don't predict daily direction (55.3% acc) |
| ATR x15 | 4h | 40% | $20.24 | 3.5% | Selected for Run 3h |

The 4-hour horizon is the middle ground: long enough for the barrier to clear the spread, short enough for M1 features to retain predictive power.

Why ATR x15 over fixed $20: Both produce ~$20 average barrier and 3.5% spread cost. But ATR x15 adapts to volatility regimes (wider barriers in high-vol, tighter in quiet periods), achieves higher hit rate (40% vs 33%), and adapts to time of day with the timestamp fix.

Expected label distribution: ~20% UP, ~20% DOWN, ~60% HOLD. Less training signal per bar than US30, but each label represents a genuine $20+ move within 4 hours.

Break-even win rate: ~51.8% at 3.5% spread cost. US30 achieved 54.2% with the same architecture.
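The break-even arithmetic is worth making explicit. Under the stated assumptions (symmetric TP/SL at barrier distance B, spread cost c x B paid per trade), break-even solves p(B - cB) = (1 - p)(B + cB), giving p = (1 + c) / 2:

```python
def breakeven_wr(spread_cost_frac):
    """Break-even win rate for symmetric TP/SL with a fractional spread cost."""
    # p*(B - c*B) - (1 - p)*(B + c*B) = 0  =>  p = (1 + c) / 2
    return (1.0 + spread_cost_frac) / 2.0

assert abs(breakeven_wr(0.035) - 0.5175) < 1e-12   # US500 at 4h: ~51.8%
assert abs(breakeven_wr(0.017) - 0.5085) < 1e-12   # US30: ~50.9%
```

US30's realised 54.2% clears its ~50.9% hurdle comfortably; US500 needs ~51.8% with the same architecture.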

Run 3h US500 config: 4-hour horizon, ATR x15 barriers, 3-class UP/DOWN/HOLD, same 7-stream architecture.

This is the first US500 configuration that balances all three constraints: sufficient barrier hit rate, manageable spread cost, and a prediction horizon M1 features can address.

Run 3h Plan

Run 3h addresses the epoch contradiction, the hour-ATR bug, and the forced-prediction problem with six changes:

  1. 3-class direction labels. UP / DOWN / HOLD. The model can now abstain instead of being forced to predict on timeout bars. Previously timeout bars were excluded from training but the model still had to predict on them in live trading. With an explicit HOLD class, the model learns when not to trade.
  2. tradeable_acc metric. Measures accuracy only on bars the model chose to trade (predicted UP or DOWN, not HOLD). This replaces val_dir_acc as the primary metric. A model that correctly abstains on ambiguous bars will have lower overall accuracy but higher tradeable_acc.
  3. barrier_hit_arr fix. Explicit boolean array instead of a float threshold for barrier-hit detection. Removes ambiguity in how barrier hits are counted.
  4. Hour-adaptive barriers. Timestamp fix for proper hour stratification. US open hours get wider barriers (reflecting higher volatility), Asian session gets tighter barriers (reflecting lower volatility). This is the bug fix for the RangeIndex issue described above.
  5. Better label distribution. Quiet hours get tighter barriers so more bars produce barrier hits (more training signal). Volatile hours get wider barriers so fewer bars produce spurious hits (cleaner labels). The net effect is a more balanced and accurate label set across the 24-hour cycle.
  6. Hour-level backtest analysis. PnL broken out by hour of day to show which sessions the model has edge in and which sessions should be excluded from live trading.


7.13 Run 3h: 3-Class HOLD + Hour-Stratified ATR

Run 3h applies three fixes to the Run 3f baseline:

  1. 3-class UP/DOWN/HOLD labels. The model can now abstain instead of being forced to predict direction on every bar.
  2. Same-hour ATR timestamp fix. Proper hour stratification, fixing the RangeIndex bug documented in Section 7.11.
  3. Directional confidence metric. max(p_up, p_down) / (p_up + p_down) instead of P(predicted class). This prevents inflated confidence scores in 3-class mode.
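The directional confidence from fix 3, written as a function (a direct transcription of the formula above; variable names are illustrative):

```python
def directional_confidence(p_up, p_down):
    # Confidence over the two directional probabilities only, so the size of
    # P(HOLD) cannot distort the score.
    return max(p_up, p_down) / (p_up + p_down)

# With p_up=0.30, p_down=0.10, p_hold=0.60: P(predicted class) would report
# 0.30, while the directional score among tradeable outcomes is 0.75.
assert abs(directional_confidence(0.30, 0.10) - 0.75) < 1e-12
```

This keeps the 0.70+ confidence gate meaningful in 3-class mode, where raw class probabilities are diluted by the HOLD mass.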

Training Highlights

Tradeable accuracy (accuracy on bars the model chose to trade, excluding HOLD predictions) peaked at epoch 3 (64.7%). HOLD recall peaked at epoch 3 (70.0%). Selective accuracy at the 0.70+ confidence threshold reached 88.3% on barrier-hit bars.

Real Backtest

| Epoch | Trades | Win Rate | Net PnL | PF | Max DD |
|---|---|---|---|---|---|
| Epoch 1 | 7,812 | 51.7% | +$37,266 | 1.18 | $6,100 |
| Epoch 2 | 8,245 | 52.1% | +$34,494 | 1.15 | $4,598 |
| Epoch 3 | ~7,742 | 49.8% | -$12,328 | 0.95 | ~$15K |

The Hour-Level Discovery

This is the key finding from Run 3h. Breaking PnL by hour reveals that the model's edge is not uniform across the day. It is concentrated in a narrow window.

| Hour Group | Trades | Short % | Short WR | Net PnL |
|---|---|---|---|---|
| GOOD (00-06 UTC) | 1,892 | 70% | 71.1% | +$48,911 |
| BAD (08-14, 19-23) | 4,445 | 41% | 51.4% | -$17,600 |
| NEUTRAL (07, 15-18) | 1,908 | 42% | ~53% | +$3,183 |

The model has a SHORT bias from the training label imbalance (DOWN=36.5% vs UP=23.7%). This only works during low-volatility Asian hours (00-06) where shorts naturally succeed. During US hours, the model flips to LONG but only wins 45.6%.

Comparison to Run 3f

| Metric | Run 3f Ep 1 | Run 3h Ep 1 |
|---|---|---|
| Trades | 11,303 | 7,812 (-31%) |
| Net PnL | +$82,843 | +$37,266 |
| PF | 1.29 | 1.18 |
| Hour analysis | Not available (bug) | Full breakdown |

Run 3h trades fewer bars (HOLD abstention) and makes less total PnL, but provides the hour-level analysis revealing the true edge structure.

Bugs Fixed During This Run

  1. HOLD weight bug. HOLD bars got weight 1.0 while UP/DOWN got 10.0 (from LST magnitude). Fixed to weight 10.0 for HOLD.
  2. Confidence bug. P(predicted class) inflated in 3-class mode. Fixed to directional confidence: max(p_up, p_down) / (p_up + p_down).
  3. Timestamp bug. Already documented in Section 7.11.

Equity Curve Periods

Trades 0-1500: +$8K (slow start). Trades 1500-4000: +$30K (core edge). Trades 4000+: -$1K (edge decay).

Actionable Next Steps

  1. Session filter. Only trade 00-06 UTC (+$49K with lower DD).
  2. Confidence gate at 0.70. Filter out losing 0.60-0.70 bucket.
  3. Address label imbalance. DOWN 36.5% vs UP 23.7% creates a short bias that only works in Asian hours.
  4. Early stopping at epoch 1-2. Epoch 3 loses money.
Run 3h reveals the model's edge is concentrated in hours 00-06 UTC (Asian session) with 71.1% short-side win rate and +$49K PnL. The 3-class HOLD system correctly abstains on 31% of bars. Applying a session filter (00-06 only) and confidence gate (0.70+) would produce a cleaner, higher-PF strategy.
Figure: US30 Run 3h Epoch 1: equity curve (+$37,266).
Figure: Epoch 1 PnL by confidence bucket. 0.70+ dominates.
Figure: Epoch 1 PnL by hour: edge concentrated in 00-06 UTC.
Figure: Epoch 2: equity curve (+$34,494).
Figure: Epoch 2 PnL by confidence.
Figure: Epoch 2 PnL by hour.

7.14 Run 3i/3j/3k: Asymmetric Barriers, MAE Smoothing, and the Softmax Bottleneck

Run 3i Results

Run 3i applied asymmetric barriers (LONG ATR x4 / 120 bars, SHORT ATR x5 / 60 bars) to address the structural SHORT bias documented in Run 3h. The hypothesis was that matching barriers and horizons to the microstructure asymmetry (rallies are slow, drops are fast) would balance the label distribution at source and make the long side viable.

Label distribution flipped as expected: UP 44.7% (was 23.7%), DOWN 29.6% (was 36.5%), HOLD 25.7% (was 39.8%). Barrier hit rate improved to 84.2%.

Epoch 1 Backtest

| Metric | Run 3f Ep1 | Run 3h Ep1 | Run 3i Ep1 |
|---|---|---|---|
| Trades | 11,303 | 7,812 | 10,951 |
| Win Rate | 54.2% | 51.7% | 51.8% |
| Net PnL | +$82,843 | +$37,266 | +$66,370 |
| PF | 1.29 | 1.18 | 1.24 |
| Max DD | $6,242 | $6,100 | $9,008 |

Critical Finding: The Model Learns the Class Prior, Not the Features

The asymmetric labelling overcorrected. The model now goes LONG 69% of the time (vs 61% in 3h) but long WR stayed at 47%. It learned the new majority class (UP) just as mechanically as it learned DOWN before. Shorts got even stronger (63% WR, +$100K) but are only 31% of trades. The model is learning the class prior, not the features.

Full 15-Epoch Analysis

  • Epoch 1 is best ($66K), with two catastrophic collapses (epoch 5: -$69K, epoch 11: -$7K).
  • Bad hours progressively fix (from -$23K to +$5K by epoch 13-15).
  • Good hours (02-05 UTC) decay from +$77K to +$26K as the model trades its best edge for balance.
  • Model stabilises after epoch 12 at +$18-25K but with eroded core edge.

Session Effect Confirmed Structural Across All Runs

Hours 02-05 UTC are consistently profitable. Hours 08, 14, 19-21 are consistently losing. Four reasons:

  1. Information arrival rate. News releases during US hours create unpredictable jumps that a momentum/direction model cannot anticipate.
  2. Liquidity regime. Thin Asian-session volume sustains directional moves, giving the model's signals time to play out.
  3. Volatility fat tails. Scheduled data releases (NFP, CPI, FOMC) create intraday shocks that overwhelm any learned pattern.
  4. Afternoon reversion. VWAP mean-reversion in 1-3pm ET (18-20 UTC) punishes the model's directional bets.

Recommended Checkpoints

| Epoch | PnL | PF | Max DD | Use Case |
|---|---|---|---|---|
| 1 | +$66,370 | 1.245 | $9,008 | Max PnL, pair with session filter |
| 6 | +$55,538 | 1.198 | $6,564 | Best risk-adjusted, no filter needed |
| 15 | +$25,341 | 1.086 | $7,592 | Most stable, lowest upside |
Figure: Run 3i: 6-panel epoch analysis showing PnL decay, session splits, and long/short breakdown across 15 epochs.
Figure: Run 3i: overlaid equity curves for key epochs (1, 4, 5, 6, 9, 15).
Figure: Run 3i: hour x epoch PnL heatmap showing session edge stability.

Run 3j Plan: Addressing Root Causes

Four root causes identified from Run 3i:

1. Overfitting after one epoch. Approximately 27,500 independent samples packaged as 1.65M overlapping sequences. The model memorises the training set within a single pass.

2. Poorly calibrated confidence buckets. P(HOLD) is small, pushing directional probabilities artificially high. The 0.70+ confidence bucket contains 90% of all trades, meaning the model is almost never uncertain.

3. No session-conditional decision making. The same weights are applied regardless of Asian, London, or US session. Hours 02-05 UTC are consistently profitable while hours 08-14 and 19-23 consistently lose, yet the model has no mechanism to adapt.

4. Feature horizon mismatch. The model predicts 120-bar (2-hour) moves but the longest momentum feature is ret_120m, the same horizon as the prediction window. It has no view of the larger trend.

Run 3j introduces five changes to address these problems:

Change 1: Learnable Session Embedding. The raw numeric session_flag (0/1/2) is replaced with a learnable embedding vector (3 sessions, EMBED_DIM dimensions) added to the fused representation before the direction heads. A numeric 0/1/2 feature only gives the model a linear slope. The embedding lets it learn non-linear session-conditional behaviour: "during the Asian session, lower the short bias" or "during the US session, require higher confidence before trading." Session definitions are Asian (21:00-06:59 UTC), London (07:00-14:59), and US (15:00-20:59). The raw session_flag remains in the feature set for timestep-varying context across the sequence; the embedding adds a global session bias on top.
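A sketch of the session machinery under the definitions quoted above. The hour-to-session mapping follows the stated boundaries; EMBED_DIM, the table initialisation, and the function names are assumptions (in the real model the table is learned by backpropagation alongside the direction heads):

```python
import numpy as np

ASIAN, LONDON, US = 0, 1, 2

def session_id(hour_utc):
    if 7 <= hour_utc < 15:
        return LONDON          # 07:00-14:59 UTC
    if 15 <= hour_utc < 21:
        return US              # 15:00-20:59 UTC
    return ASIAN               # 21:00-06:59 UTC (wraps past midnight)

EMBED_DIM = 8                                  # assumed; the text leaves it open
rng = np.random.default_rng(0)
session_table = rng.normal(scale=0.02, size=(3, EMBED_DIM))  # learnable weights

def add_session_bias(fused, hour_utc):
    # fused: (EMBED_DIM,) representation just before the direction heads
    return fused + session_table[session_id(hour_utc)]

assert session_id(3) == ASIAN and session_id(22) == ASIAN
assert session_id(7) == LONDON and session_id(20) == US
```

Because the embedding is added after fusion, it acts as a global per-session bias, while the raw session_flag in the feature sequence still provides timestep-level context.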

Change 2: Dynamic MAE-Based Label Smoothing. Instead of uniform label smoothing where all labels are softened equally, smoothing is scaled per sample based on the Max Adverse Excursion (MAE) ratio. For an UP-labelled bar, MAE is how far price dipped below entry before eventually hitting the UP barrier. For a DOWN-labelled bar, MAE is how far price rallied above entry before hitting the DOWN barrier. The MAE ratio (MAE divided by barrier size) ranges from 0.0 (clean, straight path to barrier) to approaching 1.0 (price nearly hit the opposite barrier first). Clean signals receive near-hard labels (smoothing = 0.01), while noisy signals receive heavy smoothing (up to 0.20). HOLD bars, which timed out without hitting either barrier, are the noisiest labels and receive maximum smoothing. The standard cross-entropy loss is replaced with a custom KL-divergence loss using these per-sample soft targets. This should fix the inflated confidence buckets and delay the epoch cliff by making the model train slower on noisy labels.
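The target construction can be sketched as follows. The endpoints (0.01 for a clean path, 0.20 maximum) are from the text; the linear interpolation between them is an assumption, not necessarily the author's exact schedule. The resulting soft targets feed the KL-divergence loss in place of hard-label cross-entropy:

```python
import numpy as np

def mae_smoothing(mae_ratio, s_min=0.01, s_max=0.20):
    # mae_ratio = MAE / barrier size, in [0, 1]; HOLD bars get the maximum.
    return s_min + (s_max - s_min) * float(np.clip(mae_ratio, 0.0, 1.0))

def soft_target(label_idx, mae_ratio, n_classes=3):
    s = mae_smoothing(mae_ratio)
    target = np.full(n_classes, s / (n_classes - 1))   # spread mass to others
    target[label_idx] = 1.0 - s
    return target

clean = soft_target(0, mae_ratio=0.0)   # near-hard label: clean path to barrier
noisy = soft_target(0, mae_ratio=1.0)   # heavy smoothing: nearly hit the other side
```

Clean signals train at full strength (target ~0.99 on the true class) while noisy ones are capped at ~0.80, which is the mechanism that slows overfitting on ambiguous bars.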

Change 3: 240-bar Momentum Feature. A 4-hour return feature (ret_240m) is added alongside the existing ret_60m and ret_120m. The academic TSMOM literature (Moskowitz, Ooi, and Pedersen 2012) shows that momentum at horizons longer than the prediction window provides the strongest signal. The model needs to see the "bigger picture" trend to predict where price goes over the next 2 hours.

Change 4: Ten Daily Macro Level Features. Daily economic data from FRED is merged to M1 bars as slow-moving context features. These are level features (not event surprises), forward-filled and shifted by one day for point-in-time safety.

| Feature | Source | Frequency | What It Tells the Model |
|---|---|---|---|
| macro_t10y2y | T10Y2Y | Daily | Yield curve slope (negative = recession signal) |
| macro_dgs2 | DGS2 | Daily | 2Y Treasury yield (Fed policy expectations) |
| macro_dgs10 | DGS10 | Daily | 10Y Treasury yield (growth/inflation expectations) |
| macro_t10yie | T10YIE | Daily | 10Y breakeven inflation |
| macro_hy_spread | BAMLH0A0HYM2 | Daily | High-yield credit spread (credit stress proxy) |
| macro_dfii10 | DFII10 | Daily | 10Y real yield from TIPS (real cost of capital) |
| macro_t5yie | T5YIE | Daily | 5Y breakeven inflation (short-term) |
| macro_icsa | ICSA | Weekly | Initial jobless claims (labour market pulse) |
| macro_cpi | CPIAUCSL | Monthly | Headline CPI (inflation level) |
| macro_unrate | UNRATE | Monthly | Unemployment rate (labour market slack) |

Daily series have approximately 82% coverage since 2020 (missing on weekends and holidays, forward-filled). Monthly series are constant for roughly 26 days between releases. Each M1 bar uses the previous day's macro value to ensure no look-ahead bias. The VSN (Variable Selection Network) will automatically downweight useless macro features. If after three epochs the macro VSN weights are all near-uniform (high entropy), they will be disabled for Run 3k.
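The point-in-time merge described above can be sketched with toy data (column names and values are illustrative; the real pipeline pulls these series from FRED):

```python
import pandas as pd

# Daily macro levels indexed by release date.
daily = pd.DataFrame(
    {"macro_dgs10": [4.10, 4.15, 4.20]},
    index=pd.to_datetime(["2024-01-02", "2024-01-03", "2024-01-04"]),
)
m1_index = pd.date_range("2024-01-03 00:00", "2024-01-04 23:59", freq="min")

# shift(1) moves each observation to the next row, so a bar only ever sees the
# prior observation's value; reindex + ffill carries it across intraday bars.
shifted = daily.shift(1)
macro = shifted.reindex(m1_index, method="ffill")

# Every M1 bar on Jan 4 sees Jan 3's value, never Jan 4's (no look-ahead).
assert macro.loc["2024-01-04 12:00", "macro_dgs10"] == 4.15
```

The shift-then-fill ordering is what guarantees point-in-time safety: forward-filling before shifting would leak same-day values into intraday bars.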

Change 5: Permutation Entropy Feature. A rolling 60-bar permutation entropy of price returns, measuring the structural predictability of recent price action. Normalised to [0, 1] where 0 means perfectly ordered (monotone trend) and 1 means maximally random (no repeating ordinal patterns). This is based on Bandt and Pompe (2002), using embedding dimension 3 (six possible ordinal patterns) computed on 1-minute returns.

Entropy captures something fundamentally different from both volatility and momentum. A market can be high-volatility but low-entropy (a strong directional move that is big but predictable, the model's sweet spot) or high-volatility and high-entropy (whipsawing chaos, the model's worst case). Run 3i showed that hours 08-14 and 19-23 UTC consistently lose across all 15 epochs. These are precisely the hours with the most competing information flows (economic releases, US open, London/US overlap transitions), meaning the highest entropy periods. With an entropy feature, the model can learn to predict HOLD or lower its conviction when the market is structurally unpredictable.
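A minimal Bandt-Pompe implementation under the stated parameters (embedding dimension 3, normalised by log(3!)); window handling and tie-breaking are simplifications of whatever the pipeline actually does:

```python
import math
from collections import Counter

def permutation_entropy(returns, m=3):
    """Normalised permutation entropy of a return sequence: 0 = ordered, 1 = random."""
    # Ordinal pattern of each length-m window: the argsort of its values.
    patterns = Counter(
        tuple(sorted(range(m), key=lambda k: returns[i + k]))
        for i in range(len(returns) - m + 1)
    )
    n = sum(patterns.values())
    h = -sum((c / n) * math.log(c / n) for c in patterns.values())
    return h / math.log(math.factorial(m))   # normalise by log(m!) = log(6)

assert permutation_entropy(list(range(60))) == 0.0   # monotone trend: one pattern
```

Applied as a rolling 60-bar feature on 1-minute returns, this gives the model a direct measure of how structurally predictable the recent tape is.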

Expected outcomes compared to Run 3i:

| Metric | Run 3i (Best) | Run 3j Target | Mechanism |
|---|---|---|---|
| Peak win rate | 51.8% (Epoch 1) | 53-55% | Better features + MAE smoothing |
| Epoch cliff | Epoch 5 | Epoch 8+ | Smoothing slows overfitting |
| Good-hour WR | 59-68% | 62-70% | Session embedding amplifies session edge |
| Bad-hour loss | -$4K to -$6K/hr | -$1K to -$3K/hr | Session embedding + macro context + entropy gating |
| Confidence calibration | 0.70+ = 90% of trades | 0.70+ = 30-40% of trades | MAE smoothing fixes inflated probabilities |
| HOLD prediction | ~25% of bars | ~35% of bars | Entropy feature helps model abstain in chaotic regimes |

Run 3j Results (Epochs 1-5)

The validation period covers a stretch in which US30 rose +18.6% ($38,998 to $46,266).

| Epoch | Net PnL | WR | PF | Max DD | Long Net | Long WR | Short Net | Short WR | 0.70+ WR |
|---|---|---|---|---|---|---|---|---|---|
| 1 | +$81,750 | 52.7% | 1.30 | $7,902 | -$27,439 | 46.3% | +$109,189 | 58.5% | 54.0% |
| 2 | +$69,259 | 52.8% | 1.27 | $5,632 | -$27,174 | 46.8% | +$96,433 | 58.7% | 54.1% |
| 3 | +$79,938 | 53.3% | 1.32 | $6,907 | -$23,332 | 47.1% | +$103,270 | 59.3% | 54.4% |
| 4 | +$68,379 | 52.8% | 1.25 | $5,561 | -$37,387 | 46.1% | +$105,766 | 59.4% | 53.7% |
| 5 | +$58,815 | 52.5% | 1.21 | $8,058 | -$39,960 | 46.3% | +$98,775 | 58.4% | 53.7% |

Best checkpoint: Epoch 3 (PF 1.32, +$79,938, WR 53.3%).

Comparison vs Run 3i

| Metric | Run 3i Ep1 (Best) | Run 3j Ep3 (Best) | Improvement |
|---|---|---|---|
| Net PnL | +$66,370 | +$79,938 | +20% |
| WR | 51.8% | 53.3% | +1.5pp |
| PF | 1.22 | 1.32 | +0.10 |
| Max DD | ~$15,000 | $6,907 | -54% |
| Epoch cliff | Epoch 5 (-$38K) | No cliff (Ep5 still +$59K) | Eliminated |
| Worst hour loss | -$6,297 | -$1,919 | -70% |

Key Findings

1. MAE label smoothing eliminated the epoch cliff. Run 3i collapsed at epoch 5 (-$38K). Run 3j epoch 5 is still +$59K. The model degrades gradually rather than catastrophically, a $97K improvement at epoch 5 alone.

2. Short edge is strong and stable. Short WR stays at 58-59% across all five epochs with +$97K to +$109K net per epoch. The model's genuine skill is predicting downside moves. This is consistent with market microstructure: drops are driven by panic and stop cascades with recognisable feature patterns (VIX spikes, momentum breaks, volatility clustering).

3. Long trades are consistently negative and worsening. Long net PnL across epochs: -$27K, -$27K, -$23K, -$37K, -$40K. Long WR is stuck at 46-47% and the stop-loss rate is approximately 51% across all epochs. The model IS predicting UP (mean p_up = 0.64 on longs), but longs still lose money.

Root cause: asymmetric barrier structure disadvantages longs. The UP barrier (ATR x4, approximately $49) creates a structural problem. Long TP and SL are both small ($49), while short TP and SL are both large ($81). In intraday markets, even during an uptrend, price routinely dips $49 from any entry before continuing up. This clips the long SL before the trend has time to develop. Meanwhile shorts survive because $81 of room absorbs normal noise, and regular pullbacks from local highs ($50-80) are enough to hit the short TP.

Evidence: long SL rate is 50.9% (more than half stopped out) vs short SL rate of 29.6%. Long average win is $44 vs average loss of $48. Short average win is $73 vs average loss of $57. Hours 02-04 are the best hours overall but the worst for longs (H02 long WR: 26.4%).

Label Distribution Study (Barrier Sensitivity)

Different UP_BARRIER_ATR_MULTIPLIER values were tested while keeping DOWN at ATR x5 and horizons at 120/60:

| UP Barrier | UP% | DOWN% | HOLD% | UP/DOWN Ratio | Avg UP Target | Hit Rate |
|---|---|---|---|---|---|---|
| ATR x4 (current) | 44.7% | 29.6% | 25.7% | 1.51 | $41 | 74.3% |
| ATR x5 (symmetric) | 37.1% | 31.1% | 31.8% | 1.19 | $51 | 68.2% |
| ATR x6 | 30.9% | 32.1% | 37.0% | 0.96 | $60 | 63.0% |
| ATR x7 | 25.8% | 32.7% | 41.4% | 0.79 | $69 | 58.6% |
| ATR x8 | 21.7% | 33.2% | 45.1% | 0.65 | $78 | 54.9% |

The long-side underperformance is a barrier mechanics issue, not a model issue. The model correctly identifies UP moves (p_up = 0.64) but the tight $49 barrier gets stopped out by normal intraday noise before the trend develops. Run 3k will increase the UP barrier multiplier to give longs more breathing room. ATR x6 gives the most balanced UP/DOWN ratio (0.96), while ATR x8 gives longs the most room but risks class imbalance (DOWN 1.5x more common than UP).

Run 3j is the new best result: +$79,938 (PF 1.32, WR 53.3%) at epoch 3, with a 54% reduction in max drawdown and no epoch cliff. MAE-based label smoothing is the largest single improvement in the study. The short side carries the entire PnL (+$103K) while longs lose money due to a tight UP barrier.

Run 3k: Intraday Regime Features and Symmetric Barriers

Run 3k addressed the long-side problem from two angles: nine new intraday regime features and symmetric barriers (ATR x5 for both UP and DOWN).

Root cause investigation. Decomposing val-period returns into intraday and overnight components revealed a structural mismatch. US30 returned +1,792 pts overall, but intraday (open-to-close) summed to -4,682 pts while overnight (close-to-open) contributed +6,468 pts. All gains came from overnight gaps. Since the barriers only measure intraday moves, the val period is effectively bearish from the barrier's perspective, even though close-to-close returns are positive.

Regime analysis. Long trades are not uniformly bad. Monthly breakdown shows three profitable months (Apr, Nov, Dec) and nine losing months. The distinguishing features are: 20-day realised volatility (r = +0.48, d' = 0.99), gap reversal rate (r = +0.47), HY spread change (d' = 0.81), and US30-NAS100 correlation (t = +3.52 at trade level). Longs work in high-volatility, high-stress environments where the barrier is hit by genuine directional moves rather than noise.

Nine new features added: intraday_drift_5d, intraday_drift_20d, gap_reversal_rate_20d, gap_vs_range_20d, hy_spread_chg_20d, realised_vol_20d, dist_sma_20d, dist_ma_240min, and corr_us30_nas100_120. Feature count increased from 56 to 65.

Barrier change. UP barrier widened from ATR x4 to ATR x5 (symmetric with DOWN). Horizons remain asymmetric: UP = 120 bars, DOWN = 60 bars. The intent was to give longs the same $81 breathing room as shorts.

Run 3k Results (Epochs 1-3)

Epoch | Net PnL | WR | PF | Max DD | Long PnL | Long WR | Short PnL | Short WR | Trades
--- | --- | --- | --- | --- | --- | --- | --- | --- | ---
1 | +$28,931 | 51.0% | 1.14 | $8,514 | -$24,719 | 46.5% | +$53,649 | 59.0% | 7,582
2 | -$92,319 | 44.6% | 0.73 | $94,544 | -$119,131 | 41.4% | +$26,812 | 55.5% | 9,142
3 | -$44,846 | 47.2% | 0.85 | $46,730 | -$93,828 | 42.2% | +$48,982 | 57.9% | 8,550

Comparison vs Run 3j

Metric | Run 3j Ep3 (Best) | Run 3k Ep1 (Best) | Run 3k Ep3 | Change
--- | --- | --- | --- | ---
Net PnL | +$79,938 | +$28,931 | -$44,846 | Regression
PF | 1.32 | 1.14 | 0.85 | Regression
Long PnL | -$23,332 | -$24,719 | -$93,828 | Much worse
Short PnL | +$103,270 | +$53,649 | +$48,982 | Halved
Short trades | 5,203 | 2,766 | 3,709 | -29%

Key Findings

1. The symmetric barrier hurt shorts more than it helped longs. The wider UP barrier created more HOLD labels (31.9% vs 26.5% in Run 3j). In the 3-class softmax where $p_{up} + p_{down} + p_{hold} = 1$, the inflated HOLD class cannibalised SHORT predictions. Short trades dropped from 5,203 to 2,766-3,709. Short WR remained strong (57-59%), but far fewer short trades were taken.

2. Longs got worse, not better. Despite the wider barrier giving more breathing room, long performance collapsed. At epochs 2-3, the model went long more aggressively in the worst months (Jul: 290 to 642 trades, Sep: 359 to 678 trades) with SL rates of 61-64%. The model learned the marginal UP labels (bars that were HOLD under ATR x4 but became UP under ATR x5) and traded them, but these are the weakest UP signals.

3. The softmax zero-sum problem is confirmed. As the model improved at predicting DOWN (val_dir_acc: 53% to 69%), it assigned LONG to remaining bars by default. "Not SHORT" became the model's definition of "LONG", which is incorrect. The shared softmax prevents the model from learning independent long and short signals.

4. The new regime features showed early promise but could not overcome the architectural limitation. At epoch 1, corr_us30_nas100_120 ranked first in the short VSN context (weight 0.0212) and gap_reversal_rate_20d ranked second in the micro context (0.0215). The model is trying to use the features, but they compete with short-side features in the same softmax bottleneck.

Run 3k confirms that the long-side problem is architectural, not informational. The nine new regime features provide the right signal (the VSN picks them up), and the barrier change gives longs more room. But the 3-class softmax creates a zero-sum competition where improving shorts degrades longs. No amount of feature engineering or barrier tuning can fix this within the current single-model architecture. The solution is to split into two specialist models: a long-only binary classifier and a short-only binary classifier, each making independent decisions. This is Run 3L.
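The zero-sum competition is easy to demonstrate numerically: in a 3-class softmax, inflating the HOLD logit drains probability from DOWN even when the DOWN logit is untouched. A toy illustration (the logit values are made up):

```python
import math

def softmax(logits):
    """Standard softmax over a list of logits."""
    exps = [math.exp(z) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

# Logit order: [UP, DOWN, HOLD]. The DOWN logit is identical in both
# cases; only HOLD grows, as the wider UP barrier inflates HOLD labels.
p_before = softmax([0.0, 2.0, 0.5])
p_after = softmax([0.0, 2.0, 2.0])
assert p_after[1] < p_before[1]  # DOWN probability falls regardless
```

Two independent sigmoid heads do not share this constraint, which is the motivation for the specialist split in Run 3L.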

Run 3L: Dual-Model Architecture

Run 3L split the model into two specialist binary classifiers. The long specialist trained on UP + HOLD labels only (all DOWN bars remapped to HOLD). The short specialist trained on DOWN + HOLD labels only (all UP bars remapped to HOLD). Same features, same architecture, different label distributions. The only change is the label remapping, isolating the architectural hypothesis.
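The label remapping is a one-line transform per specialist. A sketch assuming integer class codes (the codes themselves are illustrative):

```python
UP, DOWN, HOLD = 0, 1, 2

def remap_for_long_specialist(labels):
    # DOWN bars become HOLD: the long model learns UP vs not-UP only.
    return [HOLD if y == DOWN else y for y in labels]

def remap_for_short_specialist(labels):
    # UP bars become HOLD: the short model learns DOWN vs not-DOWN only.
    return [HOLD if y == UP else y for y in labels]

labels = [UP, DOWN, HOLD, DOWN, UP]
assert remap_for_long_specialist(labels) == [UP, HOLD, HOLD, HOLD, UP]
assert remap_for_short_specialist(labels) == [HOLD, DOWN, HOLD, DOWN, HOLD]
```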

Long Specialist Results (Epochs 1-22)

Every single epoch is negative. The long specialist lost money at every checkpoint, from -$13,690 at epoch 1 to -$89,778 at epoch 14. WR peaked at 47.1% (epoch 1) and deteriorated to 42-43% from epoch 4 onwards.

Epoch | Net PnL | WR | PF | Max DD | Trades
--- | --- | --- | --- | --- | ---
1 | -$13,690 | 47.1% | 0.82 | $14,538 | 2,106
3 | -$15,706 | 46.8% | 0.81 | $18,132 | 2,198
5 | -$74,714 | 41.9% | 0.60 | $76,192 | 4,113
14 | -$89,778 | 42.2% | 0.61 | $90,079 | 5,395
22 | -$85,202 | 43.1% | 0.64 | $86,405 | 5,772

Figure: Run 3L long specialist epoch 1 equity curve. Best long epoch, still consistently negative.

The unified model's longs also lost money, but the shorts masked it. Removing the softmax competition did not improve longs because the UP barrier labels are not tradeable. The UP barrier (ATR x5, approximately $80) gets hit by random intraday volatility, not by directional moves. In the val period, the DOWN barrier is hit first 53% of the time even on weeks where the market went up. The drift (+$9/day) is invisible against an $80 barrier. Predicting which barrier gets touched first is predicting noise, not direction.

Short Specialist Results (Epochs 1-8)

Epoch | Net PnL | WR | PF | Max DD | Trades
--- | --- | --- | --- | --- | ---
1 | +$117,872 | 58.9% | 1.82 | $4,275 | 5,979
2 | +$119,439 | 58.9% | 1.87 | $2,825 | 6,189
4 | +$119,804 | 58.9% | 1.84 | $2,829 | 6,609
7 | +$127,633 | 59.5% | 1.90 | $3,334 | 6,603
8 | +$83,342 | 57.4% | 1.63 | $3,362 | 5,924

Figure: Run 3L short specialist epoch 7 equity curve (+$127,633, PF 1.90). The best result in the study.

The short specialist is the best result in the entire study. Compared to the unified model (Run 3j epoch 3): PnL improved 24% (+$103K to +$128K), PF improved 44% (1.32 to 1.90), max drawdown halved ($6,907 to $3,334), and the model traded 27% more shorts (5,203 to 6,603) at the same WR. Removing the long-side interference freed the model to be more aggressive with shorts.


Figure: Run 3L short specialist epoch 7 PnL by hour.

The short side is solved. The short specialist achieves +$127,633 (PF 1.90, 59.5% WR, max DD $3,334) at epoch 7. Epochs 1-5 are all above +$115K with PF above 1.8. The softmax zero-sum hypothesis was correct for the short side.

Barrier-based labelling is fundamentally wrong for the long side of equity indices. The edge for going long on US30 comes from slow directional drift (+$9/day), which operates at a timescale 10x larger than the barrier distance. No model architecture can learn a profitable long signal from labels that encode volatility noise rather than directional trend. The short side works because intraday drops are sharp and recognisable (panic selling, stop cascades, momentum breaks). Intraday rallies are gradual, choppy, and indistinguishable from noise at the barrier timescale.

Run 3M: Return-Based Labels for Longs

Since barrier labels failed for longs, Run 3M switched to return-based labels: a bar is labelled UP if the 60-minute forward return exceeds +0.5% with max adverse excursion (MAE) below 0.25%. This aligns the label with actual trade mechanics: TP = +0.48%, SL = -0.25%, giving roughly 2:1 risk-reward and a breakeven WR of about 34%.
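One plausible implementation of this label, assuming the forward return is measured at the end of the horizon and MAE from bar lows (the helper name and exact conventions are mine, not the study's code):

```python
def label_up(closes, lows, i, horizon=60, ret_thresh=0.005, mae_thresh=0.0025):
    """UP label: forward return over `horizon` bars exceeds +0.5% while the
    max adverse excursion (deepest low vs entry) stays under 0.25%."""
    entry = closes[i]
    fwd_closes = closes[i + 1 : i + 1 + horizon]
    fwd_lows = lows[i + 1 : i + 1 + horizon]
    if not fwd_closes:
        return False
    fwd_ret = fwd_closes[-1] / entry - 1.0
    mae = max(0.0, 1.0 - min(fwd_lows) / entry)
    return fwd_ret > ret_thresh and mae < mae_thresh
```

The MAE condition is what ties the label to trade mechanics: a bar that eventually rises but first dips through the SL distance is labelled HOLD, not UP.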

Class imbalance. At the 0.5% threshold, 93.7% of bars are HOLD, 3.2% UP, 3.1% DOWN. Standard training would collapse to always-HOLD. Run 3M used batch-balanced focal loss (BBFL): epoch-wise balanced sampling where each batch contains equal UP and HOLD bars, with focal loss (gamma=2) on top to focus learning on the decision boundary (Koziarski and Cyganek, 2023). LR was reduced 6x to 2.5e-5 to compensate for the 30x stronger UP gradient signal per batch, and training ran for 1,000 epochs (88 steps/epoch vs the normal 3,177).
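The focal-loss component can be sketched in a few lines (binary form, following Lin et al.'s original formulation; the batch-balanced sampling lives in the data loader and is not shown):

```python
import math

def focal_loss(p, y, gamma=2.0):
    """Binary focal loss: the (1 - p_t)^gamma factor down-weights easy
    examples so the gradient concentrates near the decision boundary."""
    p_t = p if y == 1 else 1.0 - p
    return -((1.0 - p_t) ** gamma) * math.log(p_t)

# A confident correct prediction contributes almost nothing to the loss;
# a marginal one dominates.
easy = focal_loss(0.95, 1)
hard = focal_loss(0.55, 1)
```

With gamma = 0 this reduces to plain cross-entropy, which is why gamma = 2 is the knob that pushes learning toward the hard UP/HOLD boundary cases.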

Results. The model peaked at 30.4% tradeable accuracy (epoch 333) but this fell short of the 34.7% breakeven WR. At the best checkpoint, 60.7% of trades hit the SL before reaching TP. The model identifies bars that eventually go up, but cannot time the entry precisely enough to avoid the initial adverse excursion.

Epoch | Net PnL | WR | PF | TP Rate | SL Rate | Trades
--- | --- | --- | --- | --- | --- | ---
144 | -$57,728 | 29.1% | 0.52 | 11.6% | 57.2% | 1,728
171 | -$47,620 | 27.0% | 0.48 | 12.7% | 62.0% | 1,227
333 | -$47,413 | 29.3% | 0.56 | 14.8% | 60.7% | 1,509

Literature context. Research confirms this is a structural property of equity markets. The leverage effect (negative shocks increase volatility more than positive shocks decrease it), asymmetric predictability (features triggered by drops have no upside equivalent), and sentiment asymmetry (fear is sudden and pattern-rich, greed is gradual and featureless) all make intraday downside prediction a fundamentally easier ML problem than intraday upside prediction.

The Dip-Buy Discovery

Analysis of the M1 data during Run 3M revealed that profitable long entries on US30 are not directional predictions but mean-reversion bounces after sharp dips. When the 30-bar return drops below -0.5%, the probability of a +0.5% bounce (with MAE below 0.25%) in the next 60 bars jumps to 44.5%, compared to the unconditional baseline of 2.8%. This is a 16x enrichment.

Prior 30-bar Return | UP Hit Rate | vs Baseline (2.8%)
--- | --- | ---
Below -0.5% | 44.5% | 16x
-0.5% to -0.3% | 8.5% | 3x
-0.3% to -0.2% | 4.6% | 1.6x
-0.1% to +0.1% | 1.4% | 0.5x

Entry timing matters more than signal quality. Within dip entries, winning trades enter while the dip is still accelerating (76% WR when deceleration below -0.20) and close to the 30-bar low (70% WR when distance below 0.02%). Waiting for confirmation that the dip has reversed means you have already missed the entry. The session effect is extreme: dips during Asian hours (H02-H05) recover 94-100% of the time, while dips during US hours (H13-H20) continue with only 18-25% recovery. This reframed the long problem entirely: the correct question is not "will price go up?" but "has this dip overshot, and is the bounce imminent?"
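The conditional hit rates in the table above amount to a simple group-by. A sketch on synthetic arrays (toy data, not the M1 series):

```python
def bounce_enrichment(prior_rets, bounce_hits, dip_thresh=-0.005):
    """Hit rate of the bounce conditional on a sharp prior dip,
    compared against the unconditional baseline."""
    base = sum(bounce_hits) / len(bounce_hits)
    dip_hits = [h for r, h in zip(prior_rets, bounce_hits) if r < dip_thresh]
    cond = sum(dip_hits) / len(dip_hits)
    return cond, base, cond / base

prior_rets = [-0.006, -0.007, 0.001, 0.002, -0.006, 0.0, 0.001, 0.003]
bounce_hits = [1, 1, 0, 0, 0, 0, 0, 1]  # 1 = bounce reached TP first
cond, base, enrich = bounce_enrichment(prior_rets, bounce_hits)
```

Run over the real M1 data, this kind of conditioning is what produced the 44.5% vs 2.8% (16x) figure above.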

Run 3N: Dip-Buying Label Redesign

Run 3N redesigned the labelling around the dip-buy signal. A two-stage approach replaced generic return labels:

Stage 1: Drawdown gate (rule-based). A bar qualifies as dip-eligible only when the current close is at least 0.20% below the rolling 120-bar high. Non-dip bars are automatically HOLD. The model is never queried on bars where no dip exists.

Stage 2: Recovery label (forward-looking, training only). Among dip-eligible bars, the label is BUY if price reaches +0.30% (TP) before hitting -0.15% (SL) within 120 bars. Otherwise NO-BUY. This gives 2:1 risk-reward and 33% breakeven WR.
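The two stages can be sketched directly from the definitions above (helper names are mine; ties within a bar are resolved conservatively by checking the SL first):

```python
def dip_eligible(closes, i, lookback=120, gate=0.0020):
    """Stage 1: a bar qualifies only if its close sits at least 0.20%
    below the rolling high over the lookback window."""
    high = max(closes[max(0, i - lookback + 1) : i + 1])
    return closes[i] <= high * (1.0 - gate)

def recovery_label(closes, highs, lows, i, horizon=120, tp=0.0030, sl=0.0015):
    """Stage 2: BUY (1) if +0.30% TP is reached before -0.15% SL within
    the horizon; otherwise NO-BUY (0), including on timeout."""
    entry = closes[i]
    for j in range(i + 1, min(i + 1 + horizon, len(closes))):
        if lows[j] <= entry * (1.0 - sl):
            return 0  # SL touched first -> NO-BUY
        if highs[j] >= entry * (1.0 + tp):
            return 1  # TP touched first -> BUY
    return 0          # timeout -> NO-BUY
```

Non-dip bars never reach Stage 2, which is what keeps the BUY/NO-BUY split at a learnable 22%/78% rather than Run 3M's 3%/97%.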

Binary classification. A single sigmoid head replaced the 3-class softmax. Class balance improved dramatically: 22% BUY vs 78% NO-BUY (compared to Run 3M's 3%/97%). Eight new dip-context features were added: dd_depth, dd_bars, dd_speed, dd_decel, dd_vol_ratio, dd_ibs, dd_cross_coherence, and dd_vix_spike.

Inference dedup. Probability hysteresis requires the model to output P(BUY) above threshold for three consecutive dip-eligible bars before entering. This prevents firing 50 signals on the same dip and ensures sustained conviction.
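A minimal sketch of the hysteresis rule (the 0.6 probability threshold is an assumed placeholder; the write-up specifies only the three-consecutive-bar persistence):

```python
def hysteresis_entries(p_buy, eligible, threshold=0.6, persistence=3):
    """Fire an entry only after P(BUY) clears the threshold on
    `persistence` consecutive dip-eligible bars; any miss resets
    the streak."""
    entries, streak = [], 0
    for i, (p, ok) in enumerate(zip(p_buy, eligible)):
        streak = streak + 1 if (ok and p > threshold) else 0
        if streak >= persistence:
            entries.append(i)
            streak = 0  # one entry per sustained signal, not one per bar
    return entries

p_buy = [0.7, 0.7, 0.7, 0.7, 0.7, 0.2, 0.7, 0.7, 0.7]
eligible = [True] * 9
```

Resetting the streak after each entry is what prevents a single deep dip from generating dozens of overlapping signals.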

Run 3N Results

Epoch | Net PnL | WR | PF | Max DD | Trades | TP/SL/TO
--- | --- | --- | --- | --- | --- | ---
3 | +$1,812 | 38.5% | 1.09 | $990 | 524 | 30/59/11%
30 | +$4,683 | 40.2% | 1.16 | $1,197 | 769 | 31/57/12%
83 | +$3,800 | 39.4% | 1.10 | $2,035 | 918 | 29/57/14%

Figure: Run 3N epoch 30 equity curve (+$4,683). First profitable long model in this study.


Figure: Run 3N epoch 30 PnL by hour. Hour 21 (US close) dominates.

All three epochs are profitable on longs, a first for this project. The model adds value over unconditional dip-buying: TP rate improved from 22% (unconditional) to 31% (epoch 30), a 40% relative improvement. Ten of 14 months were profitable. Buy-and-hold returned +$2,096 with -$7,294 max drawdown over the same period; the dip model returned +$4,683 (2.2x) with -$1,197 max drawdown (6.1x better).

Hour 21 concentration. Hour 21 (US cash close, 4-5 PM ET) contributed 116% of total PnL: 108 trades at 61.1% WR for +$5,423. All other hours combined lost -$740. The mechanism is institutional MOC rebalancing, short covering, and passive fund flows at market close, which mechanically buy dips. This is non-informational flow that reliably recovers intraday drawdowns.

Run 3N is the first profitable long model in this study. The dip-buying framing (drawdown gate + recovery labels + hysteresis) turns the long problem from "predict if price goes up" into "detect when a dip has overshot and the bounce is imminent." Epoch 30: +$4,683, PF 1.16, WR 40.2%, max DD $1,197.

Run 3O: Wider TP/SL Dip-Buying

Run 3O widened TP from 0.30% to 0.50% and SL from 0.15% to 0.20%, increasing the risk-reward from 2.0:1 to 2.5:1 and lowering the breakeven WR from 33% to 29%. The parameter study on training data showed this increases EV per trade by 75% (+$1.44 to +$2.52 per trade). All other parameters (drawdown gate, hysteresis, architecture, features) remained identical to Run 3N.
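The breakeven figures follow from simple expected-value arithmetic (ignoring costs):

```python
def breakeven_wr(tp, sl):
    """Win rate at which expected value per trade is zero:
    wr * tp - (1 - wr) * sl = 0  =>  wr = sl / (tp + sl)."""
    return sl / (tp + sl)

assert round(breakeven_wr(0.0030, 0.0015), 3) == 0.333  # Run 3N: 2.0:1
assert round(breakeven_wr(0.0050, 0.0020), 3) == 0.286  # Run 3O: 2.5:1, ~29%
```

Widening both legs asymmetrically is what buys the 4-point drop in required win rate.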

Breakeven and trailing stop variants were tested and rejected: breakeven at 0.15% reduced PnL by 26%, and trailing stops made the strategy net-negative. Dip recoveries are noisy, and price often dips back to entry before reaching TP. Premature exit kills the edge.

Run 3O Results

Epoch | Net PnL | WR | PF | Max DD | Trades | TP/SL/TO
--- | --- | --- | --- | --- | --- | ---
43 | +$4,157 | 38.6% | 1.16 | $2,100 | 521 | 21/56/23%

Figure: Run 3O epoch 43 equity curve (+$4,157, PF 1.16). Wider TP/SL produces similar net PnL with fewer, larger trades.

The wider TP/SL produces similar net PnL (+$4,157 vs Run 3N's +$4,683) with fewer trades (521 vs 769) and larger average wins (+$152 avg win vs +$132). The higher timeout rate (23% vs 12%) reflects the wider TP being harder to reach, but timeout trades averaged positive PnL. PF matches Run 3N at 1.16.

Run Progression Summary (Runs 3L-3O)

Run | Change | Long PnL | Short PnL | Key Finding
--- | --- | --- | --- | ---
3L Long | Specialist binary model | -$13,690 to -$90K | N/A | Barrier labels are noise for longs
3L Short | Specialist binary model | N/A | +$127,633 | Best result in study (PF 1.90)
3M | Return-based labels (0.5%, MAE) | -$47,413 | N/A | Model can't time entries; dip-buy signal discovered
3N | Dip-buying labels + hysteresis | +$4,683 | N/A | First profitable longs (WR 40.2%)
3O | Wider TP/SL (0.50/0.20) | +$4,157 | N/A | Similar PnL, fewer larger trades

The study now has two solved components. Shorts: the specialist short model achieves +$127,633 (PF 1.90, 59.5% WR) using barrier-based labels. Longs: the dip-buying model achieves +$4,683 (PF 1.16, 40.2% WR) using drawdown-gated recovery labels. These are independent systems that can be combined: the short specialist runs on every bar, and the dip model runs only when a drawdown gate triggers. Combined backtest and walk-forward validation are next.

8. Current Status and Next Steps

The study has converged on a dual-system architecture. The short specialist (Run 3L, epoch 7) achieves +$127,633 (PF 1.90, 59.5% WR, max DD $3,334) using barrier-based labels. The dip-buying long model (Run 3N, epoch 30) achieves +$4,683 (PF 1.16, 40.2% WR, max DD $1,197) using drawdown-gated recovery labels with probability hysteresis. These are fundamentally different systems: the short model predicts volatility-driven drops across all market conditions, while the long model detects mean-reversion bounces after intraday drawdowns.

The journey from Run 3j to 3O established several structural findings about equity index prediction: (1) the 3-class softmax creates a zero-sum competition that prevents independent long and short learning, (2) barrier-based labels encode volatility noise rather than directional drift for the long side, (3) intraday downside prediction is fundamentally easier than upside prediction due to the leverage effect and sentiment asymmetry, (4) profitable long entries on equity indices are mean-reversion events after dips, not directional predictions, and (5) Hour 21 (US cash close) dominates dip-buy profitability through institutional MOC rebalancing flows.

Open investigations and next steps:

  1. Combined dual-system backtest: Run the short specialist and dip-buy model simultaneously on the same val period to measure interaction effects, conflict rates, and combined equity curve.
  2. Walk-forward validation: Expanding-window walk-forward on US30 to confirm both systems are not period-specific.
  3. Hour-21 robustness: The dip model's profitability is concentrated at the US close. Test whether this edge persists across different market regimes and whether it can be isolated as a standalone session strategy.
  4. US500 and NAS100: Apply the dual-system architecture to the other indices once US30 is validated.
  5. MT5 execution bridge: Prepare the live deployment bridge for both systems once walk-forward validation passes.

9. References

# | Authors | Year | Title | Venue
--- | --- | --- | --- | ---
1 | Lo, A.W. & MacKinlay, A.C. | 1990 | An Econometric Analysis of Nonsynchronous Trading | Journal of Econometrics
2 | Chordia, T. & Swaminathan, B. | 2000 | Trading Volume and Cross-Autocorrelations in Stock Returns | Journal of Finance
3 | Stoll, H.R. & Whaley, R.E. | 1990 | The Dynamics of Stock Index and Stock Index Futures Returns | J. Financial & Quantitative Analysis
4 | Hasbrouck, J. | 2003 | Intraday Price Formation in U.S. Equity Index Markets | Journal of Finance
5 | Huth, N. & Abergel, F. | 2011 | High Frequency Lead/Lag Relationships: Empirical Facts | arXiv:1111.7103
6 | Engle, R.F. | 2002 | Dynamic Conditional Correlation | J. Business & Economic Statistics
7 | Forbes, K.J. & Rigobon, R. | 2002 | No Contagion, Only Interdependence | Journal of Finance
8 | Hamilton, J.D. | 1989 | A New Approach to the Economic Analysis of Nonstationary Time Series | Econometrica
9 | Ang, A. & Bekaert, G. | 2002 | International Asset Allocation With Regime Shifts | Review of Financial Studies
10 | Barberis, N. & Shleifer, A. | 2003 | Style Investing | J. Financial Economics
11 | Moskowitz, T.J. & Grinblatt, M. | 1999 | Do Industries Explain Momentum? | Journal of Finance
12 | Moskowitz, T.J., Ooi, Y.H. & Pedersen, L.H. | 2012 | Time Series Momentum | J. Financial Economics
13 | Zhu, X. | 2024 | Examining Pairs Trading Profitability | Yale Economics Working Paper
14 | Greenwood, R. & Sammon, M. | 2023 | The Disappearing Index Effect | Harvard Business School WP 23-025
15 | Li | 2025 | Volatility Risk and Vol-of-Vol Risk: State-Dependent VIX-S&P Correlations | J. Futures Markets
16 | Rothe, J. | 2023 | Dynamic Sector Rotation | SSRN WP #4573209
17 | Mamais | 2025 | Explaining and Predicting Momentum Performance Shifts | J. Forecasting
18 | Li, Chen & Liu | 2025 | High-frequency lead-lag in Chinese index futures | arXiv:2501.03171
19 | Johansen, S. | 1991 | Estimation and Hypothesis Testing of Cointegration Vectors | Econometrica
20 | Nasdaq | 2020 | A Tale of Three Crises in the Past Two Decades | Whitepaper
21 | Nasdaq | 2025 | Understanding the DJIA: Price-Weighted vs. Cap-Weighted Attribution | Whitepaper
22 | Lim, B., Arık, S.Ö., Loeff, N. & Pfister, T. | 2021 | Temporal Fusion Transformers for Interpretable Multi-horizon Time Series Forecasting | International Journal of Forecasting
23 | Granger, C.W.J. | 1969 | Investigating Causal Relations by Econometric Models and Cross-spectral Methods | Econometrica
24 | Pagonidis, A.S. | 2014 | The IBS Effect: Mean Reversion in Equity ETFs | NAAIM Wagner Award Paper
25 | Connors, L. & Alvarez, C. | 2009 | Short Term Trading Strategies That Work | TradingMarkets
26 | Collobert, R. & Weston, J. | 2008 | A Unified Architecture for Natural Language Processing | ICML 2008