Post

SPY Regression of a U.S. Equity Stat-Arb Portfolio

I recently designed and implemented a multi-sleeve statistical arbitrage portfolio for large-cap U.S. equities. The portfolio combines mean reversion, momentum, distance-relative pairs, and event-driven signals into a dollar-neutral long-short framework. The final portfolio consists of eight sleeves with relatively low correlations and achieves an in-sample net Sharpe ratio above 2.0 after transaction costs during the 2015-2023 development period. Although performance weakens during the 2024-2026 evaluation period, the portfolio remains profitable and preserves much of its diversification structure.

One question my original analysis did not answer is whether the portfolio returns are actually alpha or a form of disguised market exposure. To investigate this, I regressed the daily portfolio returns against SPY using a simple OLS model,

\[R_{port} = \alpha + \beta R_{SPY} + \epsilon\]

Given the portfolio’s construction, I would expect low market exposure and a low $R^2$. A statistical arbitrage portfolio that derives its returns from security selection should exhibit a small beta, little explanatory power from the benchmark, and positive alpha. Conversely, if the portfolio’s returns are largely driven by market direction, beta and $R^2$ should be substantially higher.

The below table shows the regression results for each calendar year and the aggregate in-sample (2015-2023), out-of-sample (2024-2026), and full-period results.

PeriodLabelAnn RetAnn VolSharpeSPY RetAlpha (ann)CI loCI hit(alpha)p(alpha)Betat(beta)p(beta)R2SkewKurtN
2015IS5.5%5.3%1.0491.3%5.5%-4.3%15.3%1.10.2710.01020.40.69080.00090.594.5252
2016IS20.6%6.4%3.22312.0%17.0%7.3%26.7%3.440.00060.16134.050.00010.1084-0.190.8252
2017IS2.7%5.9%0.4621.8%2.2%-8.2%12.6%0.420.67730.03260.330.73890.0014-0.313.5251
2018IS18.1%7.6%2.37-4.6%17.2%3.1%31.3%2.390.01670.08691.480.13770.03760.483.6251
2019IS20.2%7.9%2.54231.2%16.6%1.1%32.2%2.10.03590.07451.630.10250.01380.621.5252
2020IS50.5%13.0%3.90118.3%42.7%9.0%76.4%2.490.0129-0.0427-0.620.53780.01212.3712.7253
2021IS3.6%8.5%0.42928.7%2.6%-11.8%17.0%0.350.72390.05130.570.5670.00620.10.4252
2022IS45.8%10.2%4.498-18.2%39.6%25.6%53.6%5.550.00.07711.960.05030.03370.873.5251
2023IS15.3%7.9%1.93726.4%10.7%-3.1%24.5%1.520.1280.15633.970.00010.06730.340.1250
2024OOS-9.1%7.0%-1.31124.9%-9.7%-22.6%3.2%-1.480.13940.0170.460.64220.0009-0.31.0252
2025OOS14.4%15.8%0.90917.9%3.7%-15.0%22.4%0.390.69820.59574.480.00.54055.5674.0250
2026OOS33.7%13.1%2.56715.1%21.5%-2.9%45.9%1.730.08350.55337.430.00.3557-0.390.681
2015–2023IS19.2%8.4%2.2911.8%17.5%11.2%23.9%5.420.00.03340.860.3910.00521.4111.02264
2024–2026OOS5.8%12.3%0.47220.5%-2.3%-13.1%8.4%-0.420.67250.43773.350.00080.32525.086.1583
2015–2026ALL15.4%9.3%1.66513.5%13.2%7.5%18.9%4.540.00.10932.010.0440.04343.1659.82847

The in-sample results are positive. During the 2015-2023 period, the portfolio generated approximately 17.5% annualized alpha with a t-statistic of 5.42 and a p-value effectively equal to zero. At the same time, beta was only 0.0334 and indistinguishable from zero while SPY explained less than 1% of the variation in portfolio returns ($R^2 = 0.52\%$). These results are consistent with the portfolio’s performance during this period was driven primarily by security selection and portfolio construction rather than systematic market risk. Beta remained close to zero throughout most of the period and the strongest contributions came from 2016, 2020, and 2022.

On the other hand, the out-of-sample results aren’t as positive. Annualized alpha falls to roughly -2.3% and loses statistical significance. Beta goes up to 0.4377 and $R^2$ increases to 32.52%. A much larger fraction of portfolio returns are explainable by movements of the benchmark. The year 2024 was particularly challenging with a negative alpha despite relatively modest market exposure. Although performance improves in 2025 and 2026, the recovery also shows substantially higher beta. These results suggest that the portfolio’s behavior changed during the out-of-sample period. During 2015-2023 the portfolio behaved like a fairly pure statistical arbitrage strategy, but during 2024-2026 its returns became increasingly associated with movements in the benchmark.

One possible explanation is that the out-of-sample period represents a different market regime. The benchmark during these years was dominated by the unusually persistent momentum of a small number of large-cap stocks. This environment would’ve been difficult for the portfolio’s mean-reversion sleeves which is consistent with the conclusions in my initial portfolio analysis. It’s also supported by the increased beta and higher benchmark influence in the regression results.

In sum, during the in-sample period the portfolio exhibited near-zero market exposure and generated statistically significant alpha. During the out-of-sample period, however, alpha disappeared while beta increased substantially. The most important question raised by these results is why the portfolio behavior appeared market-neutral in-sample, but developed meaningful market exposure out-of-sample. I think the next step is to identify the source of that exposure. One possibility is that certain sleeves became dominant during the 2024–2026 period and altered the aggregate behavior of the portfolio. One way to investigate this question is to decompose performance at the sleeve level and examine how each sleeve contributes to alpha, beta, and overall risk across different market environments. It would also be useful to run a broader factor analysis to determine whether the observed beta is truly market exposure or whether it reflects exposure to other risk factors such as momentum, size, value, or low-volatility effects.

Notebook with regression results here.

This post is licensed under CC BY 4.0 by the author.