{"slug": "machine-learning-and-the-random-walk-puzzle-forecasting-the-cad-usd-exchange-and", "title": "Machine Learning and the Random Walk Puzzle: Forecasting the CAD/USD Exchange Rate with Expanding Window Evaluation and SHAP Interpretability", "summary": "A study from arXiv finds that linear regression outperforms the naive random walk in forecasting the monthly USD/CAD exchange rate, while machine learning ensemble models show only marginal differences. The research, using data from January 2017 to May 2026, identifies four structural breakpoints tied to trade wars, COVID-19, and Bank of Canada rate cycles, and applies SHAP analysis to interpret model drivers.", "body_md": "arXiv:2606.15058v1 Announce Type: new\nAbstract: This study examines whether machine learning (ML) models can outperform the naive random walk benchmark in forecasting the monthly USD/CAD exchange rate. Using daily data from the Bank of Canada spanning January 2017 to May 2026, resampled into 113 monthly observations, five ML models are evaluated: linear regression, random forest, gradient boosting, XGBoost, and AdaBoost. These models are benchmarked against the naive random walk model and exponential smoothing with Holt-Winters seasonality (ETS). All models are evaluated using an expanding-window framework to maintain strict out-of-sample integrity, and forecast-accuracy differences are assessed using the Diebold-Mariano (DM) test. Structural break detection identifies four significant breakpoints in the series, corresponding to the escalation of the US-China trade war in 2018, the COVID-19 economic recovery in 2020, the peak of the Bank of Canada rate-hiking cycle in 2022, and the start of the Bank of Canada rate-cutting cycle in 2024. SHAP, or Shapley Additive Explanations, analysis is applied to interpret the drivers of the best-performing ML model. The results show that the naive random walk model remains a formidable benchmark. Linear regression is the only model that statistically outperforms the naive random walk model, with a DM statistic of 3.0585 and a p value of 0.0071, whereas the ML ensemble models show only marginal differences. Random Forest with an expanding-window framework achieves the lowest MAPE of 1.17 percent among all models except the random walk. SHAP analysis confirms that short-term lags, particularly lag1 and lag2, and recent rolling means dominate predictions, consistent with the near-random-walk behavior of exchange rates.", "url": "https://wpnews.pro/news/machine-learning-and-the-random-walk-puzzle-forecasting-the-cad-usd-exchange-and", "canonical_source": "https://arxiv.org/abs/2606.15058", "published_at": "2026-06-16 04:00:00+00:00", "updated_at": "2026-06-16 04:27:08.130738+00:00", "lang": "en", "topics": ["machine-learning", "artificial-intelligence", "ai-research"], "entities": ["Bank of Canada", "XGBoost", "AdaBoost", "Random Forest", "SHAP"], "alternates": {"html": "https://wpnews.pro/news/machine-learning-and-the-random-walk-puzzle-forecasting-the-cad-usd-exchange-and", "markdown": "https://wpnews.pro/news/machine-learning-and-the-random-walk-puzzle-forecasting-the-cad-usd-exchange-and.md", "text": "https://wpnews.pro/news/machine-learning-and-the-random-walk-puzzle-forecasting-the-cad-usd-exchange-and.txt", "jsonld": "https://wpnews.pro/news/machine-learning-and-the-random-walk-puzzle-forecasting-the-cad-usd-exchange-and.jsonld"}}