Financial markets show persistent volatility, creating barriers to achieving exact financial predictions. The forecasting of multivariate financial data requires forecasting models like the Vector Autoregressive (VAR) model for modeling linear dependencies, the Long Short-Term Memory (LSTM) model for modeling non-linear patterns, and the Generalized Autoregressive Conditional Heteroscedastic (GARCH) model that is capable of modeling volatility clustering. Each of these models fails to handle complete data complexity on its own, as they specialize in unique properties of the data. Recent studies have been carried out that enhance forecasting accuracy by combining two models. The first case is the VAR-GARCH model, which can model linear and volatility clustering aspects but fails to model non-linear dependencies. Another case is the LSTM-GARCH model that can explain non-linear dependencies and volatility patterns, but fails to explain linear dependencies. A third instance is the VAR-LSTM model that can explain the linear and volatility aspects, but fails to model the non-linear patterns. However, there is a need to have a model that can combine the three models to explain the linear, non-linear, and volatility aspects in financial time series data collectively. This research fills this gap by combining VAR, LSTM, and GARCH into a VAR-LSTM-GARCH hybrid model, which provides improved forecasting. This study uses historical five-year daily data for VIX, US Dollar Index, and S&P 500 E-mini futures obtained from Yahoo Finance. The model-building process involves constructing a VAR (9) model selected using AIC criteria to reveal linear dependencies. The residuals from the VAR are used to train an LSTM model to capture nonlinear trends. The residuals of the LSTM are then used to fit an M-GARCH (1, 1) model, which generates volatility cluster estimates. The VAR-LSTM-GARCH hybrid model demonstrates superior performance with substantial improvements across all evaluation metrics compared to individual models, showing consistently lower prediction errors and enhanced forecasting accuracy. The progressive three-stage modeling approach demonstrates that each component contributes incrementally to forecasting performance, with the incorporation of volatility modeling through GARCH being particularly effective in enhancing predictive accuracy. The research suggests using this hybrid model for volatility prediction on multiple portfolios and emphasizes future development of real-time diagnostic processes. The new approach delivers an advanced instrument that helps financial analysts work efficiently by effectively capturing the complex interdependencies in multivariate financial time series data.
Published in | International Journal of Data Science and Analysis (Volume 11, Issue 4) |
DOI | 10.11648/j.ijdsa.20251104.11 |
Page(s) | 99-113 |
Creative Commons |
This is an Open Access article, distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution and reproduction in any medium or format, provided the original work is properly cited. |
Copyright |
Copyright © The Author(s), 2025. Published by Science Publishing Group |
Hybrid Model, Volatility, Volatility Clustering, Machine Learning, Forecasting, Vector Autoregressive Model, Long Short-Term Memory Model, Generalized Autoregressive Conditional Heteroscedastic Model
[1] | E. Sinclair, Volatility Trading. John Wiley & Sons, 2011. |
[2] | Kleinberg, J. Ludwig, Sendhil Mullainathan, and C. R. Sunstein, “Discrimination In The Age Of Algorithms,” Dec. 2018, |
[3] | Z. Sengul, Comparison of GARCH and SVRGARCH models: Example of gold return, JOURNAL OF APPLIED MICROECONOMETRICS, vol. 3, no. 1, pp. 23-35, Jul. 2023, |
[4] | H. Lutkepohl, Introduction to Multiple Time Series Analysis. Springer Science & Business Media, 2013. |
[5] | A. Sherstinsky, Fundamentals of Recurrent Neural Network (RNN) and Long Short-Term Memory (LSTM) network, Physica D: Nonlinear Phenomena, vol. 404, p. 132306, Mar. 2020, |
[6] | L. Wang, “The risk measurement of China’s carbon financial market: based on garch and var model,” Applied Ecology and Environmental Research, vol. 17, no. 4, 2019, |
[7] | I. Bolat, “A Research on Risk Management in Financial Markets”, Jun. 2023, |
[8] | S. Siami-Namini, N. Tavakoli, and A. Siami Namin,: A Comparison of ARIMA and Long Short-Term Memory (LSTM) in Forecasting Time Series, 2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA), Dec. 2018, |
[9] | P. Booth et al., Modern Actuarial Theory and Practice. CRC Press, 2020. |
[10] | A. Holmes, B. Illowsky, and S. Dean, Introductory Business Statistics 2e. 2023. |
[11] | Y. Song, M. He, Y. Wang, and Y. Zhang, Forecasting crude oil market volatility: A newspaper-based predictor regarding petroleum market volatility, Resources Policy, p. 103093, Nov. 2022, |
[12] | M. T. Moran, Thirty Volatility Indexes: Worldwide Tools to Gauge Sentiment and Diversify Portfolios, The Journal of Index Investing, vol. 4, no. 4, pp. 69-87, Feb. 2014, |
[13] | G. Mokhtari and Fatemeh Aghagoli, Project Portfolio Risk Response Selection Using Bayesian Belief Networks, Iranian journal of management studies, vol. 13, no. 2, pp. 197-219, Apr. 2020, |
[14] | A. Pavlou, M. Doumpos, and C. Zopounidis, “The robustness of portfolio efficient frontiers,” Management Decision, vol. 57, no. 2, pp. 300-313, Feb. 2019, |
[15] | E. W. Chindia, “Forecasting Techniques and Accuracy of Performance Forecasting,” International Journal of Management Excellence, vol. 7, no. 2, p. 813, Aug. 2016, |
[16] | C. Pinheiro and V. Senna, “Price Forecasting Through Multivariate Spectral Analysis: Evidence for Commodities of BM&Fbovespa,” Brazilian Business Review, vol. 13, no. 5, pp. 129-157, Sep. 2016, |
[17] | P. Wang, H. Zhang, Z. Qin, and G. Zhang, “A novel hybrid-Garch model based on ARIMA and SVM for PM 2.5 concentrations forecasting,” Atmospheric Pollution Research, vol. 8, no. 5, pp. 850-860, Sep. 2017, |
[18] | S. Neaime, “The global financial crisis, financial linkages and correlations in returns and volatilities in emerging MENA stock markets,” Emerging Markets Review, vol. 13, no. 3, pp. 268-282, Sep. 2012, |
[19] | L. Bauwens, S. Laurent, and J. V. K. Rombouts, “Multivariate GARCH models: a survey,” Journal of Applied Econometrics, vol. 21, no. 1, pp. 79-109, Jan. 2006, |
[20] | M. Belkhouja and M. Boutahary, “Modeling volatility with time-varying FIGARCH models,” Economic Modelling, vol. 28, no. 3, pp. 1106-1116, May 2011, |
[21] | W. Kong, Z. Y. Dong, Y. Jia, D. J. Hill, Y. Xu, and Y. Zhang, “Short-Term Residential Load Forecasting Based onLSTMRecurrentNeuralNetwork,” IEEETransactions on Smart Grid, vol. 10, no. 1, pp. 841-851, Jan. 2019, |
[22] | F. Karim, S. Majumdar, H. Darabi, and S. Chen, “LSTM Fully Convolutional Networks for Time Series Classification,” IEEE Access, vol. 6, pp. 1662-1669, 2018, |
[23] |
J. H. Stock and M. W. Watson, “Implications of Dynamic Factor Models for VAR Analysis,” National Bureau of Economic Research, Jul. 01, 2005.
https://www.nber.org/papers/w11467 (accessed Sep. 12, 2023). |
[24] | R. Fry and A. Pagan, “Sign Restrictions in Structural Vector Autoregressions: A Critical Review,” Journal of Economic Literature, vol. 49, no. 4, pp. 938-960, Dec. 2011, |
[25] | M. Tejoyadav, R. Nayak, and Umesh Chandra Pati, “Multivariate Water Quality Forecasting of River Ganga Using VAR-LSTM based Hybrid Model,” 2022 IEEE 19th India Council International Conference (INDICON), Nov. 2022, |
[26] | Z. Cheng, J. Lu, H. Zhou, Y. Zhang, and L. Zhang, “Short-Term Traffic Flow Prediction: An Integrated Method of Econometrics and Hybrid Deep Learning,” IEEE Transactions on Intelligent Transportation Systems, pp. 1-14, 2021, |
[27] | H. Y. Kim and C. H. Won, “Forecasting the volatility of stockpriceindex: AhybridmodelintegratingLSTMwith multiple GARCH-type models,” Expert Systems with Applications, vol. 103, pp. 25-37, Aug. 2018, |
[28] | Y. Hu, J. Ni, and L. Wen, “A hybrid deep learning approach by integrating LSTM-ANN networks with GARCH model for copper price volatility prediction,” Physica A: Statistical Mechanics and its Applications, vol. 557, p. 124907, Nov. 2020, |
[29] | M. Nafisi-Moghadam and S. Fattahi, “A Hybrid Model of VAR-DCC-GARCH and Wavelet Analysis for Forecasting Volatility,” Engineering Proceedings, vol. 18, no. 1, p. 6, 2022, |
[30] | C. Cappello, Antonella Congedi, S. D. Iaco, and L. Mariella, “Traditional Prediction Techniques and Machine Learning Approaches for Financial Time Series Analysis,” Mathematics, vol. 13, no. 3, pp. 537-537, Feb. 2025, |
APA Style
Chege, C., Kithinji, M., Gachoki, P. (2025). A Hybrid VAR-LSTM-GARCH Model for Multivariate Volatility Forecasting. International Journal of Data Science and Analysis, 11(4), 99-113. https://doi.org/10.11648/j.ijdsa.20251104.11
ACS Style
Chege, C.; Kithinji, M.; Gachoki, P. A Hybrid VAR-LSTM-GARCH Model for Multivariate Volatility Forecasting. Int. J. Data Sci. Anal. 2025, 11(4), 99-113. doi: 10.11648/j.ijdsa.20251104.11
@article{10.11648/j.ijdsa.20251104.11, author = {Charles Chege and Martin Kithinji and Peter Gachoki}, title = {A Hybrid VAR-LSTM-GARCH Model for Multivariate Volatility Forecasting }, journal = {International Journal of Data Science and Analysis}, volume = {11}, number = {4}, pages = {99-113}, doi = {10.11648/j.ijdsa.20251104.11}, url = {https://doi.org/10.11648/j.ijdsa.20251104.11}, eprint = {https://article.sciencepublishinggroup.com/pdf/10.11648.j.ijdsa.20251104.11}, abstract = {Financial markets show persistent volatility, creating barriers to achieving exact financial predictions. The forecasting of multivariate financial data requires forecasting models like the Vector Autoregressive (VAR) model for modeling linear dependencies, the Long Short-Term Memory (LSTM) model for modeling non-linear patterns, and the Generalized Autoregressive Conditional Heteroscedastic (GARCH) model that is capable of modeling volatility clustering. Each of these models fails to handle complete data complexity on its own, as they specialize in unique properties of the data. Recent studies have been carried out that enhance forecasting accuracy by combining two models. The first case is the VAR-GARCH model, which can model linear and volatility clustering aspects but fails to model non-linear dependencies. Another case is the LSTM-GARCH model that can explain non-linear dependencies and volatility patterns, but fails to explain linear dependencies. A third instance is the VAR-LSTM model that can explain the linear and volatility aspects, but fails to model the non-linear patterns. However, there is a need to have a model that can combine the three models to explain the linear, non-linear, and volatility aspects in financial time series data collectively. This research fills this gap by combining VAR, LSTM, and GARCH into a VAR-LSTM-GARCH hybrid model, which provides improved forecasting. This study uses historical five-year daily data for VIX, US Dollar Index, and S&P 500 E-mini futures obtained from Yahoo Finance. The model-building process involves constructing a VAR (9) model selected using AIC criteria to reveal linear dependencies. The residuals from the VAR are used to train an LSTM model to capture nonlinear trends. The residuals of the LSTM are then used to fit an M-GARCH (1, 1) model, which generates volatility cluster estimates. The VAR-LSTM-GARCH hybrid model demonstrates superior performance with substantial improvements across all evaluation metrics compared to individual models, showing consistently lower prediction errors and enhanced forecasting accuracy. The progressive three-stage modeling approach demonstrates that each component contributes incrementally to forecasting performance, with the incorporation of volatility modeling through GARCH being particularly effective in enhancing predictive accuracy. The research suggests using this hybrid model for volatility prediction on multiple portfolios and emphasizes future development of real-time diagnostic processes. The new approach delivers an advanced instrument that helps financial analysts work efficiently by effectively capturing the complex interdependencies in multivariate financial time series data. }, year = {2025} }
TY - JOUR T1 - A Hybrid VAR-LSTM-GARCH Model for Multivariate Volatility Forecasting AU - Charles Chege AU - Martin Kithinji AU - Peter Gachoki Y1 - 2025/07/14 PY - 2025 N1 - https://doi.org/10.11648/j.ijdsa.20251104.11 DO - 10.11648/j.ijdsa.20251104.11 T2 - International Journal of Data Science and Analysis JF - International Journal of Data Science and Analysis JO - International Journal of Data Science and Analysis SP - 99 EP - 113 PB - Science Publishing Group SN - 2575-1891 UR - https://doi.org/10.11648/j.ijdsa.20251104.11 AB - Financial markets show persistent volatility, creating barriers to achieving exact financial predictions. The forecasting of multivariate financial data requires forecasting models like the Vector Autoregressive (VAR) model for modeling linear dependencies, the Long Short-Term Memory (LSTM) model for modeling non-linear patterns, and the Generalized Autoregressive Conditional Heteroscedastic (GARCH) model that is capable of modeling volatility clustering. Each of these models fails to handle complete data complexity on its own, as they specialize in unique properties of the data. Recent studies have been carried out that enhance forecasting accuracy by combining two models. The first case is the VAR-GARCH model, which can model linear and volatility clustering aspects but fails to model non-linear dependencies. Another case is the LSTM-GARCH model that can explain non-linear dependencies and volatility patterns, but fails to explain linear dependencies. A third instance is the VAR-LSTM model that can explain the linear and volatility aspects, but fails to model the non-linear patterns. However, there is a need to have a model that can combine the three models to explain the linear, non-linear, and volatility aspects in financial time series data collectively. This research fills this gap by combining VAR, LSTM, and GARCH into a VAR-LSTM-GARCH hybrid model, which provides improved forecasting. This study uses historical five-year daily data for VIX, US Dollar Index, and S&P 500 E-mini futures obtained from Yahoo Finance. The model-building process involves constructing a VAR (9) model selected using AIC criteria to reveal linear dependencies. The residuals from the VAR are used to train an LSTM model to capture nonlinear trends. The residuals of the LSTM are then used to fit an M-GARCH (1, 1) model, which generates volatility cluster estimates. The VAR-LSTM-GARCH hybrid model demonstrates superior performance with substantial improvements across all evaluation metrics compared to individual models, showing consistently lower prediction errors and enhanced forecasting accuracy. The progressive three-stage modeling approach demonstrates that each component contributes incrementally to forecasting performance, with the incorporation of volatility modeling through GARCH being particularly effective in enhancing predictive accuracy. The research suggests using this hybrid model for volatility prediction on multiple portfolios and emphasizes future development of real-time diagnostic processes. The new approach delivers an advanced instrument that helps financial analysts work efficiently by effectively capturing the complex interdependencies in multivariate financial time series data. VL - 11 IS - 4 ER -