Autoregressive Random Forests: Machine Learning and Lag Selection for Financial Research
Source of Publication
This paper provides evidence on the use of Random Regression Forests (RRF) for optimal lag selection. Using an extended sample of 144 data series, of various data types with different frequencies and sample sizes, we perform optimal lag selection using RRF and compare the results with seven “traditional” information criteria as well as with three other machine learning approaches. We show that the different information criteria produce differing outcomes in terms of optimal lag selection. To quantify performance, we compare the forecast errors on autoregressive models using the optimal lags selected by the criteria and demonstrate that RRF outperforms other approaches. We provide suggestions to researchers as to which approach to use, under different combinations of data type/data frequency and data type/sample size.
Springer Science and Business Media LLC
Random regression forest, Optimal lag, Lasso, Ridge regression, Nayesian model averaging
Polyzos, Efstathios and Siriopoulos, Costas, "Autoregressive Random Forests: Machine Learning and Lag Selection for Financial Research" (2023). All Works. 6029.
Indexed in Scopus