Feature Learning Behavior in Trade Forecasting: Evidence from Tree-Based Machine Learning Models
Corresponding author: S. Balasubramanian
Abstract
This study investigates how tree-based machine learning models learn from historical trade data by examining the importance of lagged export and import values in the context of India's monthly aggregate trade series from Fiscal Year (FY) 1990–91 to FY 2024–25. Rather than prioritizing predictive accuracy, the analysis focuses on feature-based learning behavior using lagged inputs constructed within a supervised learning framework. A time-aware training–testing split is employed to preserve chronological integrity. Feature importance analysis is treated as the primary analytical outcome to evaluate how models allocate learning weights across short-term and medium-term temporal inputs. The results show that recent export lags consistently dominate model learning, whereas selected medium-term import lags also contribute meaningfully. Robustness checks using alternative lag specifications confirm the stability of short-term dominance. These findings demonstrate that tree-based trade forecasting models learn in a structured and selective manner rather than distributing attention uniformly across historical observations. By emphasizing interpretability over pure accuracy comparisons, the study provides deeper insight into temporal learning dynamics in machine learning-based trade forecasting.
Keywords:
trade forecasting, feature learning, lagged variables, tree-based machine learning, XGBoost, model interpretabilityDownloads
References
T. Chen and C. Guestrin, "XGBoost: A Scalable Tree Boosting System," in Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 2016, pp. 785–794. DOI: https://doi.org/10.1145/2939672.2939785
L. Breiman, "Random Forests," Machine Learning, vol. 45, no. 1, pp. 5–32, Oct. 2001. DOI: https://doi.org/10.1023/A:1010933404324
S. M. Lundberg and S.-I. Lee, "A unified approach to interpreting model predictions," in Proceedings of the 31st International Conference on Neural Information Processing Systems, Long Beach, CA, USA, 2017, pp. 4768–4777.
S. M. Lundberg, G. G. Erion, and S.-I. Lee, "Consistent Individualized Feature Attribution for Tree Ensembles." arXiv, Mar. 07, 2019.
A. Altmann, L. Toloşi, O. Sander, and T. Lengauer, "Permutation importance: a corrected feature importance measure," Bioinformatics, vol. 26, no. 10, pp. 1340–1347, May 2010. DOI: https://doi.org/10.1093/bioinformatics/btq134
S. Makridakis, E. Spiliotis, and V. Assimakopoulos, "The M4 Competition: 100,000 time series and 61 forecasting methods," International Journal of Forecasting, vol. 36, no. 1, pp. 54–74, Jan. 2020. DOI: https://doi.org/10.1016/j.ijforecast.2019.04.014
F. Batarseh, M. Gopinath, G. Nalluru, and J. Beckman, "Application of Machine Learning in Forecasting International Trade Trends." arXiv, Oct. 07, 2019.
H. Jošić and B. Žmuk, "A Machine Learning Approach to Forecast International Trade: The Case of Croatia," Business Systems Research Journal, vol. 13, no. 3, pp. 144–160, Dec. 2022. DOI: https://doi.org/10.2478/bsrj-2022-0030
M. D. Chinn, B. Meunier, and S. Stumpner, "Nowcasting World Trade with Machine Learning: a Three-Step Approach." National Bureau of Economic Research, June 2023. DOI: https://doi.org/10.3386/w31419
J. Sun, Y. Suo, S. Park, T. Xu, Y. Liu, and W. Wang, "Analysis of Bilateral Trade Flow and Machine Learning Algorithms for GDP Forecasting," Engineering, Technology & Applied Science Research, vol. 8, no. 5, pp. 3432–3438, Oct. 2018. DOI: https://doi.org/10.48084/etasr.2311
C. Molnar, Interpretable Machine Learning: A Guide for Making Black Box Models Explainable, 3rd ed. Berlin, Germany: Self-Published, 2022.
B. Subramanian, "Monthly Aggregate Export and Import Trade Data for India (FY 1990–91 to FY 2024–25)." Zenodo, Feb. 19, 2026.
Downloads
How to Cite
License
Copyright (c) 2026 S. Balasubramanian, M. Natarajan

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain the copyright and grant the journal the right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) after its publication in ETASR with an acknowledgement of its initial publication in this journal.
