Comparison of Time Series Regression, Support Vector Regression, Hybrid, and Ensemble Method to Forecast PM2.5
DOI:
https://doi.org/10.56294/dm2025862Keywords:
PM2.5 Forecasting, Time Series Regression, Support Vector Regression, Hybrid Model, Ensemble LearningAbstract
Introduction: PM2.5 pollution poses significant health risks, particularly in Jakarta, where levels often exceed safety thresholds. Accurate forecasting models are essential for air quality management and mitigation strategies.
Methods: This study compares four forecasting models: Time Series Regression (TSR), Support Vector Regression (SVR), a hybrid TSR-SVR model, and an ensemble approach. The dataset consists of 9,119 hourly PM2.5 observations from January 1, 2023, to January 15, 2024. Missing values were imputed using historical hourly trends. Model performance was evaluated using Root Mean Squared Error (RMSE).
Results: The hybrid TSR-SVR model achieved the lowest RMSE (6.829), outperforming TSR (7.595), SVR (7.477), and the ensemble method (7.486). The hybrid approach effectively captures both linear and nonlinear patterns in PM2.5 fluctuations, making it the most accurate model.
Conclusions: Integrating statistical and machine learning models improves PM2.5 forecasting accuracy, aiding policymakers in pollution control efforts. Future studies should explore additional external factors to enhance prediction performance.
References
[1] J. E. Goldstein, “The Volumetric Political Forest: Territory, Satellite Fire Mapping, and Indonesia’s Burning Peatland,” Antipode, vol. 52, no. 4, pp. 1060–1082, Jul. 2020, doi: 10.1111/anti.12576.
[2] Y. Sri Susilo et al., “Valuation of the Economic Impact of Air Pollution to Promote Public Welfare in Jakarta,” Journal of Business and Information Systems, vol. 6, no. 2, 2024, doi: 10.36067/jbis.v6i2.260.
[3] K. Aji Tritamtama, F. E. S. Sembiring, A. Choiruddin, and H. Patria, “Analysis of Air Pollution (SO2) at Some Point of Congestion in DKI Jakarta,” Disease Prevention and Public Health Journal, vol. 17, no. 1, pp. 82–92, Feb. 2023, doi: 10.12928/dpphj.v17i1.6147.
[4] Y. F. Xing, Y. H. Xu, M. H. Shi, and Y. X. Lian, “The Impact of PM2.5 on The Human Respiratory System,” J Thorac Dis, vol. 8, no. 1, pp. E69–E74, 2016, doi: 10.3978/j.issn.2072-1439.2016.01.19.
[5] A. Kunovac, “Maternal Engineered Nanomaterial Inhalation Exposure: Cardiac Maternal Engineered Nanomaterial Inhalation Exposure: Cardiac Molecular Reprogramming in Progeny through Epigenetic and Molecular Reprogramming in Progeny through Epigenetic and Epitranscriptomic Mechanisms Epitranscriptomic Mechanisms,” West Virginia University, Morgantown, 2021. [Online]. Available: https://researchrepository.wvu.edu/etd/10255
[6] R. B. Hayes et al., “PM2.5 Air Pollution and Cause-Specific Cardiovascular Disease Mortality,” Int J Epidemiol, vol. 49, no. 1, pp. 25–35, Feb. 2020, doi: 10.1093/ije/dyz114.
[7] A. Hasnain et al., “Time Series Analysis and Forecasting of Air Pollutants Based on Prophet Forecasting Model in Jiangsu Province, China,” Front Environ Sci, vol. 10, Jul. 2022, doi: 10.3389/fenvs.2022.945628.
[8] G. I. Drewil and R. J. Al-Bahadili, “Air Pollution Prediction Using LSTM Deep Learning and Metaheuristics Algorithms,” Measurement: Sensors, vol. 24, Dec. 2022, doi: 10.1016/j.measen.2022.100546.
[9] C. H. Cordova, M. N. L. Portocarrero, R. Salas, R. Torres, P. C. Rodrigues, and J. L. López-Gonzales, “Air Quality Assessment and Pollution Forecasting Using Artificial Neural Networks in Metropolitan Lima-Peru,” Sci Rep, vol. 11, no. 1, Dec. 2021, doi: 10.1038/s41598-021-03650-9.
[10] D. Kothandaraman et al., “Intelligent Forecasting of Air Quality and Pollution Prediction Using Machine Learning,” 2022, Hindawi Limited. doi: 10.1155/2022/5086622.
[11] T. Handhayani, “An Integrated Analysis of Air Pollution and Meteorological Conditions in Jakarta,” Sci Rep, vol. 13, no. 1, Dec. 2023, doi: 10.1038/s41598-023-32817-9.
[12] S. Makridakis, E. Spiliotis, and V. Assimakopoulos, “The M4 Competition: 100,000 Time Series and 61 Forecasting Methods,” Int J Forecast, vol. 36, no. 1, pp. 54–74, Jan. 2020, doi: 10.1016/j.ijforecast.2019.04.014.
[13] J. Smith and K. F. Wallis, “A Simple Explanation of The Forecast Combination Puzzle,” Oxf Bull Econ Stat, vol. 71, no. 3, pp. 331–355, Jun. 2009, doi: 10.1111/j.1468-0084.2008.00541.x.
[14] B. Cho et al., “Effective Missing Value Imputation Methods for Building Monitoring Data,” in IEEE International Conference on Big Data, Institute of Electrical and Electronics Engineers Inc., Dec. 2020, pp. 2866–2875. doi: 10.1109/BigData50022.2020.9378230.
[15] W. Wibowo, B. S. S. Ulama, T. H. Siagian, T. Purwa, and R. N. Wilantari, “Impact of earthquakes on the number of airline passenger arrivals and departures: A case Study of West Nusa Tenggara Province, Indonesia,” Regional Statistics, vol. 11, no. 3, pp. 133–157, 2021, doi: 10.15196/RS110302.
[16] L. Wang et al., “Satellite-Based Assessment of The Long-Term Efficacy of PM2.5 Pollution Control Policies Across The Taiwan Strait,” Remote Sens Environ, vol. 251, p. 1, Dec. 2020, doi: 10.1016/j.rse.2020.112067.
[17] D. Alita, A. D. Putra, and D. Darwis, “Analysis of Classic assumption test and multiple linear regression Coefficient Test for Employee Structural Office Recommendation,” IJCCS (Indonesian Journal of Computing and Cybernetics Systems), vol. 15, no. 3, pp. 295–306, Jul. 2021, doi: 10.22146/ijccs.65586.
[18] L. L. Li, Z. Y. Cen, M. L. Tseng, Q. Shen, and M. H. Ali, “Improving Short-Term Wind Power Prediction Using Hybrid Improved Cuckoo Search Arithmetic - Support Vector Regression Machine,” J Clean Prod, vol. 279, pp. 1–15, Jan. 2021, doi: 10.1016/j.jclepro.2020.123739.
[19] C. D. Aju, A. L. Achu, M. P. Mohammed, M. C. Raicy, G. Gopinath, and R. Reghunath, “Groundwater Quality Prediction and Risk Assessment in Kerala, India: A Machine-Learning Approach,” J Environ Manage, vol. 370, p. 1, Nov. 2024, doi: 10.1016/j.jenvman.2024.122616.
[20] W. Y. Duan, Y. Han, L. M. Huang, B. B. Zhao, and M. H. Wang, “A Hybrid EMD-SVR Model for The Short-Term Prediction of Significant Wave Height,” Ocean Engineering, vol. 124, pp. 54–73, Sep. 2016, doi: 10.1016/j.oceaneng.2016.05.049.
[21] C. Go, Y. J. Kwak, S. Kwag, S. Eem, S. Lee, and B. S. Ju, “On Developing Accurate Prediction Models for Residual Tensile Strength of GFRP Bars Under Alkaline-Concrete Environment Using a Combined Ensemble Machine Learning Methods,” Case Studies in Construction Materials, vol. 18, p. 16, Jul. 2023, doi: 10.1016/j.cscm.2023.e02157.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Elly Pusporani, Ghisella Asy Sifa , Nurin Faizun , Pressylia Aluisina Putri Widyangga , Adma Novita Sari , M. Fariz Fadillah Mardianto , Sediono (Author)

This work is licensed under a Creative Commons Attribution 4.0 International License.
The article is distributed under the Creative Commons Attribution 4.0 License. Unless otherwise stated, associated published material is distributed under the same licence.