TY - JOUR
T1 - Fine particulate matter concentration forecasting using long short-term memory network and meteorological inputs
AU - Istiana, T.
AU - Kurniawan, B.
AU - Soekirno, S.
AU - Wihono, A.
AU - Nuryanto, D. E.
AU - Pertala, B. A.
AU - Sopaheluwakan, A.
N1 - Publisher Copyright:
© 2024 The author(s). This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third-party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit: http://creativecommons.org/licenses/by/4.0/.
PY - 2024/9
Y1 - 2024/9
N2 - BACKGROUND AND OBJECTIVES: In metropolitan settings, the requirement to travel and participate in everyday tasks exposes numerous individuals to the harmful effects of air pollutants, specifically particulate matter 2.5, which has the potential to impact their well-being. Developing precise forecasting models is crucial in mitigating air pollution and providing accurate predictions for the people. Nonetheless, the deficiency in acquiring observable data can frequently lead to unsatisfactory performance of forecasting models in various scenarios. The objective of this study is to address the issue by examining the most effective approaches for predicting the non-linear time-series data of daily particulate matter 2.5 concentration using meteorological inputs. METHODS: The concentration data of particulate matter 2.5 at Central Jakarta and South Jakarta were collected using sensors from the United States of America Embassy in Indonesia and Indonesia’s Meteorological, Climatological, and Geophysical Agency. Conversely, the meteorological information was collected through the Merra-2 satellite. This study introduces the long short-term memory deep learning model and contrasts it with the one-dimensional convolution neural network as well as their hybrid counterpart. The dataset is split into 80 percent training and 20 percent testing data. The root mean square and mean absolute error values are then calculated to determine the performance of the models. FINDINGS: A combination of long short-term memory and fully connected layers using dropouts and early stopping patience techniques has been successfully developed to model the non-linear time-series data of daily particulate matter 2.5 concentration. The model effectively captured the patterns present in the historical data, resulting in outcomes that exhibited similar patterns. The long short-term memory model demonstrates an overall root mean square error and mean absolute error values of 18.53 micrograms per cubic meter and 14.92 micrograms per cubic meter in Central Jakarta and 19.4 micrograms per cubic meter and 15.61 micrograms per cubic meter in South Jakarta, where the best seasonal data were found to be in the June-July-August and December-January-February seasons respectively. CONCLUSION: The air pollution forecasting models, which were created using both seasonal and overall time-series data, have the ability to predict air pollution levels by utilizing historical pollution data and meteorological inputs. The proposed long short-term memory model outperforms the one-dimensional convolution network and their hybrid combination. It has effectively surpassed the constraint of collecting observable data, attaining minimal error values on both sensors and satellite data, signifying a noteworthy progression compared to previous studies. Therefore, it might benefit areas lacking sufficient data, providing a valuable tool for air pollution mitigation.
AB - BACKGROUND AND OBJECTIVES: In metropolitan settings, the requirement to travel and participate in everyday tasks exposes numerous individuals to the harmful effects of air pollutants, specifically particulate matter 2.5, which has the potential to impact their well-being. Developing precise forecasting models is crucial in mitigating air pollution and providing accurate predictions for the people. Nonetheless, the deficiency in acquiring observable data can frequently lead to unsatisfactory performance of forecasting models in various scenarios. The objective of this study is to address the issue by examining the most effective approaches for predicting the non-linear time-series data of daily particulate matter 2.5 concentration using meteorological inputs. METHODS: The concentration data of particulate matter 2.5 at Central Jakarta and South Jakarta were collected using sensors from the United States of America Embassy in Indonesia and Indonesia’s Meteorological, Climatological, and Geophysical Agency. Conversely, the meteorological information was collected through the Merra-2 satellite. This study introduces the long short-term memory deep learning model and contrasts it with the one-dimensional convolution neural network as well as their hybrid counterpart. The dataset is split into 80 percent training and 20 percent testing data. The root mean square and mean absolute error values are then calculated to determine the performance of the models. FINDINGS: A combination of long short-term memory and fully connected layers using dropouts and early stopping patience techniques has been successfully developed to model the non-linear time-series data of daily particulate matter 2.5 concentration. The model effectively captured the patterns present in the historical data, resulting in outcomes that exhibited similar patterns. The long short-term memory model demonstrates an overall root mean square error and mean absolute error values of 18.53 micrograms per cubic meter and 14.92 micrograms per cubic meter in Central Jakarta and 19.4 micrograms per cubic meter and 15.61 micrograms per cubic meter in South Jakarta, where the best seasonal data were found to be in the June-July-August and December-January-February seasons respectively. CONCLUSION: The air pollution forecasting models, which were created using both seasonal and overall time-series data, have the ability to predict air pollution levels by utilizing historical pollution data and meteorological inputs. The proposed long short-term memory model outperforms the one-dimensional convolution network and their hybrid combination. It has effectively surpassed the constraint of collecting observable data, attaining minimal error values on both sensors and satellite data, signifying a noteworthy progression compared to previous studies. Therefore, it might benefit areas lacking sufficient data, providing a valuable tool for air pollution mitigation.
KW - Air pollution Hybrid deep learning models Long short term memory (LSTM) Fine particulate matter (PM) forecasting
UR - http://www.scopus.com/inward/record.url?scp=85200856118&partnerID=8YFLogxK
U2 - 10.22034/gjesm.2024.04.16
DO - 10.22034/gjesm.2024.04.16
M3 - Article
AN - SCOPUS:85200856118
SN - 2383-3572
VL - 10
SP - 1759
EP - 1774
JO - Global Journal of Environmental Science and Management
JF - Global Journal of Environmental Science and Management
IS - 4
ER -