Research on the influencing factors of PM2.5 in China at different spatial scales based on machine learning algorithm
[Display omitted] •Machine learning model improves PM2.5 prediction with R2 up to 0.98 at grid level.•SHAP analysis shows that pollutant factors significantly impact urban PM2.5 levels.•Meteorological factors dominate PM2.5 influences in non-urban areas.•Tailored interventions by region can effectiv...
Saved in:
| Published in | Environment international Vol. 200; p. 109536 |
|---|---|
| Main Authors | , , , , , , , , , , |
| Format | Journal Article |
| Language | English |
| Published |
Elsevier Ltd
01.06.2025
Elsevier |
| Subjects | |
| Online Access | Get full text |
| ISSN | 0160-4120 1873-6750 1873-6750 |
| DOI | 10.1016/j.envint.2025.109536 |
Cover
| Summary: | [Display omitted]
•Machine learning model improves PM2.5 prediction with R2 up to 0.98 at grid level.•SHAP analysis shows that pollutant factors significantly impact urban PM2.5 levels.•Meteorological factors dominate PM2.5 influences in non-urban areas.•Tailored interventions by region can effectively reduce PM2.5 pollution.
PM2.5 pollution is one of the prominent environmental issues currently faced in China, influenced by various factors and showed significant spatial differences. In this study, the Light Gradient Boosting Machine (LightGBM) model was employed in combination with SHapley Additive exPlanation (SHAP) methods to explore the key impact factors (precursor emissions, meteorological conditions, geographical features and socioeconomic factors) on average annual PM2.5 levels from 2015 to 2022 at both city and grid levels in China. The results show that incorporating pollutant concentration into the model enhances its performance, with R2 improving significantly from 0.79 to 0.93, which underscores the importance of pollutant concentration and the outstanding predictive performance of the LightGBM algorithm. Further, after increasing the spatial resolution and applying a grid-level model, R2 further improves to 0.96 ∼ 0.99. SHAP analysis revealed that PM2.5 levels in urban areas are significantly influenced by pollutant concentration such as NO2, CO, and SO2, accounting for 49.3 % of the total impact. In contrast, the grid-based model highlights the dominant role of meteorological factors such as temperature and precipitation influencing PM2.5 levels in non-urban areas. Moreover, the model results also suggested that the PM2.5 pollution in Yangtze River Delta (YRD) and Pearl River Delta (PRD) are mainly controlled by primary emissions, while in Beijing-Tianjin-Hebei (BTH), Fenwei Plain (FWP) and Sichuan Basin (SCB), atmospheric oxidation capacity is a limiting factor. This study underscores the potential of machine learning in atmospheric pollution control and offers insights for developing more effective and region-specific PM2.5 control policies. |
|---|---|
| Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
| ISSN: | 0160-4120 1873-6750 1873-6750 |
| DOI: | 10.1016/j.envint.2025.109536 |