Analysis and evaluation of ensemble methods in weather classification

Weather classification is essential for numerous fields such as agriculture, environmental monitoring, and disaster management. Accurate classification models are crucial for predicting weather conditions and aiding decision-making processes. Despite the availability of various machine learning tech...

Full description

Saved in:
Bibliographic Details
Published inSignal, image and video processing Vol. 19; no. 9
Main Author Wang, Meixiang
Format Journal Article
LanguageEnglish
Published London Springer London 01.09.2025
Springer Nature B.V
Subjects
Online AccessGet full text
ISSN1863-1703
1863-1711
DOI10.1007/s11760-025-04322-1

Cover

More Information
Summary:Weather classification is essential for numerous fields such as agriculture, environmental monitoring, and disaster management. Accurate classification models are crucial for predicting weather conditions and aiding decision-making processes. Despite the availability of various machine learning techniques, there remains a need to assess and compare their performance effectively on weather classification tasks. Ensuring model reliability and avoiding overfitting is a key challenge, especially when dealing with high-dimensional and categorical data. This research evaluates various machine learning models for weather condition classification, including basic ones such as Logistic Regression (LR), Random Forest (RF), Extreme Gradient Boosting (XGBoost), Categorical Boosting (CatBoost), and ensemble models, namely Voting Classifier, Stacking, and Bagging. Four weather classes with 11,659 samples are used for model training and evaluation. As a result, in testing data, the Voting Classifier achieves the highest performance, with an accuracy of 0.9125, F1-score of 0.91203, and recall of 0.91182, effectively balancing performance and avoiding overfitting. Both RF, XGBoost and Bagging demonstrate robust performance as secondary powerful models, each achieving an accuracy, F1score, and recall of roughly 0.91. Logistic Regression, though performing adequately, yields the lowest results with an accuracy of 0.90379, F1-score of 0.90318, and recall of 0.90309. The findings highlight the potential of ensemble models in real-world applications, where weather type classification is vital for effective preparation and decision-making in fields such as optimizing crop management, tracking climate change impacts, and predicting severe weather events, and enhancing disaster response strategies. Future work will explore the integration of real-time data to further optimize forecasting models and improve practical relevance.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:1863-1703
1863-1711
DOI:10.1007/s11760-025-04322-1