A Hybrid Machine Learning Model for Modeling Nitrate Concentration in Water Sources

Nitrate is one of the most dangerous contaminants that can pollute water sources; as a result, it is always tried to use accurate methods to monitor its quantity. The goal of this study is to develop a hybrid machine learning model (HELM) for modeling nitrate concentration in water resources. For th...

Full description

Saved in:
Bibliographic Details
Published inWater, air, and soil pollution Vol. 234; no. 11; p. 721
Main Authors Mazraeh, Adnan, Bagherifar, Meysam, Shabanlou, Saeid, Ekhlasmand, Reza
Format Journal Article
LanguageEnglish
Published Cham Springer International Publishing 01.11.2023
Springer
Springer Nature B.V
Subjects
Online AccessGet full text
ISSN0049-6979
1573-2932
DOI10.1007/s11270-023-06745-3

Cover

More Information
Summary:Nitrate is one of the most dangerous contaminants that can pollute water sources; as a result, it is always tried to use accurate methods to monitor its quantity. The goal of this study is to develop a hybrid machine learning model (HELM) for modeling nitrate concentration in water resources. For this purpose, 1453 samples were collected over a 20-year period (from 2000 to 2020). To develop the HMLM, the concentrations of nitrate data were first clustered using the Jenks natural breaks method (JNBA). After that, a support vector machines (SVM) model was developed for each cluster. In this case, the trial-and-error (TaE) method was used to determine the parameters of kernel functions. However, the sequential forward floating selection (SFFS) technique was employed to select the optimal input parameters to simulate the nitrate content in each cluster. In the last step, to improve the efficiency of the SVM model, the parameters of the kernel functions were determined using the Harris-Hawkes optimization (HHO) algorithm. The HHO algorithm is used because of its ability to intelligently shift between the exploration and exploitation phases during optimization. To develop all of the models, 80% of the data was utilized for training and the remaining for testing. Finally, indices such as root mean square error (RMSE), mean absolute error (MAE), Willmott’s index agreement (WI), explained variance score (EVS), Kling-Gupta efficiency (KGE), and coefficient of determination ( R 2 ) were used to evaluate different models. Based on the results of this study, HMLM can be used to accurately model nitrate concentration. Furthermore, using the HHO optimization algorithm instead of the TaE method considerably improves the SVM model’s performance.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ISSN:0049-6979
1573-2932
DOI:10.1007/s11270-023-06745-3