Effect of variable selection algorithms on model performance for predicting moisture content in biological materials using spectral data

Variable selection is a critical step for designing a dedicated multispectral real-time system from multicollinearity spectral data. It improves the prediction ability of the calibration model and provides faster prediction by reducing the curse of dimensionality. The main objective of this study wa...

Full description

Saved in:
Bibliographic Details
Published inAnalytica chimica acta Vol. 1202; p. 339390
Main Authors Kamruzzaman, Mohammed, Kalita, Dipsikha, Ahmed, Md. Toukir, ElMasry, Gamal, Makino, Yoshio
Format Journal Article
LanguageEnglish
Published Netherlands Elsevier B.V 15.04.2022
Subjects
Online AccessGet full text
ISSN0003-2670
1873-4324
1873-4324
DOI10.1016/j.aca.2021.339390

Cover

More Information
Summary:Variable selection is a critical step for designing a dedicated multispectral real-time system from multicollinearity spectral data. It improves the prediction ability of the calibration model and provides faster prediction by reducing the curse of dimensionality. The main objective of this study was to compare the effect of variables selection algorithms on model performance for predicting moisture content in red meat using visible and near-infrared (VNIR) hyperspectral imaging in the spectral range of 400–1000 nm and corn using near-infrared (NIR) spectroscopy in the spectral range of 1100–2498 nm. Six variable selection algorithms including the size of the regression coefficient (RC), variable importance in projection (VIP), genetic algorithm (GA), competitive adaptive reweighted sampling (CARS), successive projection algorithm (SPA), and stepwise regression (SWR) were tested and compared to realize their effects on the model performance for predicting moisture content in red meat and corn. The model based on competitive adaptive reweighted sampling-partial least squares regression (CARS-PLSR) was the best model to predict moisture content in red meat and corn. The results indicated the effectiveness of variable selection for providing the feature wavelengths to design a low-cost, real-time multispectral system. [Display omitted] •Spectral data were used to monitor moisture content in red meat and corn.•Pre-processing did not enhance the model predictability compared to raw spectra.•CARS-PLSR outperformed for predicting moisture in red meat and corn.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:0003-2670
1873-4324
1873-4324
DOI:10.1016/j.aca.2021.339390