Effect of variable selection algorithms on model performance for predicting moisture content in biological materials using spectral data
Variable selection is a critical step for designing a dedicated multispectral real-time system from multicollinearity spectral data. It improves the prediction ability of the calibration model and provides faster prediction by reducing the curse of dimensionality. The main objective of this study wa...
Saved in:
| Published in | Analytica chimica acta Vol. 1202; p. 339390 |
|---|---|
| Main Authors | , , , , |
| Format | Journal Article |
| Language | English |
| Published |
Netherlands
Elsevier B.V
15.04.2022
|
| Subjects | |
| Online Access | Get full text |
| ISSN | 0003-2670 1873-4324 1873-4324 |
| DOI | 10.1016/j.aca.2021.339390 |
Cover
| Summary: | Variable selection is a critical step for designing a dedicated multispectral real-time system from multicollinearity spectral data. It improves the prediction ability of the calibration model and provides faster prediction by reducing the curse of dimensionality. The main objective of this study was to compare the effect of variables selection algorithms on model performance for predicting moisture content in red meat using visible and near-infrared (VNIR) hyperspectral imaging in the spectral range of 400–1000 nm and corn using near-infrared (NIR) spectroscopy in the spectral range of 1100–2498 nm. Six variable selection algorithms including the size of the regression coefficient (RC), variable importance in projection (VIP), genetic algorithm (GA), competitive adaptive reweighted sampling (CARS), successive projection algorithm (SPA), and stepwise regression (SWR) were tested and compared to realize their effects on the model performance for predicting moisture content in red meat and corn. The model based on competitive adaptive reweighted sampling-partial least squares regression (CARS-PLSR) was the best model to predict moisture content in red meat and corn. The results indicated the effectiveness of variable selection for providing the feature wavelengths to design a low-cost, real-time multispectral system.
[Display omitted]
•Spectral data were used to monitor moisture content in red meat and corn.•Pre-processing did not enhance the model predictability compared to raw spectra.•CARS-PLSR outperformed for predicting moisture in red meat and corn. |
|---|---|
| Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
| ISSN: | 0003-2670 1873-4324 1873-4324 |
| DOI: | 10.1016/j.aca.2021.339390 |