A best-practice guide to predicting plant traits from leaf-level hyperspectral data using partial least squares regression

We provide a step-by-step guide for combining measurements of leaf reflectance and leaf traits to build statistical models that estimate traits from reflectance, enabling rapid collection of a diverse range of leaf properties. Abstract Partial least squares regression (PLSR) modelling is a statistic...

Full description

Saved in:
Bibliographic Details
Published inJournal of experimental botany Vol. 72; no. 18; pp. 6175 - 6189
Main Authors Burnett, Angela C, Anderson, Jeremiah, Davidson, Kenneth J, Ely, Kim S, Lamour, Julien, Li, Qianyu, Morrison, Bailey D, Yang, Dedi, Rogers, Alistair, Serbin, Shawn P
Format Journal Article
LanguageEnglish
Published UK Oxford University Press 30.09.2021
Subjects
Online AccessGet full text
ISSN0022-0957
1460-2431
1460-2431
DOI10.1093/jxb/erab295

Cover

More Information
Summary:We provide a step-by-step guide for combining measurements of leaf reflectance and leaf traits to build statistical models that estimate traits from reflectance, enabling rapid collection of a diverse range of leaf properties. Abstract Partial least squares regression (PLSR) modelling is a statistical technique for correlating datasets, and involves the fitting of a linear regression between two matrices. One application of PLSR enables leaf traits to be estimated from hyperspectral optical reflectance data, facilitating rapid, high-throughput, non-destructive plant phenotyping. This technique is of interest and importance in a wide range of contexts including crop breeding and ecosystem monitoring. The lack of a consensus in the literature on how to perform PLSR means that interpreting model results can be challenging, applying existing models to novel datasets can be impossible, and unknown or undisclosed assumptions can lead to incorrect or spurious predictions. We address this lack of consensus by proposing best practices for using PLSR to predict plant traits from leaf-level hyperspectral data, including a discussion of when PLSR is applicable, and recommendations for data collection. We provide a tutorial to demonstrate how to develop a PLSR model, in the form of an R script accompanying this manuscript. This practical guide will assist all those interpreting and using PLSR models to predict leaf traits from spectral data, and advocates for a unified approach to using PLSR for predicting traits from spectra in the plant sciences.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
SC0012704
USDOE Office of Science (SC), Biological and Environmental Research (BER)
BNL-221656-2021-JAAM
ISSN:0022-0957
1460-2431
1460-2431
DOI:10.1093/jxb/erab295