Mapping forest change using stacked generalization: An ensemble approach
The ever-increasing volume and accessibility of remote sensing data has spawned many alternative approaches for mapping important environmental features and processes. For example, there are several viable but highly varied strategies for using time series of Landsat imagery to detect changes in for...
Saved in:
| Published in | Remote sensing of environment Vol. 204; pp. 717 - 728 |
|---|---|
| Main Authors | , , , , , , , , , , , , , , , , , |
| Format | Journal Article |
| Language | English |
| Published |
New York
Elsevier Inc
01.01.2018
Elsevier BV |
| Subjects | |
| Online Access | Get full text |
| ISSN | 0034-4257 1879-0704 |
| DOI | 10.1016/j.rse.2017.09.029 |
Cover
| Summary: | The ever-increasing volume and accessibility of remote sensing data has spawned many alternative approaches for mapping important environmental features and processes. For example, there are several viable but highly varied strategies for using time series of Landsat imagery to detect changes in forest cover. Performance among algorithms varies across complex natural systems, and it is reasonable to ask if aggregating the strengths of an ensemble of classifiers might result in increased overall accuracy. Relatively simple rules have been used in the past to aggregate classifications among remotely sensed maps (e.g. using majority predictions), and in other fields, empirical models have been used to create situationally specific algorithm weights. The latter process, called “stacked generalization” (or “stacking”), typically uses a parametric model for the fusion of algorithm outputs. We tested the performance of several leading forest disturbance detection algorithms against ensembles of the outputs of those same algorithms based upon stacking using both parametric and Random Forests-based fusion rules. Stacking using a Random Forests model cut omission and commission error rates in half in many cases in relation to individual change detection algorithms, and cut error rates by one quarter compared to more conventional parametric stacking. Stacking also offers two auxiliary benefits: alignment of outputs to the precise definitions built into a particular set of empirical calibration data; and, outputs which may be adjusted such that map class totals match independent estimates of change in each year. In general, ensemble predictions improve when new inputs are added that are both informative and uncorrelated with existing ensemble components. As increased use of cloud-based computing makes ensemble mapping methods more accessible, the most useful new algorithms may be those that specialize in providing spectral, temporal, or thematic information not already available through members of existing ensembles.
[Display omitted]
•Stacking can be used to leverage an ensemble of maps to improve accuracy.•Stacking can align a mapping process with project-specific class definitions.•Random Forests was better than logistic regression as an ensemble fusion rule.•Cloud computing lowers barriers to stacking.•Future algorithm development may focus on specialization to fill ensemble gaps. |
|---|---|
| Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23 |
| ISSN: | 0034-4257 1879-0704 |
| DOI: | 10.1016/j.rse.2017.09.029 |