Presence-Only Data and the EM Algorithm
In ecological modeling of the habitat of a species, it can be prohibitively expensive to determine species absence. Presence-only data consist of a sample of locations with observed presences and a separate group of locations sampled from the full landscape, with unknown presences. We propose an exp...
Saved in:
| Published in | Biometrics Vol. 65; no. 2; pp. 554 - 563 |
|---|---|
| Main Authors | , , , , |
| Format | Journal Article |
| Language | English |
| Published |
Malden, USA
Blackwell Publishing Inc
01.06.2009
Wiley-Blackwell Publishing Blackwell Publishing Ltd |
| Subjects | |
| Online Access | Get full text |
| ISSN | 0006-341X 1541-0420 1541-0420 |
| DOI | 10.1111/j.1541-0420.2008.01116.x |
Cover
| Summary: | In ecological modeling of the habitat of a species, it can be prohibitively expensive to determine species absence. Presence-only data consist of a sample of locations with observed presences and a separate group of locations sampled from the full landscape, with unknown presences. We propose an expectation-maximization algorithm to estimate the underlying presence-absence logistic model for presence-only data. This algorithm can be used with any off-the-shelf logistic model. For models with stepwise fitting procedures, such as boosted trees, the fitting process can be accelerated by interleaving expectation steps within the procedure. Preliminary analyses based on sampling from presence-absence records of fish in New Zealand rivers illustrate that this new procedure can reduce both deviance and the shrinkage of marginal effect estimates that occur in the naive model often used in practice. Finally, it is shown that the population prevalence of a species is only identifiable when there is some unrealistic constraint on the structure of the logistic model. In practice, it is strongly recommended that an estimate of population prevalence be provided. |
|---|---|
| Bibliography: | http://dx.doi.org/10.1111/j.1541-0420.2008.01116.x istex:7A9C126844FC4F6B2708FD62A10B2402524E960C ark:/67375/WNG-MJJWCT2Z-4 ArticleID:BIOM1116 SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 14 ObjectType-Article-1 ObjectType-Feature-2 content type line 23 |
| ISSN: | 0006-341X 1541-0420 1541-0420 |
| DOI: | 10.1111/j.1541-0420.2008.01116.x |