Presence-Only Data and the EM Algorithm

In ecological modeling of the habitat of a species, it can be prohibitively expensive to determine species absence. Presence-only data consist of a sample of locations with observed presences and a separate group of locations sampled from the full landscape, with unknown presences. We propose an exp...

Full description

Saved in:
Bibliographic Details
Published inBiometrics Vol. 65; no. 2; pp. 554 - 563
Main Authors Ward, Gill, Hastie, Trevor, Barry, Simon, Elith, Jane, Leathwick, John R.
Format Journal Article
LanguageEnglish
Published Malden, USA Blackwell Publishing Inc 01.06.2009
Wiley-Blackwell Publishing
Blackwell Publishing Ltd
Subjects
Online AccessGet full text
ISSN0006-341X
1541-0420
1541-0420
DOI10.1111/j.1541-0420.2008.01116.x

Cover

More Information
Summary:In ecological modeling of the habitat of a species, it can be prohibitively expensive to determine species absence. Presence-only data consist of a sample of locations with observed presences and a separate group of locations sampled from the full landscape, with unknown presences. We propose an expectation-maximization algorithm to estimate the underlying presence-absence logistic model for presence-only data. This algorithm can be used with any off-the-shelf logistic model. For models with stepwise fitting procedures, such as boosted trees, the fitting process can be accelerated by interleaving expectation steps within the procedure. Preliminary analyses based on sampling from presence-absence records of fish in New Zealand rivers illustrate that this new procedure can reduce both deviance and the shrinkage of marginal effect estimates that occur in the naive model often used in practice. Finally, it is shown that the population prevalence of a species is only identifiable when there is some unrealistic constraint on the structure of the logistic model. In practice, it is strongly recommended that an estimate of population prevalence be provided.
Bibliography:http://dx.doi.org/10.1111/j.1541-0420.2008.01116.x
istex:7A9C126844FC4F6B2708FD62A10B2402524E960C
ark:/67375/WNG-MJJWCT2Z-4
ArticleID:BIOM1116
SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 14
ObjectType-Article-1
ObjectType-Feature-2
content type line 23
ISSN:0006-341X
1541-0420
1541-0420
DOI:10.1111/j.1541-0420.2008.01116.x