Literature DB >> 18759851

Presence-only data and the em algorithm.

Gill Ward1, Trevor Hastie, Simon Barry, Jane Elith, John R Leathwick.   

Abstract

SUMMARY: In ecological modeling of the habitat of a species, it can be prohibitively expensive to determine species absence. Presence-only data consist of a sample of locations with observed presences and a separate group of locations sampled from the full landscape, with unknown presences. We propose an expectation-maximization algorithm to estimate the underlying presence-absence logistic model for presence-only data. This algorithm can be used with any off-the-shelf logistic model. For models with stepwise fitting procedures, such as boosted trees, the fitting process can be accelerated by interleaving expectation steps within the procedure. Preliminary analyses based on sampling from presence-absence records of fish in New Zealand rivers illustrate that this new procedure can reduce both deviance and the shrinkage of marginal effect estimates that occur in the naive model often used in practice. Finally, it is shown that the population prevalence of a species is only identifiable when there is some unrealistic constraint on the structure of the logistic model. In practice, it is strongly recommended that an estimate of population prevalence be provided.

Entities:  

Mesh:

Year:  2009        PMID: 18759851      PMCID: PMC4821886          DOI: 10.1111/j.1541-0420.2008.01116.x

Source DB:  PubMed          Journal:  Biometrics        ISSN: 0006-341X            Impact factor:   2.571


  24 in total

1.  Niches and distributional areas: concepts, methods, and assumptions.

Authors:  Jorge Soberón; Miguel Nakamura
Journal:  Proc Natl Acad Sci U S A       Date:  2009-09-23       Impact factor: 11.205

2.  Integrating microRNA target predictions for the discovery of gene regulatory networks: a semi-supervised ensemble learning approach.

Authors:  Gianvito Pio; Donato Malerba; Domenica D'Elia; Michelangelo Ceci
Journal:  BMC Bioinformatics       Date:  2014-01-10       Impact factor: 3.169

3.  Inference from presence-only data; the ongoing controversy.

Authors:  Trevor Hastie; Will Fithian
Journal:  Ecography (Cop.)       Date:  2013-08-01       Impact factor: 5.992

4.  Accounting for habitat when considering climate: has the niche of the Adonis blue butterfly changed in the UK?

Authors:  Rory S O'Connor; Rosemary S Hails; Jeremy A Thomas
Journal:  Oecologia       Date:  2014-01-11       Impact factor: 3.225

5.  A maximum likelihood approach to electronic health record phenotyping using positive and unlabeled patients.

Authors:  Lingjiao Zhang; Xiruo Ding; Yanyuan Ma; Naveen Muthu; Imran Ajmal; Jason H Moore; Daniel S Herman; Jinbo Chen
Journal:  J Am Med Inform Assoc       Date:  2020-01-01       Impact factor: 4.497

6.  Bias correction in species distribution models: pooling survey and collection data for multiple species.

Authors:  William Fithian; Jane Elith; Trevor Hastie; David A Keith
Journal:  Methods Ecol Evol       Date:  2014-10-10       Impact factor: 7.781

7.  Range bagging: a new method for ecological niche modelling from presence-only data.

Authors:  John M Drake
Journal:  J R Soc Interface       Date:  2015-06-06       Impact factor: 4.118

8.  PUlasso: High-Dimensional Variable Selection With Presence-Only Data.

Authors:  Hyebin Song; Garvesh Raskutti
Journal:  J Am Stat Assoc       Date:  2019-04-11       Impact factor: 5.033

9.  Finite-Sample Equivalence in Statistical Models for Presence-Only Data.

Authors:  William Fithian; Trevor Hastie
Journal:  Ann Appl Stat       Date:  2013-12-01       Impact factor: 2.083

Review 10.  Protein function in precision medicine: deep understanding with machine learning.

Authors:  Burkhard Rost; Predrag Radivojac; Yana Bromberg
Journal:  FEBS Lett       Date:  2016-08-06       Impact factor: 4.124

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.