| Literature DB >> 25619994 |
Evangelos Pafilis1, Sune P Frankild1, Julia Schnetzer2, Lucia Fanini1, Sarah Faulwetter1, Christina Pavloudi1, Katerina Vasileiadou1, Patrick Leary1, Jennifer Hammock1, Katja Schulz1, Cynthia Sims Parr1, Christos Arvanitidis1, Lars Juhl Jensen1.
Abstract
UNLABELLED: The association of organisms to their environments is a key issue in exploring biodiversity patterns. This knowledge has traditionally been scattered, but textual descriptions of taxa and their habitats are now being consolidated in centralized resources. However, structured annotations are needed to facilitate large-scale analyses. Therefore, we developed ENVIRONMENTS, a fast dictionary-based tagger capable of identifying Environment Ontology (ENVO) terms in text. We evaluate the accuracy of the tagger on a new manually curated corpus of 600 Encyclopedia of Life (EOL) species pages. We use the tagger to associate taxa with environments by tagging EOL text content monthly, and integrate the results into the EOL to disseminate them to a broad audience of users.Entities:
Mesh:
Year: 2015 PMID: 25619994 PMCID: PMC4443677 DOI: 10.1093/bioinformatics/btv045
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Top: The “Overview” tab of the EOL taxon pages show a subset of the ENVO terms obtained through text mining; an extended list of such terms is available in the “Data” tab. Parts of the page have been resized to improve readability. Bottom: The latter list provides links to the EOL text sections where each term was found (highlighted in bold)