Literature DB >> 23221298

Building an efficient curation workflow for the Arabidopsis literature corpus.

Donghui Li1, Tanya Z Berardini, Robert J Muller, Eva Huala.   

Abstract

TAIR (The Arabidopsis Information Resource) is the model organism database (MOD) for Arabidopsis thaliana, a model plant with a literature corpus of about 39 000 articles in PubMed, with over 4300 new articles added in 2011. We have developed a literature curation workflow incorporating both automated and manual elements to cope with this flood of new research articles. The current workflow can be divided into two phases: article selection and curation. Structured controlled vocabularies, such as the Gene Ontology and Plant Ontology are used to capture free text information in the literature as succinct ontology-based annotations suitable for the application of computational analysis methods. We also describe our curation platform and the use of text mining tools in our workflow. Database URL: www.arabidopsis.org

Entities:  

Mesh:

Year:  2012        PMID: 23221298      PMCID: PMC3515862          DOI: 10.1093/database/bas047

Source DB:  PubMed          Journal:  Database (Oxford)        ISSN: 1758-0463            Impact factor:   3.451


  20 in total

1.  Plant Physiology and TAIR partnership.

Authors:  Donald R Ort; Aleel K Grennan
Journal:  Plant Physiol       Date:  2008-03       Impact factor: 8.340

2.  Textpresso: an ontology-based information retrieval and extraction system for biological literature.

Authors:  Hans-Michael Müller; Eimear E Kenny; Paul W Sternberg
Journal:  PLoS Biol       Date:  2004-09-21       Impact factor: 8.029

3.  Entity/quality-based logical definitions for the human skeletal phenome using PATO.

Authors:  Georgios V Gkoutos; Chris Mungall; Sandra Dolken; Michael Ashburner; Suzanna Lewis; John Hancock; Paul Schofield; Sebastian Kohler; Peter N Robinson
Journal:  Conf Proc IEEE Eng Med Biol Soc       Date:  2009

4.  Big data: The future of biocuration.

Authors:  Doug Howe; Maria Costanzo; Petra Fey; Takashi Gojobori; Linda Hannick; Winston Hide; David P Hill; Renate Kania; Mary Schaeffer; Susan St Pierre; Simon Twigger; Owen White; Seung Yon Rhee
Journal:  Nature       Date:  2008-09-04       Impact factor: 49.962

5.  The Arabidopsis Information Resource (TAIR): improved gene annotation and new tools.

Authors:  Philippe Lamesch; Tanya Z Berardini; Donghui Li; David Swarbreck; Christopher Wilks; Rajkumar Sasidharan; Robert Muller; Kate Dreher; Debbie L Alexander; Margarita Garcia-Hernandez; Athikkattuvalasu S Karthikeyan; Cynthia H Lee; William D Nelson; Larry Ploetz; Shanker Singh; April Wensel; Eva Huala
Journal:  Nucleic Acids Res       Date:  2011-12-02       Impact factor: 16.971

6.  Database resources of the National Center for Biotechnology Information.

Authors:  Eric W Sayers; Tanya Barrett; Dennis A Benson; Evan Bolton; Stephen H Bryant; Kathi Canese; Vyacheslav Chetvernin; Deanna M Church; Michael Dicuccio; Scott Federhen; Michael Feolo; Ian M Fingerman; Lewis Y Geer; Wolfgang Helmberg; Yuri Kapustin; Sergey Krasnov; David Landsman; David J Lipman; Zhiyong Lu; Thomas L Madden; Tom Madej; Donna R Maglott; Aron Marchler-Bauer; Vadim Miller; Ilene Karsch-Mizrachi; James Ostell; Anna Panchenko; Lon Phan; Kim D Pruitt; Gregory D Schuler; Edwin Sequeira; Stephen T Sherry; Martin Shumway; Karl Sirotkin; Douglas Slotta; Alexandre Souvorov; Grigory Starchenko; Tatiana A Tatusova; Lukas Wagner; Yanli Wang; W John Wilbur; Eugene Yaschenko; Jian Ye
Journal:  Nucleic Acids Res       Date:  2011-12-02       Impact factor: 16.971

7.  Text mining for the biocuration workflow.

Authors:  Lynette Hirschman; Gully A P C Burns; Martin Krallinger; Cecilia Arighi; K Bretonnel Cohen; Alfonso Valencia; Cathy H Wu; Andrew Chatr-Aryamontri; Karen G Dowell; Eva Huala; Anália Lourenço; Robert Nash; Anne-Lise Veuthey; Thomas Wiegers; Andrew G Winter
Journal:  Database (Oxford)       Date:  2012-04-18       Impact factor: 3.451

8.  Assessment of community-submitted ontology annotations from a novel database-journal partnership.

Authors:  Tanya Z Berardini; Donghui Li; Robert Muller; Raymond Chetty; Larry Ploetz; Shanker Singh; April Wensel; Eva Huala
Journal:  Database (Oxford)       Date:  2012-08-01       Impact factor: 3.451

9.  ChEBI: a database and ontology for chemical entities of biological interest.

Authors:  Kirill Degtyarenko; Paula de Matos; Marcus Ennis; Janna Hastings; Martin Zbinden; Alan McNaught; Rafael Alcántara; Michael Darsow; Mickaël Guedj; Michael Ashburner
Journal:  Nucleic Acids Res       Date:  2007-10-11       Impact factor: 16.971

10.  Semi-automated curation of protein subcellular localization: a text mining-based approach to Gene Ontology (GO) Cellular Component curation.

Authors:  Kimberly Van Auken; Joshua Jaffery; Juancarlos Chan; Hans-Michael Müller; Paul W Sternberg
Journal:  BMC Bioinformatics       Date:  2009-07-21       Impact factor: 3.169

View more
  14 in total

1.  An effective biomedical document classification scheme in support of biocuration: addressing class imbalance.

Authors:  Xiangying Jiang; Martin Ringwald; Judith A Blake; Cecilia Arighi; Gongbo Zhang; Hagit Shatkay
Journal:  Database (Oxford)       Date:  2019-01-01       Impact factor: 3.451

2.  The Arabidopsis information resource: Making and mining the "gold standard" annotated reference plant genome.

Authors:  Tanya Z Berardini; Leonore Reiser; Donghui Li; Yarik Mezheritsky; Robert Muller; Emily Strait; Eva Huala
Journal:  Genesis       Date:  2015-08-04       Impact factor: 2.487

3.  GOTA: GO term annotation of biomedical literature.

Authors:  Pietro Di Lena; Giacomo Domeniconi; Luciano Margara; Gianluca Moro
Journal:  BMC Bioinformatics       Date:  2015-10-28       Impact factor: 3.169

4.  Overview of the gene ontology task at BioCreative IV.

Authors:  Yuqing Mao; Kimberly Van Auken; Donghui Li; Cecilia N Arighi; Peter McQuilton; G Thomas Hayman; Susan Tweedie; Mary L Schaeffer; Stanley J F Laulederkind; Shur-Jen Wang; Julien Gobeill; Patrick Ruch; Anh Tuan Luu; Jung-Jae Kim; Jung-Hsien Chiang; Yu-De Chen; Chia-Jung Yang; Hongfang Liu; Dongqing Zhu; Yanpeng Li; Hong Yu; Ehsan Emadzadeh; Graciela Gonzalez; Jian-Ming Chen; Hong-Jie Dai; Zhiyong Lu
Journal:  Database (Oxford)       Date:  2014-08-25       Impact factor: 3.451

5.  In Silico Analysis of Correlations between Protein Disorder and Post-Translational Modifications in Algae.

Authors:  Atsushi Kurotani; Tetsuya Sakurai
Journal:  Int J Mol Sci       Date:  2015-08-20       Impact factor: 5.923

6.  RARGE II: an integrated phenotype database of Arabidopsis mutant traits using a controlled vocabulary.

Authors:  Kenji Akiyama; Atsushi Kurotani; Kei Iida; Takashi Kuromori; Kazuo Shinozaki; Tetsuya Sakurai
Journal:  Plant Cell Physiol       Date:  2013-11-21       Impact factor: 4.927

7.  Manually curated database of rice proteins.

Authors:  Pratibha Gour; Priyanka Garg; Rashmi Jain; Shaji V Joseph; Akhilesh K Tyagi; Saurabh Raghuvanshi
Journal:  Nucleic Acids Res       Date:  2013-11-07       Impact factor: 16.971

8.  A method for increasing expressivity of Gene Ontology annotations using a compositional approach.

Authors:  Rachael P Huntley; Midori A Harris; Yasmin Alam-Faruque; Judith A Blake; Seth Carbon; Heiko Dietze; Emily C Dimmer; Rebecca E Foulger; David P Hill; Varsha K Khodiyar; Antonia Lock; Jane Lomax; Ruth C Lovering; Prudence Mutowo-Meullenet; Tony Sawford; Kimberly Van Auken; Valerie Wood; Christopher J Mungall
Journal:  BMC Bioinformatics       Date:  2014-05-21       Impact factor: 3.169

9.  Hybrid curation of gene-mutation relations combining automated extraction and crowdsourcing.

Authors:  John D Burger; Emily Doughty; Ritu Khare; Chih-Hsuan Wei; Rajashree Mishra; John Aberdeen; David Tresner-Kirsch; Ben Wellner; Maricel G Kann; Zhiyong Lu; Lynette Hirschman
Journal:  Database (Oxford)       Date:  2014-09-22       Impact factor: 3.451

10.  BC4GO: a full-text corpus for the BioCreative IV GO task.

Authors:  Kimberly Van Auken; Mary L Schaeffer; Peter McQuilton; Stanley J F Laulederkind; Donghui Li; Shur-Jen Wang; G Thomas Hayman; Susan Tweedie; Cecilia N Arighi; James Done; Hans-Michael Müller; Paul W Sternberg; Yuqing Mao; Chih-Hsuan Wei; Zhiyong Lu
Journal:  Database (Oxford)       Date:  2014-07-28       Impact factor: 3.451

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.