Literature DB >> 17238429

Literature based discovery of gene clusters using phylogenetic methods.

Indra Neil Sarkar1, Abha Agrawal.   

Abstract

Biomedical literature can offer valuable information for organizing genes associated with the etiology and pathogenesis of disease. In this study, we demonstrate the utility of existing phylogenetic methods for organizing 375 genes associated with Breast Cancer using the MeSH annotations from over 35,000 Medline articles. Specifically, we compare the clustering (using the Colless Imbalance Index, Ic) of distance-based methods, which are used by popular phylogenetic clustering algorithms, and a character- based method (Maximum Parsimony) that is commonly used for phylogenetic studies. Focusing on genes that cluster around BRCA1 and BRCA2, we examine the relevance of the clustered genes proposed by the different clustering methods based on the number of exclusive MeSH terms. Our results indicate that existing phylogenetic methods and associated metrics can be used for organizing genes according to annotated knowledge in biomedical literature.

Entities:  

Mesh:

Year:  2006        PMID: 17238429      PMCID: PMC1839645     

Source DB:  PubMed          Journal:  AMIA Annu Symp Proc        ISSN: 1559-4076


  14 in total

1.  Textquest: document clustering of Medline abstracts for concept discovery in molecular biology.

Authors:  I Iliopoulos; A J Enright; C A Ouzounis
Journal:  Pac Symp Biocomput       Date:  2001

2.  Characteristic attributes in cancer microarrays.

Authors:  I N Sarkar; P J Planet; T E Bael; S E Stanley; M Siddall; R DeSalle; D H Figurski
Journal:  J Biomed Inform       Date:  2002-04       Impact factor: 6.317

3.  MrBayes 3: Bayesian phylogenetic inference under mixed models.

Authors:  Fredrik Ronquist; John P Huelsenbeck
Journal:  Bioinformatics       Date:  2003-08-12       Impact factor: 6.937

4.  MedScan, a natural language processing engine for MEDLINE abstracts.

Authors:  Svetlana Novichkova; Sergei Egorov; Nikolai Daraselia
Journal:  Bioinformatics       Date:  2003-09-01       Impact factor: 6.937

5.  More genes or more taxa? The relative contribution of gene number and taxon number to phylogenetic accuracy.

Authors:  Antonis Rokas; Sean B Carroll
Journal:  Mol Biol Evol       Date:  2005-03-02       Impact factor: 16.240

Review 6.  What's in a character?

Authors:  Rob DeSalle
Journal:  J Biomed Inform       Date:  2005-12-01       Impact factor: 6.317

7.  Overview of the HUPO Plasma Proteome Project: results from the pilot phase with 35 collaborating laboratories and multiple analytical groups, generating a core dataset of 3020 proteins and a publicly-available database.

Authors:  Gilbert S Omenn; David J States; Marcin Adamski; Thomas W Blackwell; Rajasree Menon; Henning Hermjakob; Rolf Apweiler; Brian B Haab; Richard J Simpson; James S Eddes; Eugene A Kapp; Robert L Moritz; Daniel W Chan; Alex J Rai; Arie Admon; Ruedi Aebersold; Jimmy Eng; William S Hancock; Stanley A Hefta; Helmut Meyer; Young-Ki Paik; Jong-Shin Yoo; Peipei Ping; Joel Pounds; Joshua Adkins; Xiaohong Qian; Rong Wang; Valerie Wasinger; Chi Yue Wu; Xiaohang Zhao; Rong Zeng; Alexander Archakov; Akira Tsugita; Ilan Beer; Akhilesh Pandey; Michael Pisano; Philip Andrews; Harald Tammen; David W Speicher; Samir M Hanash
Journal:  Proteomics       Date:  2005-08       Impact factor: 3.984

Review 8.  Support versus corroboration.

Authors:  Mary G Egan
Journal:  J Biomed Inform       Date:  2005-12-07       Impact factor: 6.317

Review 9.  Automated extraction of information in molecular biology.

Authors:  M A Andrade; P Bork
Journal:  FEBS Lett       Date:  2000-06-30       Impact factor: 4.124

10.  Systematic association of genes to phenotypes by genome and literature mining.

Authors:  Jan O Korbel; Tobias Doerks; Lars J Jensen; Carolina Perez-Iratxeta; Szymon Kaczanowski; Sean D Hooper; Miguel A Andrade; Peer Bork
Journal:  PLoS Biol       Date:  2005-04-05       Impact factor: 8.029

View more
  1 in total

1.  Quantitative biomedical annotation using medical subject heading over-representation profiles (MeSHOPs).

Authors:  Warren A Cheung; B F Francis Ouellette; Wyeth W Wasserman
Journal:  BMC Bioinformatics       Date:  2012-09-27       Impact factor: 3.169

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.