Literature DB >> 14757821

A method for finding communities of related genes.

Dennis M Wilkinson1, Bernardo A Huberman.   

Abstract

We present a method for creating a network of gene co-occurrences from the literature and partitioning it into communities of related genes. The way in which our method identifies communities makes it likely that the component genes of each community will be related by their function. The method processes a large database of article abstracts, synthesizing information from many sources to shed light on groups of genes that have been shown to interact. It is a tool to be used by researchers in the biomedical sciences to swiftly search for known interactions and to provide insight into unexplored connections. The partitioning procedure is designed to be particularly applicable to large networks in which individual nodes may play a role in more than one community. In this paper, we explain the details of the method, in particular the partitioning process. We also apply the method to produce communities of genes related to colon cancer and show that the results are useful.

Entities:  

Mesh:

Substances:

Year:  2004        PMID: 14757821      PMCID: PMC387302          DOI: 10.1073/pnas.0307740100

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  16 in total

1.  Genome evolution. Global methylation in eutherian hybrids.

Authors:  I Roemer; F Grützner; H Winking; T Haaf; A Orth; L Skidmore; D Antczak; R Fundele
Journal:  Nature       Date:  1999-09-09       Impact factor: 49.962

2.  A literature network of human genes for high-throughput analysis of gene expression.

Authors:  T K Jenssen; A Laegreid; J Komorowski; E Hovig
Journal:  Nat Genet       Date:  2001-05       Impact factor: 38.330

3.  MedMiner: an Internet text-mining tool for biomedical information, with application to gene expression profiling.

Authors:  L Tanabe; U Scherf; L H Smith; J K Lee; L Hunter; J N Weinstein
Journal:  Biotechniques       Date:  1999-12       Impact factor: 1.993

4.  Detecting Gene Symbols and Names in Biological Texts: A First Step toward Pertinent Information Extraction.

Authors: 
Journal:  Genome Inform Ser Workshop Genome Inform       Date:  1998

5.  Toward Routine Automatic Pathway Discovery from On-line Scientific Text Abstracts.

Authors: 
Journal:  Genome Inform Ser Workshop Genome Inform       Date:  1999

6.  Biobibliometrics: information retrieval and visualization from co-occurrences of gene names in Medline abstracts.

Authors:  B J Stapley; G Benoit
Journal:  Pac Symp Biocomput       Date:  2000

7.  GENIES: a natural-language processing system for the extraction of molecular pathways from journal articles.

Authors:  C Friedman; P Kra; H Yu; M Krauthammer; A Rzhetsky
Journal:  Bioinformatics       Date:  2001       Impact factor: 6.937

8.  Robust relational parsing over biomedical literature: extracting inhibit relations.

Authors:  J Pustejovsky; J Castaño; J Zhang; M Kotecki; B Cochran
Journal:  Pac Symp Biocomput       Date:  2002

9.  Using text analysis to identify functionally coherent gene groups.

Authors:  Soumya Raychaudhuri; Hinrich Schütze; Russ B Altman
Journal:  Genome Res       Date:  2002-10       Impact factor: 9.043

10.  A literature based method for identifying gene-disease connections.

Authors:  Lada A Adamic; Dennis Wilkinson; Bernardo A Huberman; Eytan Adar
Journal:  Proc IEEE Comput Soc Bioinform Conf       Date:  2002
View more
  19 in total

1.  From single genes to co-expression networks: extracting knowledge from barley functional genomics.

Authors:  P Faccioli; P Provero; C Herrmann; A M Stanca; C Morcia; V Terzi
Journal:  Plant Mol Biol       Date:  2005-07       Impact factor: 4.076

2.  Complex network analysis of free-energy landscapes.

Authors:  D Gfeller; P De Los Rios; A Caflisch; F Rao
Journal:  Proc Natl Acad Sci U S A       Date:  2007-01-31       Impact factor: 11.205

3.  Clique-based data mining for related genes in a biomedical database.

Authors:  Tsutomu Matsunaga; Chikara Yonemori; Etsuji Tomita; Masaaki Muramatsu
Journal:  BMC Bioinformatics       Date:  2009-07-01       Impact factor: 3.169

Review 4.  Cyclophilin nomenclature problems, or, 'a visit from the sequence police'.

Authors:  Daniel W Nebert; Nickolas A Sophos; Vasilis Vasiliou; David R Nelson
Journal:  Hum Genomics       Date:  2004-08       Impact factor: 4.639

5.  Building a protein name dictionary from full text: a machine learning term extraction approach.

Authors:  Lei Shi; Fabien Campagne
Journal:  BMC Bioinformatics       Date:  2005-04-07       Impact factor: 3.169

6.  Improved network community structure improves function prediction.

Authors:  Juyong Lee; Steven P Gross; Jooyoung Lee
Journal:  Sci Rep       Date:  2013       Impact factor: 4.379

7.  The effect of edge definition of complex networks on protein structure identification.

Authors:  Jing Sun; Runyu Jing; Di Wu; Tuanfei Zhu; Menglong Li; Yizhou Li
Journal:  Comput Math Methods Med       Date:  2013-02-28       Impact factor: 2.238

8.  Discovering the hidden sub-network component in a ranked list of genes or proteins derived from genomic experiments.

Authors:  Luz García-Alonso; Roberto Alonso; Enrique Vidal; Alicia Amadoz; Alejandro de María; Pablo Minguez; Ignacio Medina; Joaquín Dopazo
Journal:  Nucleic Acids Res       Date:  2012-07-27       Impact factor: 16.971

9.  Application of a new probabilistic model for mining implicit associated cancer genes from OMIM and medline.

Authors:  Shanfeng Zhu; Yasushi Okuno; Gozoh Tsujimoto; Hiroshi Mamitsuka
Journal:  Cancer Inform       Date:  2007-02-25

10.  The transfer and transformation of collective network information in gene-matched networks.

Authors:  Takashi Kitsukawa; Takeshi Yagi
Journal:  Sci Rep       Date:  2015-10-09       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.