Inferring higher functional information for RIKEN mouse full-length cDNA clones with FACTS.

Takeshi Nagashima, Diego G Silva, Nikolai Petrovsky, Luis A Socha, Harukazu Suzuki, Rintaro Saito, Takeya Kasukawa, Igor V Kurochkin, Akihiko Konagaya, Christian Schönbach.

Abstract

FACTS (Functional Association/Annotation of cDNA Clones from Text/Sequence Sources) is a semiautomated knowledge discovery and annotation system that integrates molecular function information derived from sequence analysis results (sequence inferred) with functional information extracted from text. Text-inferred information was extracted from keyword-based retrievals of MEDLINE abstracts and by matching of gene or protein names to OMIM, BIND, and DIP database entries. Using FACTS, we found that 47.5% of the 60,770 RIKEN mouse cDNA FANTOM2 clone annotations were informative for text searches. MEDLINE queries yielded molecular interaction-containing sentences for 23.1% of the clones. When disease MeSH and GO terms were matched with retrieved abstracts, 22.7% of clones were associated with potential diseases, and 32.5% with GO identifiers. A significant number (23.5%) of disease MeSH-associated clones were also found to have a hereditary disease association (OMIM Morbidmap). Inferred neoplastic and nervous system disease represented 49.6% and 36.0% of disease MeSH-associated clones, respectively. A comparison of sequence-based GO assignments with informative text-based GO assignments revealed that for 78.2% of clones, identical GO assignments were provided for that clone by either method, whereas for 21.8% of clones, the assignments differed. In contrast, for OMIM assignments, only 28.5% of clones had identical sequence-based and text-based OMIM assignments. Sequence, sentence, and term-based functional associations are included in the FACTS database (http://facts.gsc.riken.go.jp/), which permits results to be annotated and explored through web-accessible keyword and sequence search interfaces. The FACTS database will be a critical tool for investigating the functional complexity of the mouse transcriptome, cDNA-inferred interactome (molecular interactions), and pathome (pathologies).

Year:  2003        PMID: 12819151      PMCID: PMC403704          DOI: 10.1101/gr.1019903

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  6 in total

1.  FREP: a database of functional repeats in mouse cDNAs.

Authors:  Takeshi Nagashima; Hideo Matsuda; Diego G Silva; Nikolai Petrovsky; Akihiko Konagaya; Christian Schönbach; Takeya Kasukawa; Takahiro Arakawa; Piero Carninci; Jun Kawai; Yoshihide Hayashizaki
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

2.  The mammalian protein-protein interaction database and its viewing system that is linked to the main FANTOM2 viewer.

Authors:  Harukazu Suzuki; Rintaro Saito; Mutsumi Kanamori; Chikatoshi Kai; Christian Schönbach; Takeshi Nagashima; Junko Hosaka; Yoshihide Hayashizaki
Journal:  Genome Res       Date:  2003-06       Impact factor: 9.043

3.  Impact of alternative initiation, splicing, and termination on the diversity of the mRNA transcripts encoded by the mouse transcriptome.

Authors:  Mihaela Zavolan; Shinji Kondo; Christian Schonbach; Jun Adachi; David A Hume; Yoshihide Hayashizaki; Terry Gaasterland
Journal:  Genome Res       Date:  2003-06       Impact factor: 9.043

4.  Impairment of organ-specific T cell negative selection by diabetes susceptibility genes: genomic analysis by mRNA profiling.

Authors:  Adrian Liston; Kristine Hardy; Yvonne Pittelkow; Susan R Wilson; Lydia E Makaroff; Aude M Fahrer; Christopher C Goodnow
Journal:  Genome Biol       Date:  2007       Impact factor: 13.583

5.  Ontological visualization of protein-protein interactions.

Authors:  Harold J Drabkin; Christopher Hollenbeck; David P Hill; Judith A Blake
Journal:  BMC Bioinformatics       Date:  2005-02-11       Impact factor: 3.169

6.  Identification of "pathologs" (disease-related genes) from the RIKEN mouse cDNA dataset using human curation plus FACTS, a new biological information extraction system.

Authors:  Diego G Silva; Christian Schönbach; Vladimir Brusic; Luis A Socha; Takeshi Nagashima; Nikolai Petrovsky
Journal:  BMC Genomics       Date:  2004-04-29       Impact factor: 3.969
