Literature DB >> 12671005

Informatics for unveiling hidden genome signatures.

Takashi Abe1, Shigehiko Kanaya, Makoto Kinouchi, Yuta Ichiba, Tokio Kozuki, Toshimichi Ikemura.   

Abstract

With the increasing amount of available genome sequences, novel tools are needed for comprehensive analysis of species-specific sequence characteristics for a wide variety of genomes. We used an unsupervised neural network algorithm, a self-organizing map (SOM), to analyze di-, tri-, and tetranucleotide frequencies in a wide variety of prokaryotic and eukaryotic genomes. The SOM, which can cluster complex data efficiently, was shown to be an excellent tool for analyzing global characteristics of genome sequences and for revealing key combinations of oligonucleotides representing individual genomes. From analysis of 1- and 10-kb genomic sequences derived from 65 bacteria (a total of 170 Mb) and from 6 eukaryotes (460 Mb), clear species-specific separations of major portions of the sequences were obtained with the di-, tri-, and tetranucleotide SOMs. The unsupervised algorithm could recognize, in most 10-kb sequences, the species-specific characteristics (key combinations of oligonucleotide frequencies) that are signature features of each genome. We were able to classify DNA sequences within one and between many species into subgroups that corresponded generally to biological categories. Because the classification power is very high, the SOM is an efficient and fundamental bioinformatic strategy for extracting a wide range of genomic information from a vast amount of sequences.

Entities:  

Mesh:

Substances:

Year:  2003        PMID: 12671005      PMCID: PMC430167          DOI: 10.1101/gr.634603

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  24 in total

1.  Genome-scale compositional comparisons in eukaryotes.

Authors:  A J Gentles; S Karlin
Journal:  Genome Res       Date:  2001-04       Impact factor: 9.043

2.  The DNA sequence of human chromosome 21.

Authors:  M Hattori; A Fujiyama; T D Taylor; H Watanabe; T Yada; H S Park; A Toyoda; K Ishii; Y Totoki; D K Choi; Y Groner; E Soeda; M Ohki; T Takagi; Y Sakaki; S Taudien; K Blechschmidt; A Polley; U Menzel; J Delabar; K Kumpf; R Lehmann; D Patterson; K Reichwald; A Rump; M Schillhabel; A Schudy; W Zimmermann; A Rosenthal; J Kudoh; K Schibuya; K Kawasaki; S Asakawa; A Shintani; T Sasaki; K Nagamine; S Mitsuyama; S E Antonarakis; S Minoshima; N Shimizu; G Nordsiek; K Hornischer; P Brant; M Scharfe; O Schon; A Desario; J Reichelt; G Kauer; H Blocker; J Ramser; A Beck; S Klages; S Hennig; L Riesselmann; E Dagand; T Haaf; S Wehrmeyer; K Borzym; K Gardiner; D Nizetic; F Francis; H Lehrach; R Reinhardt; M L Yaspo
Journal:  Nature       Date:  2000-05-18       Impact factor: 49.962

Review 3.  Compositional bias in DNA.

Authors:  C Gautier
Journal:  Curr Opin Genet Dev       Date:  2000-12       Impact factor: 5.578

4.  Analysis of codon usage diversity of bacterial genes with a self-organizing map (SOM): characterization of horizontally transferred genes with emphasis on the E. coli O157 genome.

Authors:  S Kanaya; M Kinouchi; T Abe; Y Kudo; Y Yamada; T Nishi; H Mori; T Ikemura
Journal:  Gene       Date:  2001-10-03       Impact factor: 3.688

Review 5.  The evolution of isochores.

Authors:  A Eyre-Walker; L D Hurst
Journal:  Nat Rev Genet       Date:  2001-07       Impact factor: 53.242

6.  Analysis of codon usage patterns of bacterial genomes using the self-organizing map.

Authors:  H C Wang; J Badger; P Kearney; M Li
Journal:  Mol Biol Evol       Date:  2001-05       Impact factor: 16.240

Review 7.  Codon catalog usage and the genome hypothesis.

Authors:  R Grantham; C Gautier; M Gouy; R Mercier; A Pavé
Journal:  Nucleic Acids Res       Date:  1980-01-11       Impact factor: 16.971

8.  Chromosome-wide assessment of replication timing for human chromosomes 11q and 21q: disease-related genes in timing-switch regions.

Authors:  Yoshihisa Watanabe; Asao Fujiyama; Yuta Ichiba; Masahira Hattori; Tetsushi Yada; Yoshiyuki Sakaki; Toshimichi Ikemura
Journal:  Hum Mol Genet       Date:  2002-01-01       Impact factor: 6.150

9.  Requirement of CHROMOMETHYLASE3 for maintenance of CpXpG methylation.

Authors:  A M Lindroth; X Cao; J P Jackson; D Zilberman; C M McCallum; S Henikoff; S E Jacobsen
Journal:  Science       Date:  2001-05-10       Impact factor: 47.728

10.  Genome sequence of Yersinia pestis KIM.

Authors:  Wen Deng; Valerie Burland; Guy Plunkett; Adam Boutin; George F Mayhew; Paul Liss; Nicole T Perna; Debra J Rose; Bob Mau; Shiguo Zhou; David C Schwartz; Jaqueline D Fetherston; Luther E Lindler; Robert R Brubaker; Gregory V Plano; Susan C Straley; Kathleen A McDonough; Matthew L Nilles; Jyl S Matson; Frederick R Blattner; Robert D Perry
Journal:  J Bacteriol       Date:  2002-08       Impact factor: 3.490

View more
  82 in total

1.  Practical application of self-organizing maps to interrelate biodiversity and functional data in NGS-based metagenomics.

Authors:  Marc Weber; Hanno Teeling; Sixing Huang; Jost Waldmann; Mariette Kassabgy; Bernhard M Fuchs; Anna Klindworth; Christine Klockow; Antje Wichels; Gunnar Gerdts; Rudolf Amann; Frank Oliver Glöckner
Journal:  ISME J       Date:  2010-12-16       Impact factor: 10.302

Review 2.  Metabolomics and its role in understanding cellular responses in plants.

Authors:  Ritu Bhalla; Kothandaraman Narasimhan; Sanjay Swarup
Journal:  Plant Cell Rep       Date:  2005-11-16       Impact factor: 4.570

3.  First genome data from uncultured upland soil cluster alpha methanotrophs provide further evidence for a close phylogenetic relationship to Methylocapsa acidiphila B2 and for high-affinity methanotrophy involving particulate methane monooxygenase.

Authors:  Peter Ricke; Michael Kube; Satoshi Nakagawa; Christoph Erkel; Richard Reinhardt; Werner Liesack
Journal:  Appl Environ Microbiol       Date:  2005-11       Impact factor: 4.792

4.  A comparative categorization of protein function encoded in bacterial or archeal genomic islands.

Authors:  Rainer Merkl
Journal:  J Mol Evol       Date:  2005-12-06       Impact factor: 2.395

5.  Classification and regression tree (CART) analyses of genomic signatures reveal sets of tetramers that discriminate temperature optima of archaea and bacteria.

Authors:  Betsey Dexter Dyer; Michael J Kahn; Mark D Leblanc
Journal:  Archaea       Date:  2008-12       Impact factor: 3.273

Review 6.  A bioinformatician's guide to metagenomics.

Authors:  Victor Kunin; Alex Copeland; Alla Lapidus; Konstantinos Mavromatis; Philip Hugenholtz
Journal:  Microbiol Mol Biol Rev       Date:  2008-12       Impact factor: 11.056

7.  Super paramagnetic clustering of DNA sequences.

Authors:  Sugiarto Radjiman; Lianyi Han; Jian-Sheng Wang; Yu Zong Chen
Journal:  J Biol Phys       Date:  2006-01       Impact factor: 1.365

8.  Self-organizing maps: a tool to ascertain taxonomic relatedness based on features derived from 16S rDNA sequence.

Authors:  D V Raje; H J Purohit; Y P Badhe; S S Tambe; B D Kulkarni
Journal:  J Biosci       Date:  2010-12       Impact factor: 1.826

9.  Metabolomics approach for determining growth-specific metabolites based on Fourier transform ion cyclotron resonance mass spectrometry.

Authors:  Hiroki Takahashi; Kosuke Kai; Yoko Shinbo; Kenichi Tanaka; Daisaku Ohta; Taku Oshima; Md Altaf-Ul-Amin; Ken Kurokawa; Naotake Ogasawara; Shigehiko Kanaya
Journal:  Anal Bioanal Chem       Date:  2008-06-16       Impact factor: 4.142

Review 10.  Achievements and new knowledge unraveled by metagenomic approaches.

Authors:  Carola Simon; Rolf Daniel
Journal:  Appl Microbiol Biotechnol       Date:  2009-09-16       Impact factor: 4.813

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.