Literature DB >> 16364569

Self-Organizing Map (SOM) unveils and visualizes hidden sequence characteristics of a wide range of eukaryote genomes.

Takashi Abe1, Hideaki Sugawara, Shigehiko Kanaya, Makoto Kinouchi, Toshimichi Ikemura.   

Abstract

Novel tools are needed for comprehensive comparisons of interspecies characteristics of massive amounts of genomic sequences currently available. An unsupervised neural network algorithm, Self-Organizing Map (SOM), is an effective tool for clustering and visualizing high-dimensional complex data on a single map. We modified the conventional SOM, on the basis of batch-learning SOM, for genome informatics making the learning process and resulting map independent of the order of data input. We generated the SOMs for tri- and tetranucleotide frequencies in 10- and 100-kb sequence fragments from 38 eukaryotes for which almost complete genome sequences are available. SOM recognized species-specific characteristics (key combinations of oligonucleotide frequencies) in the genomic sequences, permitting species-specific classification of the sequences without any information regarding the species. We also generated the SOM for tetranucleotide frequencies in 1-kb sequence fragments from the human genome and found sequences for four functional categories (5' and 3' UTRs, CDSs and introns) were classified primarily according to the categories. Because the classification and visualization power is very high, SOM is an efficient and powerful tool for extracting a wide range of genome information.

Entities:  

Mesh:

Substances:

Year:  2005        PMID: 16364569     DOI: 10.1016/j.gene.2005.09.040

Source DB:  PubMed          Journal:  Gene        ISSN: 0378-1119            Impact factor:   3.688


  15 in total

1.  Classification and regression tree (CART) analyses of genomic signatures reveal sets of tetramers that discriminate temperature optima of archaea and bacteria.

Authors:  Betsey Dexter Dyer; Michael J Kahn; Mark D Leblanc
Journal:  Archaea       Date:  2008-12       Impact factor: 3.273

2.  Comparative genomic analysis of the human genome and six bat genomes using unsupervised machine learning: Mb-level CpG and TFBS islands.

Authors:  Yuki Iwasaki; Toshimichi Ikemura; Kennosuke Wada; Yoshiko Wada; Takashi Abe
Journal:  BMC Genomics       Date:  2022-07-08       Impact factor: 4.547

3.  The genome of Pelotomaculum thermopropionicum reveals niche-associated evolution in anaerobic microbiota.

Authors:  Tomoyuki Kosaka; Souichiro Kato; Takefumi Shimoyama; Shunichi Ishii; Takashi Abe; Kazuya Watanabe
Journal:  Genome Res       Date:  2008-01-24       Impact factor: 9.043

4.  Resolving prokaryotic taxonomy without rRNA: longer oligonucleotide word lengths improve genome and metagenome taxonomic classification.

Authors:  Eric B Alsop; Jason Raymond
Journal:  PLoS One       Date:  2013-07-01       Impact factor: 3.240

5.  Alignment-free visualization of metagenomic data by nonlinear dimension reduction.

Authors:  Cedric C Laczny; Nicolás Pinel; Nikos Vlassis; Paul Wilmes
Journal:  Sci Rep       Date:  2014-03-31       Impact factor: 4.379

6.  A novel bioinformatics strategy for function prediction of poorly-characterized protein genes obtained from metagenome analyses.

Authors:  Takashi Abe; Shigehiko Kanaya; Hiroshi Uehara; Toshimichi Ikemura
Journal:  DNA Res       Date:  2009-10-03       Impact factor: 4.458

7.  A supervised learning approach for taxonomic classification of core-photosystem-II genes and transcripts in the marine environment.

Authors:  Shani Tzahor; Dikla Man-Aharonovich; Benjamin C Kirkup; Tali Yogev; Ilana Berman-Frank; Martin F Polz; Oded Béjà; Yael Mandel-Gutfreund
Journal:  BMC Genomics       Date:  2009-05-16       Impact factor: 3.969

8.  Using growing self-organising maps to improve the binning process in environmental whole-genome shotgun sequencing.

Authors:  Chon-Kit Kenneth Chan; Arthur L Hsu; Sen-Lin Tang; Saman K Halgamuge
Journal:  J Biomed Biotechnol       Date:  2008

9.  Notable clustering of transcription-factor-binding motifs in human pericentric regions and its biological significance.

Authors:  Yuki Iwasaki; Kennosuke Wada; Yoshiko Wada; Takashi Abe; Toshimichi Ikemura
Journal:  Chromosome Res       Date:  2013-07-30       Impact factor: 5.239

10.  Visualization of genome signatures of eukaryote genomes by batch-learning self-organizing map with a special emphasis on Drosophila genomes.

Authors:  Takashi Abe; Yuta Hamano; Toshimichi Ikemura
Journal:  Biomed Res Int       Date:  2014-03-11       Impact factor: 3.411

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.