Literature DB >> 25344495

A novel representation of genomic sequences for taxonomic clustering and visualization by means of self-organizing maps.

Soledad Delgado1, Federico Morán1, Antonio Mora2, Juan Julián Merelo2, Carlos Briones2.   

Abstract

MOTIVATION: Self-organizing maps (SOMs) are readily available bioinformatics methods for clustering and visualizing high-dimensional data, provided that such biological information is previously transformed to fixed-size, metric-based vectors. To increase the usefulness of SOM-based approaches for the analysis of genomic sequence data, novel representation methods are required that automatically and objectively transform aligned nucleotide sequences into numeric vectors, dealing with both nucleotide ambiguity and gaps derived from sequence alignment.
RESULTS: Six different codification variants based on Euclidean space, just like SOM processing, have been tested using two SOM models: the classical Kohonen's SOM and growing cell structures. They have been applied to two different sets of sequences: 32 sequences of small sub-unit ribosomal RNA from organisms belonging to the three domains of life, and 44 sequences of the reverse transcriptase region of the pol gene of human immunodeficiency virus type 1 belonging to different groups and sub-types. Our results show that the most important factor affecting the accuracy of sequence clustering is the assignment of an extra weight to the presence of alignment-derived gaps. Although each of the codification variants shows a different level of taxonomic consistency, the results are in agreement with sequence-based phylogenetic reconstructions and anticipate a broad applicability of this codification method.
© The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 25344495     DOI: 10.1093/bioinformatics/btu708

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  4 in total

1.  Can artificial neural replicators be useful for studying RNA replicators?

Authors:  Alexandr A Ezhov
Journal:  Arch Virol       Date:  2020-08-19       Impact factor: 2.574

2.  SARS-CoV-2 Mutant Spectra at Different Depth Levels Reveal an Overwhelming Abundance of Low Frequency Mutations.

Authors:  Brenda Martínez-González; María Eugenia Soria; Lucía Vázquez-Sirvent; Cristina Ferrer-Orta; Rebeca Lobo-Vega; Pablo Mínguez; Lorena de la Fuente; Carlos Llorens; Beatriz Soriano; Ricardo Ramos-Ruíz; Marta Cortón; Rosario López-Rodríguez; Carlos García-Crespo; Pilar Somovilla; Antoni Durán-Pastor; Isabel Gallego; Ana Isabel de Ávila; Soledad Delgado; Federico Morán; Cecilio López-Galíndez; Jordi Gómez; Luis Enjuanes; Llanos Salar-Vidal; Mario Esteban-Muñoz; Jaime Esteban; Ricardo Fernández-Roblas; Ignacio Gadea; Carmen Ayuso; Javier Ruíz-Hornillos; Nuria Verdaguer; Esteban Domingo; Celia Perales
Journal:  Pathogens       Date:  2022-06-08

3.  A Two-Level, Intramutant Spectrum Haplotype Profile of Hepatitis C Virus Revealed by Self-Organized Maps.

Authors:  Soledad Delgado; Celia Perales; Carlos García-Crespo; María Eugenia Soria; Isabel Gallego; Ana Isabel de Ávila; Brenda Martínez-González; Lucía Vázquez-Sirvent; Cecilio López-Galíndez; Federico Morán; Esteban Domingo
Journal:  Microbiol Spectr       Date:  2021-11-10

4.  An incremental anomaly detection model for virtual machines.

Authors:  Hancui Zhang; Shuyu Chen; Jun Liu; Zhen Zhou; Tianshu Wu
Journal:  PLoS One       Date:  2017-11-08       Impact factor: 3.240

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.