
Using genomic signatures for HIV-1 sub-typing.

Aridaman Pandit, Somdatta Sinha.

Abstract

BACKGROUND: Human Immunodeficiency Virus type 1 (HIV-1), the causative agent of Acquired Immune Deficiency Syndrome (AIDS), exhibits very high genetic diversity, with different variants or subtypes prevalent in different parts of the world. Proper classification of HIV-1 subtypes, which display differential infectivity, plays a major role in monitoring the epidemic and is also a critical component of an effective treatment strategy. The existing methods for classifying HIV-1 sequence subtypes, based on phylogenetic analysis focusing only on specific genes/regions, have shown inconsistencies as they lack the capability to analyse whole-genome variations. Several isolates are left unclassified due to unresolved sub-typing. It is apparent that classification of subtypes based on complete genome sequences, rather than sub-genomic regions, is a more robust and comprehensive approach to address genome-wide heterogeneity. However, no simple methodology exists that directly computes the HIV-1 subtype from the complete genome sequence.
RESULTS: We use Chaos Game Representation (CGR) as an approach to identify the distinctive genomic signature associated with the DNA sequence organisation in different HIV-1 subtypes. We first analysed the effect of nucleotide word lengths (k = 2 to 8) on whole genomes of the HIV-1 M group sequences, and found an optimum word length of k = 6 that could classify HIV-1 subtypes, based on a Test sequence set. Using the optimised word length, we then showed accurate classification of the HIV-1 subtypes from both the Reference Set sequences and all available sequences in the database. Finally, we applied the approach to cluster the five unclassified HIV-1 sequences from Africa and Europe and to predict their possible subtypes.
CONCLUSION: We propose a genomic signature-based approach, using CGR with suitable word length optimisation, which can be applied to classify intra-species variations, and apply it to the complex problem of HIV-1 subtype classification. We demonstrate that CGR is a simple and computationally less intensive method that not only accurately segregates the HIV-1 subtypes and sub-subtypes, but also aids in the classification of the unclassified sequences. We hope that it will be useful in subtype annotation of newly sequenced HIV-1 genomes.
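
As an illustration of the idea in the abstract, the following is a minimal Python sketch of a CGR-style genomic signature. It is not the authors' implementation. The corner assignment, the function names (`cgr_points`, `kmer_frequencies`, `signature_distance`), the skipping of ambiguous bases, and the use of Euclidean distance between signatures are all assumptions made for illustration; the abstract does not state which distance measure or clustering procedure was used. The sketch relies on a standard property of CGR: the cell of a 2^k x 2^k grid in which a CGR point falls is determined by the last k nucleotides, so a frequency CGR at word length k is equivalent to a table of k-mer frequencies.

```python
from collections import Counter
from math import sqrt

# Corner assignment is one common convention (an assumption here).
CORNERS = {"A": (0, 0), "C": (0, 1), "G": (1, 1), "T": (1, 0)}


def cgr_points(seq):
    """Yield CGR coordinates: start at the centre of the unit square and
    move halfway towards the corner of each successive nucleotide."""
    x, y = 0.5, 0.5
    for base in seq.upper():
        if base not in CORNERS:
            continue  # assumption: skip ambiguous bases such as N
        cx, cy = CORNERS[base]
        x, y = (x + cx) / 2, (y + cy) / 2
        yield x, y


def kmer_frequencies(seq, k):
    """Return relative frequencies of all words of length k (the
    frequency-CGR signature at word length k)."""
    seq = seq.upper()
    counts = Counter()
    for i in range(len(seq) - k + 1):
        word = seq[i:i + k]
        if set(word) <= set("ACGT"):  # ignore words with ambiguous bases
            counts[word] += 1
    total = sum(counts.values())
    return {w: c / total for w, c in counts.items()} if total else {}


def signature_distance(f1, f2):
    """Euclidean distance between two signatures (illustrative choice)."""
    words = set(f1) | set(f2)
    return sqrt(sum((f1.get(w, 0.0) - f2.get(w, 0.0)) ** 2 for w in words))
```

With the word length reported in the abstract, a genome would be summarised as `kmer_frequencies(genome, 6)`, and an unclassified sequence could then be compared against subtype reference signatures using `signature_distance`; how those distances are turned into clusters is left open here.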


Year:  2010        PMID: 20122198      PMCID: PMC3009497          DOI: 10.1186/1471-2105-11-S1-S26

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


