Trevor G Bell1, Mukhlid Yousif1, Anna Kramvis1. 1. Hepatitis Virus Diversity Research Unit, Department of Internal Medicine, University of the Witwatersrand, 7 York Road, Parktown, South Africa.
Abstract
BACKGROUND: Hepatitis B virus (HBV) DNA sequence data from thousands of samples are present in the public sequence databases. No publicly available, up-to-date, multiple sequence alignments, containing full-length and subgenomic fragments per genotype, are available. Such alignments are useful in many analysis applications, including data-mining and phylogenetic analyses. RESULTS: By issuing a query, all HBV sequence data from the GenBank public database was downloaded (67,893 sequences). Full-length and subgenomic sequences, which were genotyped by the submitters (30,852 sequences), were placed into a multiple sequence alignment, for each genotype (genotype A: 5868 sequences, B: 4630, C: 7820, D: 8300, E: 2043, F: 985, G: 189, H: 108, I: 23), according to the results of offline BLAST searches against a custom reference library of full-length sequences. Further curation was performed to improve the alignment. CONCLUSIONS: The algorithm described in this paper generates, for each of the nine HBV genotypes, multiple sequence alignments, which contain full-length and subgenomic fragments. The alignments can be updated as new sequences become available in the online public sequence databases. The alignments are available at http://hvdr.bioinf.wits.ac.za/alignments.
BACKGROUND:Hepatitis B virus (HBV) DNA sequence data from thousands of samples are present in the public sequence databases. No publicly available, up-to-date, multiple sequence alignments, containing full-length and subgenomic fragments per genotype, are available. Such alignments are useful in many analysis applications, including data-mining and phylogenetic analyses. RESULTS: By issuing a query, all HBV sequence data from the GenBank public database was downloaded (67,893 sequences). Full-length and subgenomic sequences, which were genotyped by the submitters (30,852 sequences), were placed into a multiple sequence alignment, for each genotype (genotype A: 5868 sequences, B: 4630, C: 7820, D: 8300, E: 2043, F: 985, G: 189, H: 108, I: 23), according to the results of offline BLAST searches against a custom reference library of full-length sequences. Further curation was performed to improve the alignment. CONCLUSIONS: The algorithm described in this paper generates, for each of the nine HBV genotypes, multiple sequence alignments, which contain full-length and subgenomic fragments. The alignments can be updated as new sequences become available in the online public sequence databases. The alignments are available at http://hvdr.bioinf.wits.ac.za/alignments.
Entities:
Keywords:
Bioinformatics; Blast; Genbank; HBV; Hepatitis B virus; Sequence alignment
Authors: Tulio de Oliveira; Koen Deforche; Sharon Cassol; Mika Salminen; Dimitris Paraskevis; Chris Seebregts; Joe Snoeck; Estrelita Janse van Rensburg; Annemarie M J Wensing; David A van de Vijver; Charles A Boucher; Ricardo Camacho; Anne-Mieke Vandamme Journal: Bioinformatics Date: 2005-08-02 Impact factor: 6.937
Authors: Lilly K W Yuen; Anna Ayres; Margaret Littlejohn; Danielle Colledge; Andrew Edgely; William J Maskill; Stephen A Locarnini; Angeline Bartholomeusz Journal: Antiviral Res Date: 2006-12-22 Impact factor: 5.970
Authors: Helene Norder; Anne-Marie Couroucé; Pierre Coursaget; José M Echevarria; Shou-Dong Lee; Isa K Mushahwar; Betty H Robertson; Stephen Locarnini; Lars O Magnius Journal: Intervirology Date: 2004 Impact factor: 1.763
Authors: Dennis A Benson; Karen Clark; Ilene Karsch-Mizrachi; David J Lipman; James Ostell; Eric W Sayers Journal: Nucleic Acids Res Date: 2014-11-20 Impact factor: 19.160
Authors: Ceejay L Boyce; Stephaney Willis; Timothy N A Archampong; Margaret Lartey; Kwamena W Sagoe; Adjoa Obo-Akwa; Ernest Kenu; Awewura Kwara; Jason T Blackard Journal: Virus Genes Date: 2019-07-25 Impact factor: 2.332
Authors: Motswedi Anderson; Wonderful T Choga; Sikhulile Moyo; Trevor Graham Bell; Tshepiso Mbangiwa; Bonolo B Phinius; Lynette Bhebhe; Theresa K Sebunya; Joseph Makhema; Richard Marlink; Anna Kramvis; Max Essex; Rosemary M Musonda; Jason T Blackard; Simani Gaseitsiwe Journal: Genes (Basel) Date: 2018-08-21 Impact factor: 4.096
Authors: Anna L McNaughton; Peter A Revill; Margaret Littlejohn; Philippa C Matthews; M Azim Ansari Journal: J Gen Virol Date: 2020-03 Impact factor: 3.891
Authors: Mmatsatsi K Matlou; Lucinda R Gaelejwe; Andrew M Musyoki; J Nare Rakgole; Selokela G Selabe; Edina Amponsah-Dacosta Journal: Heliyon Date: 2019-04-06
Authors: Wonderful Tatenda Choga; Motswedi Anderson; Edward Zumbika; Bonolo B Phinius; Tshepiso Mbangiwa; Lynnette N Bhebhe; Kabo Baruti; Peter Opiyo Kimathi; Kaelo K Seatla; Rosemary M Musonda; Trevor Graham Bell; Sikhulile Moyo; Jason T Blackard; Simani Gaseitsiwe Journal: Viruses Date: 2020-07-06 Impact factor: 5.818
Authors: Vera Holzmayer; Robert Hance; Patricia Defechereux; Robert Grant; Mary C Kuhns; Gavin Cloherty; Mary A Rodgers Journal: J Med Virol Date: 2018-11-08 Impact factor: 2.327