| Literature DB >> 18025038 |
Carla Kuiken1, Peter Hraber, James Thurmond, Karina Yusim.
Abstract
The hepatitis C virus (HCV) is a significant public health threat worldwide. The virus is highly variable and evolves rapidly, making it an elusive target for the immune system and for vaccine and drug design. Presently, approximately 50 000 HCV sequences have been published. A central website that provides annotated sequences and analysis tools will be helpful to HCV scientists worldwide. The HCV sequence database collects and annotates sequence data, and provides them to the public via a website that contains a user-friendly search interface and a large number of sequence analysis tools, following the model of the highly regarded and widely used Los Alamos HIV database. The HCV website can be accessed via http://hcv.lanl.gov and http://hcv-db.org.Entities:
Mesh:
Substances:
Year: 2007 PMID: 18025038 PMCID: PMC2238885 DOI: 10.1093/nar/gkm962
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Histograms showing the number, genotype and genomic region for two sets of sequences: all those sampled in Africa, and all those sampled in the United States of America. Please note that the scales of the vertical axes are different. It can immediately be seen that there are many more sequences from the USA; that they are mostly of genotype 1 whereas African sequences are of all genotypes; and that the best region for comparative analysis right now (because most background sequences would be available) in Africa would be NS5B or Core/E1, while for the US it would be E1/E2.
Figure 2.World map showing the geographical distribution of sequences with different genotypes in the HCV database. The map was generated using the geography tool, available on the HCV website.
Figure 3.Output of the Sequence Locator tool, which finds the location of a sequence fragment of any length. The graphic shows where the user's input sequence is located; the table beneath it shows the amino acid translation and coordinates of all of the genes included in the fragment.