| Literature DB >> 35348640 |
Zhenyu Yang1, Wenjing Yi1, Jin Tao1, Xiaoqing Liu2, Michael Q Zhang3,4, Guiqian Chen1, Qi Dai1,3.
Abstract
Human papillomavirus (HPV) can cause condyloma acuminatum and cervical cancer. Some mutations of these viruses are closely related to the persistent infection of cervical cancer and are ideal cancer vaccine targets. Several databases have been developed to collect HPV sequences, but no HPV mutation database has been published. This paper reports a Chinese HPV mutation database (HPVMD-C), which contains 149 HPV genotypes, 468 HPV mutations, 3409 protein sequences, 4727 domains and 236 epitopes. We analyzed the mutation distribution among HPV genotypes, domains and epitopes. We designed a visualization tool to display these mutations, domains and epitopes and provided more detailed information about the disease, region and related literature. We also proposed an HPV genotype prediction tool, which can predict HPV carcinogenic or non-carcinogenic risk genotypes. We expect that HPVMD-C will complement the existing database and provide valuable resources for HPV vaccine research and cervical cancer treatment. HPVMD-C is freely available at Database URL: http://bioinfo.zstu.edu.cn/hpv.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35348640 PMCID: PMC9216535 DOI: 10.1093/database/baac018
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 4.462
Figure 1.An overview of HPVMD-C 1.0: (i) Distribution of mutation content among the protein sequences, the domains and the epitopes; (ii) Distribution of the mutations among the domains; (iii) Distribution of the mutations among the epitopes; (iv) Mutation detection using BLAST; (v) Risk type prediction using the characteristics of amino acids and the SVM.
Figure 2.Distribution of mutation content among the whole sequences, the domains and the epitopes; the parameters on the left can adjust the display of the pie chart.
Figure 3.An example of HPVMD-C 1.0 usage: (A) Prediction page—Select optimal parameters: ‘E6’ data set, beta propensity characteristic index ‘BURA740102’, reduction number 8 and ‘RCTD’ feature; submit and get the result (B); (C) Blast search page and the result page—Use arrow to mark points that are different from the query sequence and highlight the mutation about this sequence in red; using mouse to slide over a mutation will pop up a detail box; references result page (D) will be displayed by clicking the references link.
Figure 4.(A) Mutation section in ‘Detail’ page: Use visualization techniques to show the position and type of mutations; related diseases, races, regional information and literature links are also listed; (B) Domain section in detail page: In the graphical view, bar represents the entire sequence, dark part represents the domain or motif and dots represent the mutation; (C) Epitope section in detail page: The second line shows the entire sequence, light part highlights the epitope and dark part highlights the mutation; inside two boxes are the epitopes containing N29S; (D) Epitope conservation result page: The query epitope TC000124 are highlighted; the conservation rate is 48.65%.