Yuhan Hao1,2, Liying Yang1,3, Antonio Galvao Neto1, Milan R Amin4, Dervla Kelly1, Stuart M Brown2,5, Ryan C Branski4, Zhiheng Pei1,3,6. 1. Department of Pathology. 2. Applied Bioinformatics Laboratories. 3. Department of Medicine. 4. Department of Otolaryngology-Head and Neck Surgery. 5. Department of Cell Biology, New York University School of Medicine, New York, NY, USA. 6. Department of Veterans Affairs New York Harbor Healthcare System, New York, NY, USA.
Abstract
Motivation: Shotgun DNA sequencing provides sensitive detection of all 182 HPV types in tissue and body fluid. However, existing computational methods either produce false positives misidentifying HPV types due to shared sequences among HPV, human and prokaryotes, or produce false negative since they identify HPV by assembled contigs requiring large abundant of HPV reads. Results: We designed HPViewer with two custom HPV reference databases masking simple repeats and homology sequences respectively and one homology distance matrix to hybridize these two databases. It directly identified HPV from short DNA reads rather than assembled contigs. Using 100 100 simulated samples, we revealed that HPViewer was robust for samples containing either high or low number of HPV reads. Using 12 shotgun sequencing samples from respiratory papillomatosis, HPViewer was equal to VirusTAP, and Vipie and better than HPVDetector with the respect to specificity and was the most sensitive method in the detection of HPV types 6 and 11. We demonstrated that contigs-based approaches had disadvantages of detection of HPV. In 1573 sets of metagenomic data from 18 human body sites, HPViewer identified 104 types of HPV in a body-site associated pattern and 89 types of HPV co-occurring in one sample with other types of HPV. We demonstrated HPViewer was sensitive and specific for HPV detection in metagenomic data. Availability and implementation: HPViewer can be accessed at https://github.com/yuhanH/HPViewer/. Supplementary information: Supplementary data are available at Bioinformatics online.
Motivation: Shotgun DNA sequencing provides sensitive detection of all 182 HPV types in tissue and body fluid. However, existing computational methods either produce false positives misidentifying HPV types due to shared sequences among HPV, human and prokaryotes, or produce false negative since they identify HPV by assembled contigs requiring large abundant of HPV reads. Results: We designed HPViewer with two custom HPV reference databases masking simple repeats and homology sequences respectively and one homology distance matrix to hybridize these two databases. It directly identified HPV from short DNA reads rather than assembled contigs. Using 100 100 simulated samples, we revealed that HPViewer was robust for samples containing either high or low number of HPV reads. Using 12 shotgun sequencing samples from respiratory papillomatosis, HPViewer was equal to VirusTAP, and Vipie and better than HPVDetector with the respect to specificity and was the most sensitive method in the detection of HPV types 6 and 11. We demonstrated that contigs-based approaches had disadvantages of detection of HPV. In 1573 sets of metagenomic data from 18 human body sites, HPViewer identified 104 types of HPV in a body-site associated pattern and 89 types of HPV co-occurring in one sample with other types of HPV. We demonstrated HPViewer was sensitive and specific for HPV detection in metagenomic data. Availability and implementation: HPViewer can be accessed at https://github.com/yuhanH/HPViewer/. Supplementary information: Supplementary data are available at Bioinformatics online.
Authors: J M Walboomers; M V Jacobs; M M Manos; F X Bosch; J A Kummer; K V Shah; P J Snijders; J Peto; C J Meijer; N Muñoz Journal: J Pathol Date: 1999-09 Impact factor: 7.996
Authors: M L Gillison; W M Koch; R B Capone; M Spafford; W H Westra; L Wu; M L Zahurak; R W Daniel; M Viglione; D E Symer; K V Shah; D Sidransky Journal: J Natl Cancer Inst Date: 2000-05-03 Impact factor: 13.506
Authors: H Kawaguchi; S Ohno; K Araki; M Miyazaki; H Saeki; M Watanabe; S Tanaka; K Sugimachi Journal: Cancer Res Date: 2000-06-01 Impact factor: 12.701
Authors: Nubia Muñoz; F Xavier Bosch; Silvia de Sanjosé; Rolando Herrero; Xavier Castellsagué; Keerti V Shah; Peter J F Snijders; Chris J L M Meijer Journal: N Engl J Med Date: 2003-02-06 Impact factor: 91.245
Authors: Shahnaz Begum; Maura L Gillison; M Ali Ansari-Lari; Keerti Shah; William H Westra Journal: Clin Cancer Res Date: 2003-12-15 Impact factor: 12.531
Authors: Yuhan Hao; Ryan Ruiz; Liying Yang; Antonio Galvao Neto; Milan R Amin; Dervla Kelly; Stratos Achlatis; Scott Roof; Renjie Bing; Kasthuri Kannan; Stuart M Brown; Zhiheng Pei; Ryan C Branski Journal: Sci Rep Date: 2019-11-12 Impact factor: 4.379
Authors: Matthew J Inkman; Kay Jayachandran; Thomas M Ellis; Fiona Ruiz; Michael D McLellan; Christopher A Miller; Yufeng Wu; Akinyemi I Ojesina; Julie K Schwarz; Jin Zhang Journal: Sci Rep Date: 2020-08-31 Impact factor: 4.379