| Literature DB >> 34372528 |
Laila Sara Arroyo Mühr1, Daniel Guerendiain2,3, Kate Cuschieri2, Karin Sundström4.
Abstract
Next-generation sequencing (NGS) yields powerful opportunities for studying human papillomavirus (HPV) genomics for applications in epidemiology, public health, and clinical diagnostics. HPV genotypes, variants, and point mutations can be investigated in clinical materials and described in previously unprecedented detail. However, both the NGS laboratory analysis and bioinformatical approach require numerous steps and checks to ensure robust interpretation of results. Here, we provide a step-by-step review of recommendations for validation and quality assurance procedures of each step in the typical NGS workflow, with a focus on whole-genome sequencing approaches. The use of directed pilots and protocols to ensure optimization of sequencing data yield, followed by curated bioinformatical procedures, is particularly emphasized. Finally, the storage and sharing of data sets are discussed. The development of international standards for quality assurance should be a goal for the HPV NGS community, similar to what has been developed for other areas of sequencing efforts including microbiology and molecular pathology. We thus propose that it is time for NGS to be included in the global efforts on quality assurance and improvement of HPV-based testing and diagnostics.Entities:
Keywords: HPV; NGS; WGS; deep sequencing; human papillomavirus; next-generation sequencing; whole-genome sequencing
Mesh:
Year: 2021 PMID: 34372528 PMCID: PMC8310033 DOI: 10.3390/v13071323
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Figure 1Next-generation sequencing (NGS) for human papillomavirus (HPV) detection and characterization process steps, from sample preparation to data analysis, with focus on whole-genome sequencing (WGS).
Steps, potential quality issues, and proposed mitigations for next-generation sequencing (NGS) analytical workflow, with a focus on whole-genome sequencing purposes.
| Human Papillomavirus Detection by Next-Generation Sequencing | ||
|---|---|---|
| NGS Step | Possible Difficulties | Mitigations |
| Sample preparation | Nucleic acid quality and/or quantity outside library prep kit requirements | Selection of appropriate nucleic acid extraction methods. |
| Incorrect fragment size (too short or too long) | ||
| Library preparation and sequencing | Incorrect fragment size (too short or too long) | Correct selection of library kit and fragment length. |
| Data analysis | Low sequencing depth | Library preparation and sequencing piloting, and re-analysis |
| Incorrect alignment | ||
| Mix/chimeras of microbial organisms | Filter reads—use of updated databases and careful settings of parameters. De novo assembly evaluation (HPV Chimera scripts) | |
| Validation of pipeline | Digital IQC, EQA, external assessment. | |
| Storage | Large amount of data | Cloud services, compression of files, and storage of only raw input and final output. |
| Security | Restricted super-user access, individually curated data access | |
| Length of data storage | Organization policy/data archiving laws and regulations | |