| Literature DB >> 31752912 |
Gianpiero Zamperin1, Pierrick Lucas2,3, Irene Cano4, David Ryder4, Miriam Abbadi1, David Stone4, Argelia Cuenca5, Estelle Vigouroux3,6, Yannick Blanchard7,8, Valentina Panzarin9.
Abstract
BACKGROUND: Next generation sequencing (NGS) is becoming widely used among diagnostics and research laboratories, and nowadays it is applied to a variety of disciplines, including veterinary virology. The NGS workflow comprises several steps, namely sample processing, library preparation, sequencing and primary/secondary/tertiary bioinformatics (BI) analyses. The latter is constituted by a complex process extremely difficult to standardize, due to the variety of tools and metrics available. Thus, it is of the utmost importance to assess the comparability of results obtained through different methods and in different laboratories. To achieve this goal, we have organized a proficiency test focused on the bioinformatics components for the generation of complete genome sequences of salmonid rhabdoviruses.Entities:
Keywords: Bioinformatics; NGS; Proficiency testing; Virology
Mesh:
Year: 2019 PMID: 31752912 PMCID: PMC6868765 DOI: 10.1186/s12985-019-1223-8
Source DB: PubMed Journal: Virol J ISSN: 1743-422X Impact factor: 4.099
Fig. 1Samples description. Outer and inner rings show samples distribution with respect to viral strain (VHSV, IHNV, calibrator) and to sequencing technology (Illumina MiSeq, Ion Proton™), respectively
Fig. 2Discrepancies distribution. a: Histograms show the total number of discrepancies counted in respect to type (SNP, indel) and genome localization (CDS, non-coding intergenic regions, genome termini); histograms are further divided to display total distribution, and distribution for each type of sequencing technology (Illumina MiSeq, Ion Proton™). b: Histograms show discrepancies from the point of view of every participant lab; inconsistent sites are counted in respect to representing the majority or being the odd one out
Fig. 3Explanation of inconsistencies. a: The histogram shows the total number of justifications for nucleotide assignment given by participants for the subset of 50 discrepancies randomly chosen. Justifications are represented by the BI step to which an explanation is ascribed. b: A pie chart showing the source distribution of the discrepancy subset; “manual curation” is further divided into three subgroups, depending from the reason behind the discrepancy itself: “A” arisen when at least one participant assigned a nucleotide during manual curation; “B” arisen after manual curation as well as after de novo assembly during step iv); “C” arisen after de novo assembly of step iv)