| Literature DB >> 35778290 |
Katherine A Lau1, Kristy Horan2, Anders Gonçalves da Silva2, Alexa Kaufer3, Torsten Theis3, Susan A Ballard2, William D Rawlinson4.
Abstract
Extensive studies and analyses into the molecular features of severe acute respiratory syndrome related coronavirus 2 (SARS-CoV-2) have enhanced the surveillance and investigation of its clusters and transmission worldwide. The whole genome sequencing (WGS) approach is crucial in identifying the source of infection and transmission routes by monitoring the emergence of variants over time and through communities. Varying SARS-CoV-2 genomics capacity and capability levels have been established in public health laboratories across different Australian states and territories. Therefore, laboratories performing SARS-CoV-2 WGS for public health purposes are recommended to participate in an external proficiency testing program (PTP). This study describes the development of a SARS-CoV-2 WGS PTP. The PTP assessed the performance of laboratories while providing valuable insight into the current state of SARS-CoV-2 genomics in public health across Australia. Part 1 of the PTP contained eight simulated SARS-CoV-2 positive and negative specimens to assess laboratories' wet and dry laboratory capacity. Part 2 involved the analysis of a genomic dataset that consisted of a multi-FASTA file of 70 consensus genomes of SARS-CoV-2. Participating laboratories were required to (1) submit raw data for independent bioinformatics analysis, (2) analyse the data with their processes, and (3) answer relevant questions about the data. The performance of the laboratories was commendable, despite some variation in the reported results due to the different sequencing and bioinformatics approaches used by laboratories. The overall outcome is positive and demonstrates the critical role of the PTP in supporting the implementation and validation of SARS-CoV-2 WGS processes. The data derived from this PTP will contribute to the development of SARS-CoV-2 bioinformatic quality control (QC) and performance benchmarking for accreditation.Entities:
Keywords: SARS-CoV-2; bioinformatics analysis; proficiency testing; whole genome sequencing
Mesh:
Year: 2022 PMID: 35778290 PMCID: PMC9239710 DOI: 10.1016/j.pathol.2022.04.002
Source DB: PubMed Journal: Pathology ISSN: 0031-3025 Impact factor: 5.335
Content of each specimen of Part 1 of the SARS-CoV-2 whole genome sequencing proficiency testing program, with its associated GISAID ID and the average CT
| RCPAQAP | Content | In-house NAT (E-gene) | External NAT (E-gene) |
|---|---|---|---|
| BS01 | SARS-CoV-2 RNA (EPI_ISL_406844) | 16.8 | 16.2 |
| BS02 | SARS-CoV-2 RNA (EPI_ISL_419750) | 17.0 | 16.5 |
| BS03 | SARS-CoV-2 RNA (EPI_ISL_480701) | 17.3 | 16.7 |
| BS04 | SARS-CoV-2 RNA (EPI_ISL_519314) | 17.3 | 16.9 |
| BS05 | SARS-CoV-2 RNA (EPI_ISL_563416) | 17.3 | 17.6 |
| BS06 | Negative specimen | Not detected | Not detected |
| BS07 | SARS-CoV-2 RNA 1:102 dilution (EPI_ISL_419750) | 24.1 | 22.4 |
| BS08 | SARS-CoV-2 RNA 1:103 dilution (EPI_ISL_419750) | 27.5 | 26.3 |
CT, cycle threshold; GISAID, Global Initiative on Sharing All Influenza Data; NAT, nucleic acid testing.
BS02, BS07 and BS08 were prepared from the same RNA preparation, with BS07 and BS08 serially diluted from BS02 at 1:102 and 1:103, respectively.
In-house sequence analysis for each specimen of Part 1 of the SARS-CoV-2 whole genome sequencing proficiency testing program
| RCPAQAP | Genome recovered (%) | Pango lineage | Single-nucleotide polymorphism (SNP) | Amino acid replacement |
|---|---|---|---|---|
| BS01 | 99.6 | B | T19065C, T22303G, G26144T | S:S247R; ORF3a:G251V |
| BS02 | 99.6 | B.1 | C241T, C1059T, C3037T, C14408T, A23403G, G25563T | nsp2:T85I; nsp12:P323L; S:D614G; ORF3a:Q57H |
| BS03 | 98.9 | B.1.1.136 | C241T, C3037T, C14408T, A23063T, A23403G, C26984T, T28196C, G28881A, G28882A, G28883C | nsp12:P323L; S:N501Y; S:D614G; N:R203K; N:G204R |
| BS04 | 99.6 | D.2 | C241T, A1163T, C3037T, T7540C, C14408T, G16647T, C18555T, G22992A, G23401A, A23403G, C28647T, G28881A, G28882A, G28883C | nsp2:I120F; nsp12:P323L; S:S477N; S:D614G; N:A125V; N:R203K; N:G204R |
| BS05 | 99.2 | D.2 | C241T, A1163T, C3037T, T7540C, C9996T, C10279T, C14408T, C14599T, G16647T, C18555T, G22205C, G22927T, G22992A, G23401A, A23403G, A28416T, G28881A, G28882A, G28883C, C29585T | nsp2:I120F; nsp4:S481L; nsp12:P323L; S:D215H; S:L455F; S:S477N; S:D614G; N:N48I; N:R203K; N:G204R; ORF10:P10S |
| BS06 | 1.00 | N/A | N/A | N/A |
| BS07 | 98.8 | B.1 | C241T, C1059T, C3037T, C14408T, A23403G, G25563T | nsp2:T85I; nsp12:P323L; S:D614G; ORF3a:Q57H |
| BS08 | 98.3 | B.1 | C241T, C1059T, C3037T, C14408T, A23403G, G25563T | nsp2:T85I; nsp12:P323L; S:D614G; ORF3a:Q57H |
Pangolin v2.1.7 (https://github.com/cov-lineages/pangolin) with pangoLEARN version 2021-01-11 (https://github.com/cov-lineages/pangoLEARN) were used in the analysis.
Sequence base pairs deletion and N(s) are excluded. C241T is a common SNP introduced to the sequence likely due to degradation of material and therefore absence of C241T, as reported by participants is considered concordant.
nsp2:T85I is equivalent to ORF1a:T265I, ORF1ab:T265I and ORF1ab:T85I; nsp12:P323L is equivalent to nsp12b:P314L, ORF1b:P314L, ORF1ab:P4715L, and ORF1ab:P323L; nsp2:I120F is equivalent to ORF1a:I300F, ORF1ab:I300F and ORF1ab:I120F; N:R203K and N:G204R are equivalent to N:RG608-609KR; N:R203K is equivalent to N:R50K; N:G204R is equivalent to N:G50R and ORF14:G50R; nsp4:S481L is equivalent to ORF1a:S3244L, ORF1ab:S3244L and ORF1ab:S481L.
Cluster ID of the 70 SARS-CoV-2 consensus genomes included in Part 2 of the SARS-CoV-2 whole genome sequencing proficiency testing program
| Accession | RCPAQAP | Cluster ID | Accession | RCPAQAP | Cluster ID | Accession | RCPAQAP | Cluster ID |
|---|---|---|---|---|---|---|---|---|
| MT520173.1 | RCPA-PTP-2021-S001 | A | MT520273.1 | RCPA-PTP-2021-S025 | A | MT520445.1 | RCPA-PTP-2021-S049 | A |
| MT520175.1 | RCPA-PTP-2021-S002 | A | MT520285.1 | RCPA-PTP-2021-S026 | A | MT520448.1 | RCPA-PTP-2021-S050 | A |
| MT520196.1 | RCPA-PTP-2021-S003 | A | MT520288.1 | RCPA-PTP-2021-S027 | A | MT520453.1 | RCPA-PTP-2021-S051 | A |
| MT520202.1 | RCPA-PTP-2021-S004 | B | MT520295.1 | RCPA-PTP-2021-S028 | A | MT520454.1 | RCPA-PTP-2021-S052 | A |
| MT520206.1 | RCPA-PTP-2021-S005 | A | MT520300.1 | RCPA-PTP-2021-S029 | A | MT520460.1 | RCPA-PTP-2021-S053 | A |
| MT520208.1 | RCPA-PTP-2021-S006 | A | MT520309.1 | RCPA-PTP-2021-S030 | A | MT520464.1 | RCPA-PTP-2021-S054 | A |
| MT520216.1 | RCPA-PTP-2021-S007 | A | MT520312.1 | RCPA-PTP-2021-S031 | A | MT520479.1 | RCPA-PTP-2021-S055 | C |
| MT520219.1 | RCPA-PTP-2021-S008 | A | MT520318.1 | RCPA-PTP-2021-S032 | A | MT520480.1 | RCPA-PTP-2021-S056 | A |
| MT520222.1 | RCPA-PTP-2021-S009 | A | MT520324.1 | RCPA-PTP-2021-S033 | A | MT520482.1 | RCPA-PTP-2021-S057 | A |
| MT520226.1 | RCPA-PTP-2021-S010 | A | MT520330.1 | RCPA-PTP-2021-S034 | A | MT520483.1 | RCPA-PTP-2021-S058 | A |
| MT520230.1 | RCPA-PTP-2021-S011 | A | MT520334.1 | RCPA-PTP-2021-S035 | A | MT520495.1 | RCPA-PTP-2021-S059 | A |
| MT520232.1 | RCPA-PTP-2021-S012 | A | MT520340.1 | RCPA-PTP-2021-S036 | C | MT520504.1 | RCPA-PTP-2021-S060 | A |
| MT520234.1 | RCPA-PTP-2021-S013 | A | MT520349.1 | RCPA-PTP-2021-S037 | A | MT520505.1 | RCPA-PTP-2021-S061 | A |
| MT520235.1 | RCPA-PTP-2021-S014 | A | MT520369.1 | RCPA-PTP-2021-S038 | A | MT520507.1 | RCPA-PTP-2021-S062 | A |
| MT520239.1 | RCPA-PTP-2021-S015 | A | MT520374.1 | RCPA-PTP-2021-S039 | A | MT520510.1 | RCPA-PTP-2021-S063 | A |
| MT520244.1 | RCPA-PTP-2021-S016 | A | MT520380.1 | RCPA-PTP-2021-S040 | A | MT520514.1 | RCPA-PTP-2021-S064 | A |
| MT520246.1 | RCPA-PTP-2021-S017 | A | MT520400.1 | RCPA-PTP-2021-S041 | C | MT520525.1 | RCPA-PTP-2021-S065 | A |
| MT520247.1 | RCPA-PTP-2021-S018 | A | MT520407.1 | RCPA-PTP-2021-S042 | A | MT520529.1 | RCPA-PTP-2021-S066 | A |
| MT520248.1 | RCPA-PTP-2021-S019 | A | MT520417.1 | RCPA-PTP-2021-S043 | A | MT520530.1 | RCPA-PTP-2021-S067 | B |
| MT520256.1 | RCPA-PTP-2021-S020 | A | MT520420.1 | RCPA-PTP-2021-S044 | A | MT520536.1 | RCPA-PTP-2021-S068 | A |
| MT520263.1 | RCPA-PTP-2021-S021 | B | MT520421.1 | RCPA-PTP-2021-S045 | A | MT520538.1 | RCPA-PTP-2021-S069 | A |
| MT520268.1 | RCPA-PTP-2021-S022 | A | MT520426.1 | RCPA-PTP-2021-S046 | A | MT520539.1 | RCPA-PTP-2021-S070 | A |
| MT520270.1 | RCPA-PTP-2021-S023 | A | MT520435.1 | RCPA-PTP-2021-S047 | A | |||
| MT520271.1 | RCPA-PTP-2021-S024 | A | MT520440.1 | RCPA-PTP-2021-S048 | B |
Performance assessment of participants based on the consensus sequence submitted for Part 1 of the SARS-CoV-2 whole genome sequencing proficiency testing program
| Sample ID | Performance assessment based on consensus sequence | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | |
| BS01 | Pass | Pass | NA | Pass | Pass | Pass | Pass | Pass | Pass | Pass | Pass |
| BS02 | Pass | Pass | NA | Pass | Pass | Pass | Pass | Pass | Pass | Pass | Pass |
| BS03 | Pass | Pass | NA | Pass | Pass | Pass | Pass | Pass | Pass | Pass | Pass |
| BS04 | Pass | Pass | NA | Pass | Pass | Pass | Pass | Pass | Pass | Pass | Pass |
| BS05 | Pass | Pass | NA | Pass | Pass | Pass | Pass | Pass | Pass | Pass | Pass |
| BS06 | Pass | Pass | NA | Pass | Pass | Pass | Pass | Pass | Pass | Fail | Pass |
| BS07 | Pass | Pass | NA | Pass | Pass | Pass | Pass | Pass | Pass | Pass | Pass |
| BS08 | Pass | Pass | NA | Pass | Pass | Pass | Pass | Pass | Pass | Pass | Pass |
NA, not assessed.
Fig. 1Comparisons of results on the (A) pango lineage and the (B) genome recovered for samples BS01–BS08, based on the analysis of the consensus sequences submitted by participants and their questionnaire responses in Part 1 of the SARS-CoV-2 whole genome sequencing proficiency testing program.
Fig. 2Overview of the average read depth for samples BS01–BS08, as reported by participants in Part 1 questionnaire responses.
Fig. 3Discordant results for the single nucleotide polymorphism (SNP) and amino acid replacement for samples BS01–BS08, as reported by participants in Part 1 questionnaire response.
Fig. 4Assessment of result concordance with benchmarked cluster A, as reported by participants in the phylogenetic analysis of 70 SARS-CoV-2 consensus genomes in Part 2 of the SARS-CoV-2 whole genome sequencing proficiency testing program.