| Literature DB >> 33247099 |
Soren Alexandersen1,2,3, Anthony Chamings4,5, Tarka Raj Bhatta4,5.
Abstract
Severe acute respiratory syndrome coronavirus-2 (Entities:
Mesh:
Substances:
Year: 2020 PMID: 33247099 PMCID: PMC7695715 DOI: 10.1038/s41467-020-19883-7
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Table showing summary information about the individuals and samples included in this study.
| Individual with gender and age group | Sample ID | NGS number | Clinical symptoms | Sample collection date | Diagnostic SARS-CoV-2 RT-PCR test (Ct Value) |
|---|---|---|---|---|---|
| 1 Female, 20-40 years | GC-28 | GC-28/67 | Fever, cough, sore throat, body pains, chest pain, non- productive cough | 28/1/20 | Not Detected |
| 2 Female, 20-40 years | GC-26 | GC-26/66 | Sore throat, dry cough | 7/3/20 | Detected (21) |
| 3 Female, 20-40 years | GC-13 | GC-13/35 | Body aches, headaches, dry cough, shortness of breath | 23/3/20 | Detected (29) |
| 4 Female, 40-60 years | GC-11 | GC-11/34 & GC-11/38 | Cold, sinusitis | 24/3/20 | Detected (19) |
| GC-20 | GC-20/63 | Cold, sinusitis | 2/4/20 (9) | Detected (31) | |
| GC-24 | GC-24/61 | Cold, sinusitis | 7/4/20 (14) | Detected (31) | |
| 5 Female, 20-40 years | GC-12 | GC-12/36 | Sore throat, rigor, fever | 24/3/20 | Detected (31) |
| 6 Female, 40-60 years | GC-14 | GC-14/33 & GC-14/37 | Unspecified | 28/3/20 | Detected (18) |
| GC-23 | GC-23/60 | Asymptomatic | 8/4/20 (11) | Detected (31) | |
| GC-51 | GC-51/62 | Asymptomatic | 14/4/20 (17) | Detected (31) | |
| 7 Female, 40-60 years | GC-21 | GC-21/64 | Shortness of breath, cough, rhinorrhoea and sore throat | 3/4/20 | Detected (31) |
| 8 Male, 40-60 years | GC-25 | GC-25/65 | Sore throat, hoarse voice | 10/4/20 | Detected (19) |
| 9 Female, 40-60 years | GC-55 | GC-55/68 | One day history of cough, no sore throat, no runny nose, no fever | 24/4/20 | Detected (16) |
Samples from a total of 9 individuals, including one control individual (Individual 1) are included. Two infected individuals (Individual 4 and 6) each had 3 samples collected at different dates. The gender and age group of each individual are shown. The days post initial sample collection are shown in brackets after the date. Sample identification and NGS sample number (barcode) are shown together with summary clinical symptoms, sampling date and results of the diagnostic SARS-COV-2 RT-PCR test (Ct value).
Table showing the details of the sample number, average read length, number of reads and number of reads mapped to each subgenomic SARS-CoV-2 RNA.
| Sample | Average read length | Number of reads in millions | S Orf2 | Orf3a | E Orf4 | M Orf5 | Orf6 | Orf7a | Orf7b | Orf8 | N Orf9 | Orf10 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| GC-28/67 | 76 | 1.8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| GC-26/66 | 207 | 14.3 | 1611 | 17,702 | 18,165 | 5793 | 7066 | 19,443 | 0 | 807 | 20,156 | 0 |
| GC-13/35 | 149 | 1.6 | 1 | 17 | 17 | 9 | 14 | 72 | 0 | 17 | 81 | 0 |
| GC-11/34 | 73 | 1.2 | 8 | 29 | 98 | 63 | 58 | 592 | 0 | 250 | 35 | 0 |
| GC-11/38 | 184 | 3.3 | 179 | 2464 | 1715 | 1702 | 1587 | 4107 | 0 | 406 | 5071 | 0 |
| GC-20/63 | 83 | 1.3 | 0 | 0 | 133 | 1 | 972 | 0 | 0 | 0 | 0 | 0 |
| GC-24/61 | 115 | 2.3 | 0 | 0 | 16,067 | 283 | 0 | 5132 | 0 | 0 | 14,442 | 0 |
| GC-12/36 | 163 | 1.3 | 0 | 4 | 0 | 0 | 1 | 1 | 0 | 1 | 1 | 0 |
| GC-14/33 | 153 | 6 | 15 | 57 | 77 | 99 | 60 | 261 | 0 | 42 | 174 | 0 |
| GC-14/37 | 185 | 5.2 | 243 | 6798 | 5691 | 5548 | 6551 | 12,898 | 0 | 1539 | 33,261 | 0 |
| GC-23/60 | 150 | 5.2 | 204 | 74 | 11,117 | 304 | 12,972 | 6902 | 0 | 1463 | 51,614 | 0 |
| GC-51/62 | 100 | 2.6 | 0 | 0 | 0 | 89 | 0 | 19,847 | 0 | 0 | 15,130 | 0 |
| GC-21/64 | 92 | 3 | 0 | 0 | 0 | 88 | 0 | 9408 | 0 | 1052 | 3966 | 0 |
| GC-25/65 | 84 | 5.1 | 2 | 10 | 16,581 | 1249 | 16,655 | 158,755 | 0 | 11,990 | 1841 | 0 |
| GC-55/68 | 84 | 3.9 | 13 | 21 | 35,701 | 4453 | 4265 | 166,578 | 5 | 3328 | 942 | 0 |
Fig. 1SARS-CoV-2 genomic and subgenomic RNA structure showing genes and open reading frames (ORF) together with violin plots showing the number of reads per total of 5 million reads in the diagnostic samples mapped to the leader-containing subgenomic RNAs in the fasta file used for mapping.
The median read count is indicated by the white dot, the interquartile range (IQR) by the thick bar, and the furthest values within 1.5*IQR indicated by the thin black line (n = 14 data points from 12 biological samples from eight individuals run once). The structure of the SARS-CoV-2 genomic RNA is shown at the top and each subgenomic RNA is illustrated next to the respective violin plot. The nucleotide positions for the leader-TRS (transcription-regulatory sequences) joining locations are indicated alongside each subgenomic RNA (numbering based on reference Wuhan-Hu-1 NC_045512.2/MN908947.3 [https://www.ncbi.nlm.nih.gov/nuccore/NC_045512.2/ and https://www.ncbi.nlm.nih.gov/nuccore/MN908947.3]). In the current study no reads mapped to the tentative Orf10 leader-containing subgenomic RNA.
Table showing the details of the sample number, average read length, number of reads and number of reads mapped to each subgenomic SARS-CoV-2 RNA adjusted so they represent reads normalized to a total of 5 million (5 M) reads for each sample for easier comparison.
| Sample | Average read length | Number of reads adjusted to 5 M reads | S Orf2 | Orf3a | E Orf4 | M Orf5 | Orf6 | Orf7a | Orf7b | Orf8 | N Orf9 | Orf10 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| GC-28/67 | 76 | 5 M | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| GC-26/66 | 207 | 5 M | 563 | 6190 | 6351 | 2026 | 2471 | 6798 | 0 | 282 | 7048 | 0 |
| GC-13/35 | 149 | 5 M | 3 | 53 | 53 | 28 | 44 | 225 | 0 | 53 | 253 | 0 |
| GC-11/34 | 73 | 5 M | 33 | 121 | 408 | 263 | 242 | 2467 | 0 | 1042 | 146 | 0 |
| GC-11/38 | 184 | 5 M | 271 | 3733 | 2598 | 2579 | 2405 | 6223 | 0 | 615 | 7683 | 0 |
| GC-20/63 | 83 | 5 M | 0 | 0 | 512 | 4 | 3738 | 0 | 0 | 0 | 0 | 0 |
| GC-24/61 | 115 | 5 M | 0 | 0 | 34,928 | 615 | 0 | 11,157 | 0 | 0 | 31,396 | 0 |
| GC-12/36 | 163 | 5 M | 0 | 15 | 0 | 0 | 4 | 4 | 0 | 4 | 4 | 0 |
| GC-14/33 | 153 | 5 M | 13 | 48 | 64 | 83 | 50 | 218 | 0 | 35 | 145 | 0 |
| GC-14/37 | 185 | 5 M | 234 | 6537 | 5472 | 5335 | 6299 | 12,402 | 0 | 1480 | 31,982 | 0 |
| GC-23/60 | 150 | 5 M | 196 | 71 | 10,689 | 292 | 12,473 | 6637 | 0 | 1407 | 49,629 | 0 |
| GC-51/62 | 100 | 5 M | 0 | 0 | 0 | 171 | 0 | 38,167 | 0 | 0 | 29,096 | 0 |
| GC-21/64 | 92 | 5 M | 0 | 0 | 0 | 147 | 0 | 15,680 | 0 | 1753 | 6610 | 0 |
| GC-25/65 | 84 | 5 M | 2 | 10 | 16,256 | 1225 | 16,328 | 155,642 | 0 | 11,755 | 1805 | 0 |
| GC-55/68 | 84 | 5 M | 17 | 27 | 45,771 | 5709 | 5468 | 213,562 | 6 | 4267 | 1208 | 0 |
Fig. 2Read count per 5 million (5 M) showing reads mapped to either the first 21,500 nucleotides (nt) of the virus genome, to the subgenomic region from nucleotide 21,500 onward, to subgenomic RNA containing the leader sequence, to the included cellular control mRNA amplicons and reads not mapped to any of these.
n = 15 data points from 13 biological samples from nine individuals run once.
Table showing the number of NGS reads, per 5 mill reads, mapped to control cellular mRNA amplicons and the highest and average number of reads for SARS-CoV-2 amplicons included in the Ampliseq panel.
| Sample | Sample collection date | SARS-CoV-2 PCR (Ct) | TBP | LRP1 | HMBS | MYC | ITGB7 | Total | Mill reads | Total per 5 M reads | Virus amplicon coverage per 5 mill reads | ||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Highest in first 21,500 nt | Highest in full virus genome | Average of 168 amplicons in first 21,500 nt | |||||||||||
| GC-28/67 | 28/1/20 | Not detected | 157,551 | 2944 | 70,221 | 22,633 | 16 | 253,365 | 1.8 | 703,792 | 0 | 0 | 0 |
| GC-26/66 | 7/3/20 | Detected (21) | 1486 | 193 | 476 | 338 | 732 | 3225 | 14.3 | 1128 | 99,927 | 116,186 | 48,581 |
| GC13/35 | 23/3/20 | Detected (29) | 54 | 2 | 22 | 12 | 120 | 210 | 1.6 | 656 | 3022 | 26,200 | 185 |
| GC-11/34 | 24/3/20 | Detected (19) | 0 | 0 | 0 | 1 | 1 | 2 | 1.2 | 8 | 12,617 | 3,776,575 | 726 |
| GC-11/38 | 24/3/20 | Detected (19) | 0 | 9 | 0 | 0 | 2 | 11 | 3.3 | 17 | 55,377 | 373,552 | 15,201 |
| GC-20/63 | 2/4/20 | Detected (31) | 259,450 | 11,510 | 35,621 | 43,798 | 284,859 | 635,238 | 1.3 | 2,443,223 | 28,396 | 1,337,769 | 72 |
| GC-24/61 | 7/4/20 | Detected (31) | 222,864 | 82,467 | 66,551 | 138,145 | 220,593 | 730,620 | 2.3 | 1,588,304 | 181,672 | 469,278 | 9510 |
| GC-12/36 | 24/3/20 | Detected (31) | 421 | 66 | 28 | 626 | 1725 | 2866 | 1.3 | 11,023 | 81 | 900 | 4 |
| GC-14/33 | 28/3/20 | Detected (18) | 0 | 1 | 0 | 0 | 1 | 2 | 6 | 2 | 693 | 1880 | 172 |
| GC-14/37 | 28/3/20 | Detected (18) | 1 | 18 | 1 | 2 | 3 | 25 | 5.3 | 24 | 79,092 | 91,491 | 15,358 |
| GC-23/60 | 8/4/20 | Detected (31) | 214,567 | 81,749 | 91,585 | 163,657 | 214,265 | 765,823 | 2.6 | 1,472,737 | 408,398 | 661,887 | 21,965 |
| GC-51/62 | 14/4/20 | Detected (31) | 483,313 | 260.623 | 210,787 | 159,479 | 242,317 | 1,356,519 | 2.6 | 2,608,690 | 32,142 | 269,108 | 222 |
| GC-21/64 | 3/4/20 | Detected (31) | 575,261 | 167,868 | 230,305 | 96,124 | 580,411 | 1,649,969 | 3 | 2,749,948 | 125,907 | 704,475 | 827 |
| GC-25/65 | 10/4/20 | Detected (19) | 5 | 7 | 85 | 186 | 1055 | 1338 | 5.1 | 1312 | 351,893 | 1,129,815 | 3614 |
| GC-55/68 | 24/4/20 | Detected (16) | 2 | 0 | 0 | 0 | 0 | 2 | 3.9 | 3 | 417,778 | 946,012 | 3922 |
Fig. 3Violin plot showing the estimated ratio of virus genomic reads to subgenomic reads containing the leader for each of the diagnostic samples included in the study.
The ratio is estimated in different ways, including comparing a the total reads for the first 21,500 nucleotides of the virus genome with the total number of subgenomic reads; b the most abundant amplicon in the first 21,500 nt of the virus genome with the most abundant subgenomic RNA amplicon reads; c the most abundant full virus genome amplicon with the most abundant subgenomic RNA amplicon reads; and d the average full virus amplicons reads with the average subgenomic RNA amplicons reads. The median read count ratio is indicated by the white dot, the interquartile range (IQR) by the thick bar, and the furthest values within 1.5*IQR indicated by the thin black line (n = 14 data points from 12 biological samples from eight individuals run once).
Table showing sample details with corresponding Ct values of PCR amplification using specific targets.
| Sample | Sample collection date | Leader-7a sub-genomic (Set 1) (Ct) | 7a Genomic and sub-genomic (Set 2) (Ct) | Leader-5′-UTR genomic (Set 3) (Ct) | 5′-UTR genomic (Set 4) (Ct) | N Gene (Ct) | Orf 1ab (Ct) | S gene (Ct) |
|---|---|---|---|---|---|---|---|---|
| NTC | Neg | Neg | Neg | Neg | Neg | Neg | Neg | |
| GC-28/67 | 28/01/2020 | Neg | Neg | Neg | Neg | Neg | Neg | Neg |
| GC-26/66 | 7/03/2020 | 28.9 | 24.8 | 25.7 | 25.9 | 25.7 | 26.2 | 26.7 |
| GC-13/35 | 23/03/2020 | 32.2 | 28.6 | 29.6 | 28.8 | 29.6 | 28.0 | 29.1 |
| GC-11/34/38 | 24/03/2020 | 24.3 | 19.7 | 20.6 | 21.4 | 21.3 | 21.9 | 22.4 |
| GC-20/63 | 2/04/2020 | Neg | 35.3 | 35.6 | 35.7 | 35.8 | Neg | Neg |
| GC-24/61 | 7/04/2020 | Neg | 34.8 | 33.8 | 36 | 35.4 | 35.3 | Neg |
| GC-12/36 | 24/03/2020 | Neg | Neg | Neg | Neg | Neg | Neg | Neg |
| GC-14/33/37 | 28/03/2020 | 21.1 | 17.5 | 17.8 | 18.6 | 19.3 | 19 | 19.7 |
| GC-23/60 | 8/04/2020 | 35.9 | 33.3 | 32.9 | 33.3 | 33.6 | 32.9 | 35.2 |
| GC-51/62 | 14/04/2020 | Neg | Neg | Neg | 35.9 | 37 | Neg | Neg |
| GC-21/64 | 3/04/2020 | Neg | Neg | Neg | 36.8 | 36.3 | Neg | Neg |
| GC-25/65 | 10/04/2020 | 22.3 | 18.9 | 20.1 | 21 | 20.4 | 21.7 | 22.2 |
| GC-55/68 | 24/04/2020 | 19.3 | 15 | 15.7 | 16.1 | 16.4 | 16.7 | 17.5 |
NTC non-template control (water).
Fig. 4Genomic and subgenomic PCR values shown as 40 minus Ct (40-Ct).
The values are shown for each sample and for the four in-house PCRs detecting 7a subgenomic RNA only, 7a total RNA (i.e., genomic RNA and subgenomic RNA up to and including 7a), the 5′-UTR (untranslated region) leader or the 5′UTR. NTC non-template control (water).
Table showing the primers designed and used to detect specific targets in the SARS-CoV-2 genome.
| Set | Primers | Sequence (5′->3′) | Length | Region |
|---|---|---|---|---|
| 1 | SARS-CoV2-Ampliseq-P1-18-41-F | TCCCAGGTAACAAACCAACCAACT | 24 | Leader |
| SARS-CoV-2-RP1-3-TRB-27531-27512 | AAATGGTGAATTGCCCTCGT | 20 | Orf7a | |
| 2 | SARS-CoV-2-FP2-TRB-27401-27425 | TTATTCTTTTCTTGGCACTGATAAC | 25 | Orf7a |
| SARS-CoV-2-RP1-3-TRB-27531-27512 | AAATGGTGAATTGCCCTCGT | 20 | Orf7a | |
| 3 | SARS-CoV2-Ampliseq-P1-18-41-F | TCCCAGGTAACAAACCAACCAACT | 24 | Leader |
| SARS-CoV-2_RP4_206_187 | GACGAAACCGTAAGCAGCCT | 20 | 5′ UTR | |
| 4 | SARS-CoV-2_FP4_79-99 | AAAATCTGTGTGGCTGTCACT | 21 | 5′ UTR |
| SARS-CoV-2_RP4_206_187 | GACGAAACCGTAAGCAGCCT | 20 | 5′ UTR | |
| 5 | SARS-CoV-2-FP2-TRB-27401-27425 | TTATTCTTTTCTTGGCACTGATAAC | 25 | Orf7a |
| SARS-CoV-2-RP1-2-TRB-27511-27491 | ATGTTCCAGAAGAGCAAGGTT | 21 | Orf7a |