| Literature DB >> 33738124 |
Juliana D Siqueira1, Livia R Goes1,2, Brunna M Alves1, Pedro S de Carvalho1, Claudia Cicala2, James Arthos2, João P B Viola3, Andréia C de Melo4, Marcelo A Soares1.
Abstract
Numerous factors have been identified to influence susceptibility to Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) infection and disease severity. Cancer patients are more prone to clinically evolve to more severe COVID-19 conditions, but the determinants of such a more severe outcome remain largely unknown. We have determined the full-length SARS-CoV-2 genomic sequences of cancer patients and healthcare workers (non-cancer controls) by deep sequencing and investigated the within-host viral population of each infection, quantifying intrahost genetic diversity. Naso- and oropharyngeal SARS-CoV-2+ swabs from 57 cancer patients and 14 healthcare workers from the Brazilian National Cancer Institute were collected in April to May 2020. Complete genome amplification using ARTIC network V3 multiplex primers was performed followed by next-generation sequencing. Assemblies were conducted in Geneious R11, where consensus sequences were extracted and intrahost single nucleotide variants were identified. Maximum likelihood phylogenetic analysis was performed using PhyMLv.3.0 and lineages were classified using Pangolin and CoV-GLUE. Phylogenetic analysis showed that all but one strain belonged to clade B1.1. Four genetically linked mutations known as the globally dominant SARS-CoV-2 haplotype (C241T, C3037T, C14408T and A23403G) were found in the majority of consensus sequences. SNV signatures of previously characterized Brazilian genomes were also observed in most samples. Another 85 SNVs were found at a lower frequency (1.4%-19.7%) among the consensus sequences. Cancer patients displayed a significantly higher intrahost viral genetic diversity compared to healthcare workers. This difference was independent of SARS-CoV-2 Ct values obtained at the diagnostic tests, which did not differ between the two groups. The most common nucleotide changes of intrahost SNVs in both groups were consistent with APOBEC and ADAR activities. Intrahost genetic diversity in cancer patients was not associated with disease severity, use of corticosteroids, or use of antivirals, characteristics that could influence viral diversity. Moreover, the presence of metastasis, either in general or specifically in the lung, was not associated with intrahost diversity among cancer patients. Cancer patients carried significantly higher numbers of minor variants compared to non-cancer counterparts. Further studies on SARS-CoV-2 diversity in especially vulnerable patients will shed light onto the understanding of the basis of COVID-19 different outcomes in humans.Entities:
Keywords: COVID-19; SARS-CoV-2; cancer; full-length genome; single nucleotide variant
Year: 2021 PMID: 33738124 PMCID: PMC7928633 DOI: 10.1093/ve/veab013
Source DB: PubMed Journal: Virus Evol ISSN: 2057-1577
Demographic and clinic characteristics of the cancer patients and healthcare workers studied.
| Characteristic | Patients (%) | Healthcare workers (%) |
|
|---|---|---|---|
| n = 57 | n = 14 | ||
| Age, years (median, range) | 61 (9–79) | 40.5 (33–57) | <0.001 |
| Age | |||
| <25 years | 3.5 | 0 | |
| 25–64 years | 57.9 | 100 | |
| ≥65 years | 38.6 | 0 | |
| Gender | 0.118 | ||
| Female | 61.4 | 85.7 | |
| Male | 38.6 | 14.3 | |
| Time in days between symptom onset and sample collection (median, range) | 3 (0–34) | NC | NT |
| Symptoms at COVID-19 diagnosis | |||
| Cough | 59.6 | 85.7 | 0.1956 |
| Fever | 57.9 | 57.1 | 0.7645 |
| Dyspnea | 56.1 | 7.1 | <0.001 |
| Fatigue | 24.6 | 21.4 | 1 |
| Diarrhea | 14.0 | 7.1 | 0.6719 |
| Nausea/vomiting | 12.3 | 0 | 0.3301 |
| Anorexia | 7.0 | 0 | 0.5721 |
| Sore throat | 5.3 | 42.8 | 0.0018 |
| Myalgia | 3.5 | 0 | 1 |
| Headache | 3.5 | 42.8 | <0.001 |
| Anosmia | 3.5 | 42.8 | <0.001 |
| Ageusia | 3.5 | 0 | 1 |
| Coryza | 3.5 | 85.7 | <0.001 |
| None | 0 | 0 | |
| Missing | 7.0 | 0 | |
| Death | 0.0029 | ||
| Yes, from COVID-19 | 33.3 | 0 | |
| Yes, other cause | 5.3 | 0 | |
| No | 54.4 | 100 | |
| Missing | 7.0 | 0 | |
| Smoking | |||
| Past/current | 21.0 | NC | NT |
| Never | 24.6 | NC | NT |
| Missing | 54.4 | NC | NT |
| Primary cancer site | |||
| Solid tumors | 71.9 | NA | NT |
| Hematological malignancies | 28.1 | NA | NT |
| Metastatic disease | |||
| Yes, to the lung | 14.0 | NA | NT |
| Yes, to other organs | 24.6 | NA | NT |
| No | 47.4 | NA | NT |
| Missing | 14.0 | NA | NT |
| Use of corticosteroid | |||
| Yes | 19.3 | NA | NT |
| No | 87.2 | NA | NT |
| Missing | 3.5 | NA | NT |
| Use of oseltamivir | |||
| Yes | 14.0 | NA | NT |
| No | 82.4 | NA | NT |
| Missing | 3.5 | NA | NT |
NC, not collected; NA, not applicable; NT, not tested.
Data available for 45 patients.
Figure 1.Maximum likelihood phylogenetic trees of near full-length SARS-CoV-2 genomes characterized. Tree including cancer patients (depicted in red circles), healthcare workers (in blue), Brazilian sequences classified as B1, B1.* and the Brazilian circulating strains B1.1-BR/B.1.1-EU/BR available on GISAID (in gray). (B) Tree showing epidemiological linkage of cancer patients (shown in red), healthcare workers (in blue) and all SARS-Cov-2 sequences from Rio de Janeiro state (in gray) available on GISAID Database. In both cases, GISAID was accessed on July 27th, 2020. Bootstrap values greater than 70 are shown in both trees.
Figure 2.Number of iSNVs per ORF analyzed. Data for cancer patients (A) and healthcare workers (B) are shown. Each iSNV showed an intrahost frequency greater than 2% with a minimum depth coverage of 500x. The table on the right shows ORF name and genome coordinate based on SARS-CoV-2 Wuhan-Hu-1 reference sequence genome (GenBank number MN908947), ORF size in bp, number of samples, average base pairs analyzed and average (min–max) substitutions per site × 10−4. The last three columns refer only to samples with minimum depth coverage of 500× for at least 60% of the given ORF region extension.
Figure 3.Viral genetic diversity in cancer patients and HCWs. Diversity was calculated by number of iSNVs per site × 10−4. Tukey boxplots show the viral genetic diversity in cancer patients compared to HCW (Mann–Whitney test (two-tailed), **P = 0.0093) (A). The difference is still significant when outlier patients are removed from the analysis (Mann–Whitney test (two-tailed), *P = 0.0299) (B). Viral genetic diversity in cancer patients and healthcare workers samples (n = 71) showed a positive correlation with Ct values (C). No correlation between viral genetic diversity and Ct values is observed when analyzing samples with Ct < 20 (D). Spearman correlation analysis rs and P-values are indicated.
Association between SARS-CoV-2 genetic diversity and clinical outcomes.
| Characteristic | Median iSNVs/site × 10-4 |
|
|---|---|---|
| Death | ||
| Yes | 0.92 | 0.722 |
| No | 1.21 | |
| Death from COVID-19 | ||
| Yes | 1.05 | 0.934 |
| No | 1.04 | |
| Intensive care unit | ||
| Yes | 0.93 | 0.722 |
| No | 1.05 | |
| Use of corticosteroid | ||
| Yes | 0.91 | 0.660 |
| No | 1.11 | |
| Use of oseltamivir | ||
| Yes | 1.92 | 0.384 |
| No | 0.89 | |
| Haematological malignancies | ||
| Yes | 1.37 | 0.722 |
| No | 0.98 | |
| Metastatic disease | ||
| Yes | 1.64 | 0.279 |
| No | 0.98 | |
| Metastatic disease to the lung | ||
| Yes | 1.70 | 0.279 |
| No | 0.98 |
After Benjamini–Hochberg correction for multiple comparisons.
Figure 4.iSNVs according to the nucleotide change. Distribution of nucleotide changes across the genome for all samples (n = 71) is shown (A). C-to-T mutations are shown in orange, A-to-G are in green and the remaining changes are in grey. Genome coordinates are relative to the SARS-CoV-2 Wuhan-Hu-1 reference sequence (GenBank acc.# MN908947). Average frequency of each type of nucleotide change found for cancer patients (red) and healthcare workers (blue) (B).