| Literature DB >> 35019701 |
Celia Boukadida1, Blanca Taboada2, Marina Escalera-Zamudio3, Pavel Isa2, José Ernesto Ramírez-González4, Joel Armando Vazquez-Perez5, José Esteban Muñoz-Medina6, Concepción Grajales-Muñiz7, Carolina González-Torres8, Francisco Javier Gaytán-Cervantes8, Alma Rincón-Rubio1, Margarita Matías-Florentino1, Héctor Esteban Paz-Juárez1, Alejandro Sanchez-Flores9, Edgar Mendieta-Condado4, Gisela Barrera-Badillo4, Lucía Hernández-Rivas4, Susana López2, Irma López-Martínez4, Santiago Ávila-Ríos1, Carlos F Arias2.
Abstract
The coronavirus disease 2019 (COVID-19), caused by the severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2), has shown a wide spectrum of clinical manifestations ranging from asymptomatic infections to severe disease and death. Pre-existing medical conditions and age have been mainly linked to the development of severe disease; however, the potential association of viral genetic characteristics with different clinical conditions remains unclear. SARS-CoV-2 variants with increased transmissibility were detected early in the pandemics, and several variants with potential relevance for public health are currently circulating around the world. In this study, we characterized 57 complete SARS-CoV-2 genomes during the exponential growth phase of the early epidemiological curve in Mexico, in April 2020. Patients were categorized under distinct disease severity outcomes: mild disease or ambulatory care, severe disease or hospitalized, and deceased. To reduce bias related to risk factors, the patients were less than 60 years old and with no diagnosed comorbidities A trait-association phylogenomic approach was used to explore genotype-phenotype associations, represented by the co-occurrence of mutations, disease severity outcome categories, and clusters of Mexican sequences. Phylogenetic results revealed a higher genomic diversity compared to the initial viruses detected during the early stage of the local epidemic. We identified a total of 90 single nucleotide variants compared to the Wuhan-Hu-1 genome, including 54 nonsynonymous mutations. We did not find evidence for the co-occurrence of mutations associated with specific disease outcomes. Therefore, in the group of patients studied, disease severity was likely mainly driven by the host genetic background and other demographic factors. IMPORTANCE The genetic association of severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) with different clinical conditions remains unclear and needs further investigation. In this study, we characterized 57 complete SARS-CoV-2 genomes from patients in Mexico with distinct disease severity outcomes: mild disease or ambulatory care, severe disease or hospitalized, and deceased. To reduce bias related to risk factors the patients were less than 60 years old and with no diagnosed comorbidities. We did not find evidence for the co-occurrence of mutations associated with specific disease outcomes. Therefore, in the group of patients studied, disease severity was likely mainly driven by the host genetic background and other demographic factors.Entities:
Keywords: Mexico; SARS-CoV-2; distinct clinical outcomes; genomic diversity
Mesh:
Year: 2022 PMID: 35019701 PMCID: PMC8754132 DOI: 10.1128/spectrum.01249-21
Source DB: PubMed Journal: Microbiol Spectr ISSN: 2165-0497
Characteristics of the study participants
| Disease outcome category | Ambulatory care (A) | Hospitalized (H) | Deceased (D) | |
|---|---|---|---|---|
| Age (yrs), median (range) | 34 (24-50) | 47 (22-59) | 45 (27-59) |
|
| Sex (Male), | 15 (62.5%) | 10 (71.4%) | 15 (78.9%) | NS, |
| Location | ||||
| Mexico City, | 8 (33.3%) | 11 (78.6%) | 11 (57.9%) |
|
| Other states of Mexico, | 16 (66.7%) | 3 (21.4%) | 8 (42.1%) |
|
| Symptoms | 20 (83.3%) | 11 (78.6%) | 16 (88.9%) | NS, |
| Time from symptom onset to diagnosis (days), median (interquartile range) | 2.5 (1–4) | 8 (2.5–13.25) | 6 (4.75–8.25) |
|
Kruskal-Wallis and Chi-square tests were used to compare continuous and categorical variables, respectively (NS, not significant; *, p ≤ 0.05; **, p ≤ 0.005; ****, p < 0.0001). NA, not available. In bold, the values that were statistically significant.
FIG 1Genome variation compared to Wuhan-Hu-1 across severity groups. (A) Number of mutations and (B) amino acid changes, compared to the reference genome Wuhan-Hu-1 in the ambulatory care (A), hospitalized (H), and deceased (D) study groups. Boxes contain the 25th and 75th percentiles, and whiskers show the minimum and maximum values.
Description of full-length SARS-CoV-2 genomes
| Sample ID | Pango lineage | # of changes | aa changes | |
|---|---|---|---|---|
| nt | aa | |||
| 25_INER_49A_M_MEX_D_26/03/2020 | B.1.609 | 6 | 2 | Orf1b:P314L,S:D614G |
| 42_INER_53A_M_CMX_D_30/03/2020 | B.1.609 | 7 | 3 | Orf1a:Q4189H,Orf1b:P314L,S:D614G |
| 68_INER_40A_M_MEX_H_04/04/2020 | B.1.609 | 8 | 2 | Orf1b:P314L,S:D614G |
| 75_INER_34A_M_CMX_A_04/04/2020 | B.1.1.222 | 9 | 5 | N:R203K,N:G204R,Orf1b:P314L,S:D614G,S:T732A |
| 83_INER_39A_M_CMX_D_06/04/2020 | B.1.609 | 7 | 3 | Orf1b:P314L,S:A260S,S:D614G |
| 85_INER_27A_M_CMX_A_06/04/2020 | B.1 | 7 | 4 | Orf1a:L3338F,Orf1b:P314L,Orf3a:Q57H,S:D614G |
| 91_INER_45A_F_MEX_D_07/04/2020 | B.1.609 | 6 | 2 | Orf1b:P314L,S:D614G |
| 95_INER_56A_M_CMX_D_13/04/2020 | B.1.609 | 9 | 3 | Orf1a:I1276T,Orf1b:P314L,S:D614G |
| 104_INER_43A_M_CMX_D_10/04/2020 | B.1.111 | 7 | 4 | Orf1b:P314L,Orf3a:Q57H,Orf7a:T14I,S:D614G |
| 106_INER_36A_F_CMX_H_10/04/2020 | B.1 | 5 | 2 | Orf1b:P314L,S:D614G |
| 109_INER_41A_F_MEX_D_08/04/2020 | B.1 | 6 | 2 | Orf1b:P314L,S:D614G |
| 112_INER_28A_M_CMX_A_08/04/2020 | B.1 | 7 | 5 | Orf1a:T265I,Orf1a:D3972E,Orf1b:P314L,Orf3a:Q57H,S:D614G |
| 118_INER_59A_M_CMX_D_08/04/2020 | B.1.609 | 7 | 2 | Orf1b:P314L,S:D614G |
| 126_INER_56A_M_MEX_D_11/04/2020 | B.1.1 | 9 | 5 | N:R203K,N:G204R,Orf1a:A1049V,Orf1b:P314L,S:D614G |
| 137_INER_58A_M_CMX_D_13/04/2020 | B.1.1 | 10 | 7 | N:R203K,N:G204R,Orf1a:I404T,Orf1a:H712Y,Orf1b:A176V,Orf1b:P314L,S:D614G |
| 173_INER_34A_M_CMX_H_15/04/2020 | B.1.1 | 12 | 6 | M:A69S,N:R203K,N:G204R,Orf1b:P314L,Orf1b:L1629S,S:D614G |
| 192_INER_39A_M_CMX_D_17/04/2020 | B.1.609 | 8 | 4 | Orf1a:Y1920H,Orf1a:A4285V,Orf1b:P314L,S:D614G |
| 204_INER_52A_M_CMX_H_18/04/2020 | B.1 | 7 | 5 | Orf1a:T265I,Orf1b:P314L,Orf3a:Q57H,S:D614G,S:D936Y |
| 205_INER_34A_M_CMX_A_18/04/2020 | B.1 | 7 | 3 | Orf1b:P314L,Orf1b:V1768I,S:D614G |
| 207_INER_36A_M_MEX_H_17/04/2020 | B.1 | 9 | 7 | Orf1a:K141R,Orf1a:T265I,Orf1b:P314L,Orf1b:G2197D,Orf3a:Q57H,S:D614G, S:D936Y |
| 221_INER_59A_M_CMX_H_18/04/2020 | B.1.609 | 7 | 2 | Orf1b:P314L,S:D614G |
| 228_INER_53A_F_CMX_H_19/04/2020 | B.1 | 9 | 7 | Orf1a:K141R,Orf1a:T265I,Orf1b:P314L,Orf1b:G2197D,Orf3a:Q57H,S:D614G, S:D936Y |
| 229_INER_50A_M_CMX_H_19/04/2020 | B.1.609 | 10 | 5 | N:S327L,Orf1a:G3072C,Orf1b:P314L,Orf3a:S166L,S:D614G |
| 238_INER_59A_M_CMX_D_21/04/2020 | B.1.609 | 6 | 2 | Orf1b:P314L,S:D614G |
| 260_INER_43A_M_MEX_D_22/04/2020 | B.1.609 | 7 | 2 | Orf1b:P314L,S:D614G |
| 275_INER_45A_M_CMX_D_25/04/2020 | B.1 | 6 | 3 | Orf1b:P314L,S:V143F,S:D614G |
| 286_INER_48A_M_MEX_H_26/04/2020 | B.1.609 | 8 | 4 | Orf1a:A4285V,Orf1b:P314L,S:D614G,S:V1268I |
| 296_INER_38A_F_CMX_H_23/04/2020 | B.1 | 7 | 3 | Orf1b:P314L,Orf3a:Q57H,S:D614G |
| 317_INER_55A_F_CMX_H_27/04/2020 | B.1 | 9 | 4 | Orf1a:L1249F,Orf1a:N1995H,Orf1b:P314L,S:D614G |
| 689_InDRE_38A_F_YUC_D_10/04/2020 | B.1 | 6 | 3 | Orf1a:T2952I,Orf1b:P314L,S:D614G |
| 1498_InDRE_35A_M_CMX_D_15/04/2020 | B.1.609 | 8 | 4 | Orf1a:Y1920H,Orf1a:A4285V,Orf1b:P314L,S:D614G |
| 5361_InDRE_27A_F_TLA_D_30/04/2020 | B.1.609 | 8 | 3 | Orf1a:A4285V,Orf1b:P314L,S:D614G |
| 8279_IMSS_31A_M_CMX_D_01/04/2020 | B.1 | 6 | 4 | Orf1a:T265I,Orf1b:P314L,Orf3a:Q57H,S:D614G |
| 9823_IMSS_55A_M_CMX_H_05/04/2020 | B.1 | 6 | 3 | N:A359S,Orf1b:P314L,S:D614G |
| 10793_IMSS_26A_F_MEX_A_08/04/2020 | B.1 | 8 | 4 | Orf1a:E647G,Orf1a:S2132G,Orf1b:P314L,S:D614G |
| 10932_IMSS_30A_M_CMX_A_08/04/2020 | B.1.1 | 10 | 4 | N:R203K,N:G204R,Orf1b:P314L,S:D614G |
| 10938_IMSS_22A_M_CMX_H_08/04/2020 | B.1 | 8 | 4 | Orf1a:M3652T,Orf1b:P314L,Orf3a:Q57H,S:D614G |
| 11296_IMSS_49A_M_CMX_A_07/04/2020 | B.1 | 8 | 5 | N:G243C,Orf1a:T4031I,Orf1b:P314L,S:D614G,S:V1122L |
| 11398_IMSS_59A_M_GRO_D_08/04/2020 | B.1.1 | 8 | 4 | N:R203K,N:G204R,Orf1b:P314L,S:D614G |
| 11433_IMSS_43A_F_BCS_A_07/04/2020 | B.1.609 | 6 | 2 | Orf1b:P314L,S:D614G |
| 11461_IMSS_32A_M_GRO_A_08/04/2020 | B.1 | 10 | 4 | Orf1a:T50I,Orf1a:S142L,Orf1b:P314L,S:D614G |
| 11647_IMSS_33A_F_BCS_A_08/04/2020 | B.1.609 | 7 | 3 | Orf1a:S4398L,Orf1b:P314L,S:D614G |
| 11688_IMSS_48A_M_BCS_A_07/04/2020 | B.1.609 | 6 | 2 | Orf1b:P314L,S:D614G |
| 11692_IMSS_50A_F_BCS_A_07/04/2020 | B.1.609 | 6 | 2 | Orf1b:P314L,S:D614G |
| 11697_IMSS_37A_M_BCS_A_07/04/2020 | B.1.609 | 7 | 2 | Orf1b:P314L,S:D614G |
| 11712_IMSS_41A_F_BCS_A_08/04/2020 | B.1.609 | 6 | 2 | Orf1b:P314L,S:D614G |
| 11713_IMSS_35A_F_BCS_A_08/04/2020 | B.1.609 | 6 | 2 | Orf1b:P314L,S:D614G |
| 11768_IMSS_42A_M_BCN_A_08/04/2020 | B.1.609 | 9 | 5 | Orf1a:P777S,Orf1a:P3613S,Orf1b:P314L,Orf1b:S1003G,S:D614G |
| 11769_IMSS_24A_M_BCN_A_08/04/2020 | B.1.609 | 8 | 3 | Orf1a:P3613S,Orf1b:P314L,S:D614G |
| 12060_IMSS_26A_M_MEX_A_08/04/2020 | B.1 | 8 | 4 | E:L73F,Orf1a:T4355I,Orf1b:P314L,S:D614G |
| 12542_IMSS_39A_M_BCS_A_06/04/2020 | B.1.609 | 6 | 2 | Orf1b:P314L,S:D614G |
| 14214_IMSS_26A_M_CMX_A_16/04/2020 | B.1.1 | 10 | 5 | N:R203K,N:G204R,Orf1b:P314L,Orf1b:V1706I,S:D614G |
| 14215_IMSS_47A_F_CMX_A_16/04/2020 | B.1.189 | 10 | 5 | Orf1a:V2047F,Orf1b:P314L,Orf7a:E95*,Orf9b:R32L,S:D614G |
| 16144_IMSS_27A_M_MEX_A_19/04/2020 | B.1.189 | 9 | 4 | Orf1a:V2047F,Orf1b:P314L,Orf9b:R32L,S:D614G |
| 16221_IMSS_46A_M_CMX_H_17/04/2020 | B.1.1 | 11 | 7 | N:R203K,N:G204R,Orf1a:H712Y,Orf1a:T3058I,Orf1a:L3606F,Orf1b:P314L,S:D614G |
| 17307_IMSS_29A_F_MEX_A_17/04/2020 | B.1 | 7 | 2 | Orf1b:P314L,S:D614G |
| 17360_IMSS_38A_F_GRO_A_17/04/2020 | B.1 | 6 | 2 | Orf1b:P314L,S:D614G |
nt, nucleotide; aa, amino acid.
Compared to Wuhan-Hu-1 reference genome (GenBank MN908947).
FIG 2Lineage distribution across severity groups. The PANGO lineages of genomic sequences belonging to the ambulatory care (A), hospitalized (H), and deceased (D) study groups were determined using the PANGOLIN tool v3.1.7 (lineages version 2021-07-09) (54).
FIG 3Phylogenetic placement of Mexican isolates. ML tree inferred from the Orf1ab+S alignment comprising 565 sequences, including the 57 Mexican virus genome sequences generated for this work, classified under the different disease outcome groups (A, indicated in red; H, indicated in blue; and D, indicated in yellow). An additional 34 Mexican sequences available from the GISAID platform at the time of the analysis were also included (indicated in green), together with the 17 Mexican isolates characterized from the earliest introduction events of SARS-CoV-2 within the country (indicated in purple) (26). From the 57 isolates sequenced for this study, 27 formed a single well-supported cluster belonging to the PANGO lineages B.1.609 and B.1.189. Within this single cluster, four subclusters were identified (SC1-4), all diverging from the same ancestral sequence. The 30 remaining Mexican isolates sequenced for this study displayed a scattered positioning across the tree.
FIG 4Distribution of identified mutations in the SARS-CoV-2 genome. For each single nucleotide variant identified, the number of viral isolates was plotted as a function of the nucleotide position along the SARS-CoV-2 genome. Single nucleotide variants were divided into 4 categories: in-frame deletion (black bar), mutations in noncoding regions (dark yellow bars), nonsynonymous mutations (pink bars), and synonymous mutations (turquoise bars). The number of single nucleotide variants in each category is indicated in the legend box. The SARS-CoV-2 genome organization is represented below the plot.