| Literature DB >> 31138840 |
Takashi Shiina1, Shingo Suzuki1, Nicolas Congy-Jolivet2,3, Alice Aarnink2, Henri-Jean Garchon4,5, Nathalie Dereuddre-Bosquet6, Bruno Vaslin6, Nicolas Tchitchek6, Delphine Desjardins6, Brigitte Autran7, Olivier Lambotte6,8, Ioannis Theodorou9, Roger Le Grand6, Antoine Blancher10,11,12.
Abstract
The association between gene polymorphisms and plasma virus load at the set point (SP-PVL) was investigated in Mauritian macaques inoculated with SIV. Among 44 macaques inoculated with 50 AID50, six individuals were selected: three with SP-PVL among the highest and three with SP-PVL among the lowest. The exons of 390 candidate genes of these six animals were sequenced. Twelve non-synonymous single nucleotide polymorphisms (NS-SNPs) lying in nine genes potentially associated with PVL were genotyped in 23 animals. Three NS-SNPs with probabilities of association with PVL less than 0.05 were genotyped in a total of 44 animals. One NS-SNP lying in exon 1 of the IL37 gene displayed a significant association (p = 3.33 × 10-4) and a strong odds ratio (19.52). Multiple linear regression modeling revealed three significant predictors of SP-PVL, including the IL37 exon 1 NS-SNP (p = 0.0004) and the MHC Class IB haplotypes M2 (p = 0.0007) and M6 (p = 0.0013). These three factors in conjunction explained 48% of the PVL variance (p = 4.8 × 10-6). The potential role of IL37 in the control of SIV infection is discussed.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31138840 PMCID: PMC6538695 DOI: 10.1038/s41598-019-44235-x
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
SNP information.
| Sample ID | Total SNP number | SNP number of coding region | SNP number of non-coding region | |
|---|---|---|---|---|
| Non-synonymous | Synonymous | |||
| 9204 | 3.500 | 447 | 522 | 2.531 |
| 11137 | 2.661 | 476 | 550 | 1.635 |
| 11245 | 3.374 | 440 | 546 | 2.388 |
| Z857 | 2.854 | 426 | 495 | 1.933 |
| 10435 | 3.222 | 421 | 486 | 2.315 |
| 10465 | 3.434 | 523 | 565 | 2.346 |
| Total | 19.045 | 2.733 | 3.164 | 13.148 |
Genotypes of 12 NS- SNPs using 14 animals with low PVL (log-PVL <2.7) and 9 animals with high PVL (log-PVL >4.3).
| Locus | CD244 | IFNGR1 | LEF1 | IRF9 | IRF9 | IFIT3 | CXCR2 | EIF2AK3 | IL18RAP | IL37 | RANBP2 | DDX53 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Chromosome | Chr.1 | Chr.4 | Chr.5 | Chr.7 | Chr.7 | Chr.9 | Chr.12 | Chr.13 | Chr.13 | Chr.13 | Chr.13 | Chr.X |
| Location | 90,806,827 | 41,537,040 | 107,555,612 | 87,945,244 | 87,945,249 | 87,895,310 | 107,701,421 | 19,755,581 | 6,821,318 | 17,064,452 | 13,295,674 | 20,943,589 |
| Ref./Var. | T/C | C/T | T/A | A/G | C/T | C/T | G/A | T/C | A/C | C/T | C/T | G/C |
| Odds ratio |
| 1.08 | 1.16 |
|
| 2.92 | 1.04 | 1.32 | 1.96 |
| 1.64 | 1.24 |
| 95% Cl | 1.59 to 110.40 | 0,28 to 4,21 | 0.23 to 6.47 | 1.02 to 18.16 | 1.02 to 18.16 | 0.72 to 13.50 | 0.18 to 5.37 | 0.34 to 5.14 | 0.50 to 8.36 | 2.24 to 986.65 | 0.31 to 9.79 | 0.28 to 5.32 |
|
| 7.62 × 10−03 | 1 | 1 | 3.40 × 10−02 | 3.40 × 10−02 | 0.129 | 1 | 0.764 | 0.364 |
| 0.717 | 0.753 |
| Animals (route)[log-PVL] | ||||||||||||
| 23037 (IR) [0.000] | C/C | C/T | T/A | A/A | T/T | T/T | G/G | T/C | A/C | C/C | C/T | G/G |
| 14468 (IR) [1.732] | C/C | T/T | T/T | A/A | T/T | C/T | G/G | T/C | A/C | C/C | C/T | C/C |
| 9204 (IV) [1.778] | C/C | C/C | T/A | A/A | T/T | C/T | G/A | T/C | A/A | C/C | C/C | G/G |
| 11137 (IV) [1.778] | C/C | C/C | T/A | A/A | T/T | C/T | G/A | T/T | A/A | C/C | C/C | G/G |
| 11245 (IV) [1.778] | C/C | C/C | T/A | A/A | T/T | C/T | G/A | T/T | A/A | C/C | C/C | G/G |
| 11637 (IV) [1.778] | C/T | T/T | A/A | G/G | C/C | C/C | G/G | C/C | A/C | C/C | T/T | G/G |
| 15885 (IR) [1.959] | C/T | C/T | T/T | A/A | T/T | T/T | G/G | T/C | A/C | C/C | nd | C/C |
| OBHJ6 (IR) [2.153] | C/C | C/C | T/T | A/A | T/T | T/T | G/G | C/C | C/C | C/C | C/C | G/G |
| 9413 (IV) [2.222] | C/C | C/T | T/T | G/G | C/C | T/T | G/A | T/T | C/C | C/T | C/C | C/C |
| OBPR6 (IR) [2.283] | C/C | C/T | T/T | A/A | T/T | C/T | G/G | T/C | A/C | C/C | nd | G/G |
| Z776 (IV) [2.477] | C/C | T/T | T/T | G/G | C/C | C/C | G/G | T/T | C/C | C/C | nd | G/G |
| 10515 (IV) [2.526] | C/C | C/T | T/T | A/G | T/C | C/C | G/G | T/T | A/C | C/C | T/T | C/C |
| 10228 (IV) [2.565] | C/C | T/T | T/T | G/G | C/C | C/T | G/A | T/C | A/C | C/C | nd | G/G |
| 15461 (IR) [2.613] | C/C | C/C | T/A | A/A | T/T | C/T | G/A | C/C | A/C | C/C | T/T | G/G |
| 4763 (IV) [4.326] | C/C | C/T | T/A | G/G | C/C | C/T | G/G | T/T | A/C | C/C | C/C | G/G |
| 8102 (IV) [4.506] | C/C | C/T | T/T | G/G | C/C | C/T | G/G | T/T | A/A | T/T | C/C | G/G |
| 20483 (IV) [4.586] | C/C | C/T | T/A | A/G | T/C | T/T | G/A | T/T | A/C | C/T | nd | G/G |
| 11360 (IV) [4.828] | T/T | C/C | T/T | G/G | C/C | C/T | G/A | C/C | A/A | C/C | C/C | G/G |
| Z860 (IV) [4.968] | C/T | C/C | T/A | A/A | T/T | C/C | G/A | C/C | A/A | T/T | C/T | G/G |
| Z857 (IV) [5.239] | C/T | C/T | T/T | G/G | C/C | C/C | G/G | T/C | A/C | C/T | C/C | C/C |
| OBRF6 (IR) [5.265] | T/T | C/T | T/A | A/A | T/T | C/C | G/A | C/C | A/C | C/C | nd | G/G |
| 10435 (IV) [5.399] | C/T | T/T | T/T | G/G | C/C | C/C | G/G | T/C | A/C | C/T | T/T | C/C |
| 10465 (IV) [5.559] | C/T | C/T | T/T | A/G | T/C | C/C | G/G | T/C | A/C | C/T | C/T | C/C |
The odds ratio and 95% confidence interval (CI) were calculated as detailed in Materials and Methods.
Probabilities of association (p) were obtained using the Fisher exact test. Boldface p values indicate significance levels <5 × 10−2. Applying the Bonferroni correction (because we studied 11 candidate genes, the threshold of significance is 4.5 × 10−3), only IL37 was had a significant association with PVL. In the case of IRF9, because the two SNPs yielded homologues genotypes in 23 animals, only one of the two SNPs was studied for the additional animals (see Table 3).
Genotypes of the three NS-SNPs with 22 animals above the median log-PVL and 22 animals below the median log-PVL.
| Locus name | CD244 | IRF9 | IL37 (IL37) |
|---|---|---|---|
| Chromosome | Chr.1 | Chr.7 | Chr.13 |
| Location | 90,806,827 | 87,945,244 | 17,064,452 |
| Ref./Var. | C/T | A/G | C/T |
| Odds Ratio | 4.59 | 1.34 |
|
| 95% confidence | 1.27 to 21.09 | 0.51 to 3.57 | 2.70 to 863.87 |
|
| 0.016 | 0.656 |
|
|
| |||
| 23037 (IR) [0.000] | C/C | A/A | C/C |
| 14468 (IR) [1.732] | C/C | A/A | C/C |
| 9204 (IV) [1.778] | C/C | A/A | C/C |
| 11137 (IV) [1.778] | C/C | A/A | C/C |
| 11245 (IV) [1.778] | C/C | A/A | C/C |
| 11637 (IV) [1.778] | C/T | G/G | C/C |
| 15885 (IR) [1.959] | C/T | A/A | C/C |
| OBHJ6 (IR) [2.153] | C/C | A/A | C/C |
| 9413 (IV) [2.222] | C/C | G/G | C/T |
| OBG7 (IR) [2.228] | C/C | A/A | C/C |
| OBPR6 (IR) [2.283] | C/C | A/A | C/C |
| Z776 (IV) [2.477] | C/C | G/G | C/C |
| 10515 (IV) [2.526] | C/C | A/G | C/C |
| 10228 (IV) [2.565] | C/C | G/G | C/C |
| 15461 (IR) [2.613] | C/C | A/A | C/C |
| 11296 (IV) [2.771] | C/C | A/G | C/C |
| OBFE6 (IR) [2.899] | C/C | A/G | C/C |
| 9691 (IV) [2.905] | C/C | A/A | C/C |
| 20595 (IV) [2.964] | C/C | A/A | C/C |
| 8249 (IV) [2.998] | C/C | A/A | C/C |
| 23060 (IR) [3.094] | C/T | A/G | C/C |
| 20351 (IR) [3.094] | C/T | G/G | C/C |
| 20654 (IV) [3.137] | C/C | A/A | C/C |
| 15232 (IR) [3.273] | T/T | A/G | C/T |
| 10116 (IV) [3.398] | C/C | A/A | C/C |
| 15693 (IV) [3.428] | C/C | A/A | C/T |
| 473 (IV) [3.499] | T/T | A/G | C/C |
| 10024 (IV) [3.544] | C/T | A/A | C/C |
| 9345 (IV) [3.614] | C/C | A/A | C/C |
| 20525 (IV) [3.925] | C/C | A/G | C/T |
| 9859 (IV) [4.004] | C/C | A/A | C/T |
| 8141 (IV) [4.053] | C/C | A/G | C/C |
| 15596 (IV) [4.053] | C/T | A/A | C/T |
| 23014 (IR) [4.124] | C/C | A/A | C/T |
| OBRG6 (IR) [4.262] | C/C | A/G | C/C |
| 4763 (IV) [4.326] | C/C | G/G | C/C |
| 8102 (IV) [4.506] | C/C | G/G | T/T |
| 20483 (IV) [4.586] | C/C | A/G | C/T |
| 11360 (IV) [4.828] | T/T | G/G | C/C |
| Z860 (IV) [4.968] | C/T | A/A | T/T |
| Z857 (IV) [5.239] | C/T | G/G | C/T |
| OBRF6 (IR) [5.265] | T/T | A/A | C/C |
| 10435 (IV) [5.399] | C/T | G/G | C/T |
| 10465 (IV) [5.559] | C/T | A/G | C/T |
The odds ratio and 95% confidence interval (CI) were calculated as detailed in Materials and Methods. Probabilities of association (p) were obtained using the Fisher exact test. Significant p values are represented in bold.
We tested the effect of these three gene variants on the PVL using a linear model. We also introduced information on the MHC haplotypes, since these haplotypes were previously shown to influence PVL. The seven haplotypes described in this population were treated as binary variables (presence or absence). Starting from the full model, we removed the non-significant terms in a stepwise manner. This left only three significant predictors, namely IL37 SNP (p = 0.0026), M2 (p = 0.0027), and M6 (p = 0.0023) haplotypes. These three factors in conjunction explained 48% of the PVL variance (p = 4.8 × 10−6).
Figure 1The polymorphic positions (1), (2), (3) and (4) correspond to positions 17064328, 17064347, 17064414, and 17064452 of Mafa chr.13 (whole genome annotation v5.0), respectively. The SNP at position (4) (Chr.13 :17064452) was significantly associated to the SP-PVL. Mafa-ref correspond to the sequence of the Macaca_fascicularis_5.0 genome annotation. Arrow 1. The macaque exon 1 displays an in-frame ATG codon nine codon upstream of the ATG start codon of the human IL-37 gene. In the human genome, this start codon is absent. Arrow 2. Location of the human IL-37 start codon. Arrow 3. A 27 bp motif is duplicated in the most frequent IL-37 macaque alleles studied here. This insertion was absent in one allele of animal 10465_IL1F7. A similar allele was found in another animal (animal 8102) among the 44 Mauritian cynomolgus macaques studied here (see text for details). Arrow 4. The 27 bp repeat is absent from Mafa and Mamu whole genome annotations. It is also absent from the human IL-37 gene. The 3′ duplicated 27 bp motif differs from its homologue by one base (there is a T at position 207 instead of a C at position 180). This substitution is synonymous.
Figure 2The amino acid positions are numbered by reference to the macaque sequences characterized in this study (number in italics) or by reference to the human IL37 isoform b (NP_055254 corresponding to coding sequence #NM014439). The macaque protein sequences were deduced from cDNA sequences of animals #8141, #8102 and #OBHJ6 that were characterized from cDNA amplified by RT-PCR from buffy coats RNA (see Materials and Methods for details). The 46 first bases in 5′ were characterized at the genomic level by amplifying and sequencing exon 1 and the surrounding regions. Other macaque protein sequences were deduced from genomic sequences of six animals (#9204, #11137, #11245, #Z857, #10435 and #10465), which were characterized in these six animals by means of sequence capture. The NS variant associated with PVL is responsible for the amino acid polymorphism (T- > M) at position 20. Animals #Z857 and #10435 are heterozygous at this position. Animal #11137 was heterozygous for a non-synonymous variant (codon 7) that is not associated with PVL. Two animals (#10465 and #8141) have alleles (“#10465_2” and “#8141_2”) missing the 27 bp repeat in exon 1 (for more details, see text). Ref_Mafa and Ref_Mamu sequences correspond to XP_015288882 (predicted transcript XM_015433396) and XP_002808097 (predicted transcript XM_002808051), respectively. Note that the Mamu IL-37 predicted mRNA sequence (XM_002808051) present a nonsense mutation at codon 136 (homologous to codon 145 of Mafa) that is indicated by a “*” in the figure. Another Mamu IL-37 predicted amino acid sequence (EHH22430; locus tag: EGK_05691) does not have a stop codon at position 136 and is 8 amino acids shorter than XP_002808097 at the amino-terminal extremity[2]. All Mafa or Mamu IL37 coding sequences available in databanks lack the 27 bp motif duplication in exon1.
Figure 3Macaque sequences are described in the legend for Fig. 1. The human variants (var 1 to 13) are from Kang et al.[47]. Enzyme cleavage sites were predicted using a Prosper server[55]. The underlined predicted cleavage sites are those observed in macaque proteins but not in human ones. The amino acid substitution observed at position 7 does not modify the MMP9 cleavage site. The presence of a threonine at position 20 of macaque protein (instead of a methionine as in other macaque variants and as in human and most nonhuman primates) destroys an elastase 2 cleavage site (rectangle). Six human polymorphic cleavage sites are absent from macaque proteins. Also note that the macaque proteins can have two caspase 1 cleavage sites instead of only one in human proteins (and rare macaque variants). The second caspase 1 cleavage site (rectangle) of macaque proteins is encoded by a duplicated 27 bp motif.
Comparison of IL37 allele frequencies in HIV controllers and the corresponding reference populations (1000 genomes data set).
|
| |||||
|
|
|
|
| ||
|
|
|
|
| ||
| Caucasians | 22 | 212 | 19 | 215 | 0.7441 |
| Africans | 18 | 90 | 19 | 89 | 0.8592 |
|
| |||||
|
|
|
|
| ||
|
|
|
|
| ||
| Caucasians | 21 | 213 | 19 | 215 | 0.7446 |
| Africans | 18 | 88 | 18 | 88 | >0.9999 |
|
| |||||
|
|
|
|
| ||
|
|
|
|
| ||
| Caucasians | 81 | 155 | 71 | 165 | 0.3753 |
| Africans | 84 | 24 | 89 | 19 | 0.4012 |
|
| |||||
|
|
|
|
| ||
|
|
|
|
| ||
| Caucasians | 80 | 156 | 71 | 165 | 0.3768 |
| Africans | 76 | 30 | 81 | 25 | 0.6230 |
aAllele numbers observed among HIV controllers (CODEX cohort). For some individuals, we did not obtain the genotype of the four SNPs, so the sums of alleles vary from one SNP to another.
bThe deduced allele numbers were calculated using the allele frequencies from the 1000 genomes dataset.
cFisher’s exact test probabilities.