| Literature DB >> 33902453 |
Elena N Judd1, Alison R Gilchrist1, Nicholas R Meyerson1, Sara L Sawyer2.
Abstract
BACKGROUND: The Type I interferon response is an important first-line defense against viruses. In turn, viruses antagonize (i.e., degrade, mis-localize, etc.) many proteins in interferon pathways. Thus, hosts and viruses are locked in an evolutionary arms race for dominance of the Type I interferon pathway. As a result, many genes in interferon pathways have experienced positive natural selection in favor of new allelic forms that can better recognize viruses or escape viral antagonists. Here, we performed a holistic analysis of selective pressures acting on genes in the Type I interferon family. We initially hypothesized that the genes responsible for inducing the production of interferon would be antagonized more heavily by viruses than genes that are turned on as a result of interferon. Our logic was that viruses would have greater effect if they worked upstream of the production of interferon molecules because, once interferon is produced, hundreds of interferon-stimulated proteins would activate and the virus would need to counteract them one-by-one.Entities:
Keywords: Arms race; Evolution; Interferon; Positive selection; Virus
Mesh:
Substances:
Year: 2021 PMID: 33902453 PMCID: PMC8074226 DOI: 10.1186/s12862-021-01783-z
Source DB: PubMed Journal: BMC Ecol Evol ISSN: 2730-7182
Some examples of genes in Type I interferon pathways that bear the signature of successive rounds of positive natural selection
| Category | Gene under positive selection | Known direct virus interactions | Literature showing positive selection |
|---|---|---|---|
| induction | MB21D1/cGAS | Many classes of viruses | Mozzi et al. 2015, Ma et al. 2016 |
| Induction | IFI16 | HCMV | van der Lee 2017, Dell’Oste et al. 2014 |
| Induction | ISG15 | Influenza | Zhao et al. 2013, Zhao et al. 2010 |
| Induction | MAVS | Hepatitis C virus | van der Lee 2017, Anggakusuma et al. 2016 |
| Induction | STING | Flaviviruses | Mozzi et al. 2015, Stabell et al. 2018, Ding et al. 2018 |
| Induction | TRIM25 | Influenza | Malfavon-Borja et al. 2013, Gack et al. 2009 |
| ISG | ADAR | RNA viruses | Forni et al. 2014, Pfaller et al. 2018 |
| ISG | MxB | Many classes of viruses | Mitchell et al. 2015, Haller et al. 2011 |
| ISG | EIF2AK2/PKR | Influenza | Elde et al. 2009, Dauber et al. 2009 |
| ISG | RNAse L | TMEV | van der Lee 2017, Sorgeloos et al. 2013 |
| ISG | Tetherin | HIV | Lim et al. 2010, McNatt et al. 2009 |
| ISG | TRIM15 | Retroviruses | Malfavon-Borja et al. 2013, Uchil et al. 2008 |
| ISG | TRIM22 | Influenza | Sawyer et al. 2007, Di Pietro et al. 2013 |
| ISG | TRIM31 | Retroviruses | Malfavon-Borja et al. 2013, Uchil et al. 2008 |
| ISG | TRIM38 | Retroviruses | Malfavon-Borja et al. 2013, Uchil et al. 2008 |
| ISG | TRIM5α | HIV | Johnson et al. 2009, Sawyer et al. 2005 |
| ISG | RSAD2/Viperin | RNA viruses | Lim et al. 2012, Panayiotou et al. 2018 |
| ISG | SAMHD1 | HIV-2 | Laguette et al. 2012, Coquel et al. 2018 |
Fig. 1Definition of the gene classes analyzed in this study. A highly simplified illustration of the Type I interferon response is shown, to represent the two classes of genes analyzed. An infected, interferon-producing cell is shown on the left, and on the right is a cell then responding to the secreted interferon. In this study, “induction genes” are genes encoding any protein that acts in a way that ultimately leads to the expression of interferon-stimulated genes. Induction genes encode sensors of initial infection (pattern recognition receptors, toll-like receptors, and nucleic acid sensors), signaling cascade proteins, interferon molecules, interferon receptors, and transcription factors acting to induce interferon-stimulated genes. Also included are signaling molecules in the response to the interferon molecules that are produced and secreted (right cell). The second gene class, the interferon-stimulated genes, are a hugely diverse group of genes upregulated when cells are activated by interferon signaling. A relatively small number of these genes have been functionally characterized, but many encode proteins that interact directly with viruses or inhibit cellular processes that can be hijacked by viruses during infection
Fig. 2Quality and equity metrics for the three groups of multiple sequence alignments created. a A cladogram representing a species tree of the primates used in this analysis [44]. For each species, the branch width represents the percentage of multiple sequence alignments produced (out of 331) that contain that ortholog. All species were represented in over 50% of the alignments. Only white-cheeked gibbon and black snub-nosed monkey were represented in fewer than 75% of the alignments. b The number of species/sequences represented in the final 331 multiple sequence alignments, illustrated for each of the three categories of genes. c Tree length is the sum of the branch lengths along the tree or, in other words, the average number of nucleotide substitutions per site in an alignment. The relative frequencies of lengths are plotted as a separate histogram for each category, and the average tree length of each category is indicated in the legend
Fig. 3Interferon-stimulated genes are enriched for sequence signatures of recurrent positive natural selection. a Graphical illustrations of the M8 and M8a nested codon models in PAML (Yang 2007). M8a is a null model where all codons in the multiple sequence alignment must be placed into one of 11 bins of specific dN/dS values. Ten of these bins are distributed along a beta distribution of dN/dS values bounded between 0 and 1. The 11th bin is defined to have a dN/dS value = 1. M8 is identical, except that the 11th bin can have a dN/dS value greater than one [43]. The double-sided arrow indicates that the dN/dS value of this bin is optimized in the fitting of the data to the model. b 331 gene alignments were fit to both the M8 and the M8a models. A likelihood ratio test of the two nested models was conducted, and the final column indicates the number of genes in each category for which the null model M8a could be rejected in favor of the model of positive selection (p < 0.05). We did a Benjamini–Hochberg correction at 10% FDR to control for false positives. In the pie charts, the proportion of genes in each category that are under positive selection (red) is shown, from the table above. Using a two-tailed Fisher's exact test and Benjamini–Hochberg correction at 10% FDR, the difference in the number of genes rejecting the neutral model (M8a) between random genes and interferon-stimulated genes was significant
Genes identified as evolving under positive selection
| Induction genes evolving under positive selection | Interferon-stimulated genes evolving under positive selection |
|---|---|
| CASP10 | ADAR*† |
| CIITA* | APOBEC3F† |
| CISH | APOBEC3G*† |
| DDX58*† | APOL2* |
| DDX60*† | APOL6* |
| EPOR | BST2*† |
| IFI16* | CCL8 |
| IFNAR1* | CD47 |
| IFNAR2 | CEACAM1† |
| MAVS*† | CRP |
| MB21D1*† | DAPK1 |
| MNDA | EIF2AK2*† |
| OAS1*† | GBP2*† |
| OAS2*† | IFI27† |
| PTPRC† | IFI44 |
| PYHIN1 | IFI44L |
| RNASEL*† | IFI6 |
| SPP1 | IFIT1* |
| STAT2*† | IFIT2* |
| TLR1*† | MLKL |
| TLR2*† | MX1*† |
| TLR4*† | MX2*† |
| TLR5*† | PHF11 |
| TLR6*† | RSAD2*† |
| TLR8*† | RTP4*† |
| TMEM173*† | SAMD9 |
| TRIM21*† | SAMHD1*† |
| TRIM25*† | SLFN12* |
| TYK2 | TAGAP |
| TMEM140 | |
| TNK2 | |
| TRIM22*† | |
| TRIM5*† | |
| ZC3HAV1*† |
Genes that have been previously identified as rapidly evolving in primates (*) and which genes have known interactions with pathogenic elements (†) are from the following studies: [27, 42, 49–64]
* Previously identified as being under positive selection in primates
† Published interaction with pathogen
Fig. 4Interferon-stimulated genes have a higher whole-gene dN/dS value and more codons under positive selection than other genes. a Top: M0 is a codon model in PAML where all codons in an alignment are assigned to a single estimated dN/dS value. Below: box plot of the whole gene average dN/dS values determined by M0 in each category. *p-value < 0.05. a Top: The M8 model of codon evolution, as explained in the legend to Fig. 3a. Below: box plot of percentage of codon sites per gene in the dN/dS > 1 bin in the M8 model. *p-value < 0.05; ***p-value < 0.001. C Top: M2, a simple model that allows for positive selection, places all codons into one of three bins: a bin at dN/dS < 1 (conserved), a bin at dN/dS = 1 (neutral), and a bin at dN/dS > 1 (positive selection). The double-sided arrow indicates that the dN/dS value of this bin is optimized in the fitting of the data to the model. Below: box plot of the proportion of codon sites per gene in the dN/dS < 1 bin in model M2. *p-value < 0.05