| Literature DB >> 24092772 |
Benedict Yuen1, Joanne M Bayes, Sandie M Degnan.
Abstract
The "Nucleotide-binding domain and Leucine-rich Repeat" (NLR) genes are a family of intracellular pattern recognition receptors (PRR) that are a critical component of the metazoan innate immune system, involved in both defense against pathogenic microorganisms and in beneficial interactions with symbionts. To investigate the origin and evolution of the NLR gene family, we characterized the full NACHT domain-containing gene complement in the genome of the sponge, Amphimedon queenslandica. As sister group to all animals, sponges are ideally placed to inform our understanding of the early evolution of this ancient PRR family. Amphimedon queenslandica has a large NACHT domain-containing gene complement that is dominated by bona fide NLRs (n = 135) with varied phylogenetic histories. Approximately half of these have a tripartite architecture that includes an N-terminal CARD or DEATH domain. The multiplicity of the A. queenslandica NLR genes and the high variability across the N- and C-terminal domains are consistent with involvement in immunity. We also provide new insight into the evolution of NLRs in invertebrates through comparative genomic analysis of multiple metazoan and nonmetazoan taxa. Specifically, we demonstrate that the NLR gene family appears to be a metazoan innovation, characterized by two major gene lineages that may have originated with the last common eumetazoan ancestor. Subsequent lineage-specific gene duplication, gene loss and domain shuffling all have played an important role in the highly dynamic evolutionary history of invertebrate NLRs.Entities:
Keywords: NACHT domain; Porifera; innate immunity; invertebrate genomics
Mesh:
Substances:
Year: 2013 PMID: 24092772 PMCID: PMC3879445 DOI: 10.1093/molbev/mst174
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
The Full Complement of 151 Predicted NACHT Domain-Containing Genes Encoded by the Genome of Amphimedon queenslandica.
| Clade ID | Contig | Gene Model Code | Assigned Name |
|---|---|---|---|
| AqDEATH-NACHT | Contig5503 | Aqu1.202261 | AqDN1 |
| AqNLR clade A | Contig8999 | Aqu1.204459 | AqNLRX5 |
| AqDEATH-NACHT | Contig9585 | Aqu1.204933 | AqDN2 |
| AqNLR clade A | Contig9959 | Aqu1.205319 | AqNLRX6 |
| AqDEATH-NACHT | Contig10075 | Aqu1.205456 | AqDN3 |
| AqNLR clade A | Contig10163 | Aqu1.205553-snap.11240 | AqNLRC1 |
| AqDEATH-NACHT | Contig10378 | Aqu1.206247 | AqDN5 |
| AqDEATH-NACHT | Contig10379 | Aqu1.205832 | AqDN4 |
| AqNLR clade A | Contig10757 | Aqu1.206272 | AqNLRX7 |
| AqNLR clade C | Contig10761 | Aqu1.206280 | AqNLRD1 |
| AqNLR clade A | Contig10879 | Aqu1.206401 | AqNLRX8 |
| AqNLR clade A | Contig10879 | hom.g7908.t1 | AqNLRC2 |
| AqNLR clade A | Contig11119 | Aqu1.206703 | AqNLRX9 |
| AqNLR clade A | Contig11252 | Aqu1.206887-hom.g8479.t1 | AqNLRD10 |
| AqNLR clade A | Contig11309 | Aqu1.206954 | AqNLRX10 |
| AqNLR clade A | Contig11352 | Aqu1.207007 | AqNLRX11 |
| AqNLR clade A | Contig11422 | Aqu1.207127 | AqNLRX12 |
| AqNLR clade A | Contig11546 | Aqu1.207322 | AqNLRX13 |
| AqNLR clade A | Contig11679 | Aqu1.207562 | AqNLRX14 |
| AqNLR clade A | Contig11719 | Aqu1.207649 | AqNLRX15 |
| AqNLR clade A | Contig11725 | Aqu1.207664 | AqNLRX16 |
| AqNLR clade A | Contig11740 | Aqu1.207697 | AqNLRX17 |
| AqDEATH-NACHT | Contig11763 | Aqu1.207748 | AqDN6 |
| AqNLR clade A | Contig11787 | Aqu1.207797 | AqNLRX18 |
| AqNLR clade A | Contig11837 | Aqu1.207890-hom.g9670.t1 | AqNLRC3 |
| AqNLR clade A | Contig11954 | Aqu1.208150 | AqNLRD11 |
| AqNLR clade A | Contig11961 | Aqu1.208167 | AqNLRX19 |
| AqNLR clade A | Contig11972 | Aqu1.208189 | AqNLRX20 |
| AqNLR clade A | Contig12017 | Aqu1.208293 | AqNLRD12 |
| AqNLR clade A | Contig12054 | Aqu1.208371 | AqNLRX21 |
| AqNLR clade A | Contig12055 | Aqu1.208372 | AqNLRX22 |
| AqNLR clade A | Contig12282 | Aqu1.208919 | AqNLRX23 |
| AqNLR clade A | Contig12315 | Aqu1.209017 | AqNLRC4 |
| AqNLR clade A | Contig12346 | Aqu1.209113 | AqNLRC5 |
| AqNLR clade A | Contig12356 | hom.g11149.t1-Aqu0.1446184 | AqNLRX24 |
| AqNLR clade A | Contig12364 | Aqu1.209168 | AqNLRX25 |
| AqNLR clade A | Contig12383 | hom.g11247.t1 | AqNLRD13 |
| AqNLR clade A | Contig12389 | Aqu1.209241 | AqNLRX26 |
| AqDEATH-NACHT | Contig12407 | Aqu1.209295 | AqDN7 |
| AqNLR clade A | Contig12431 | Aqu1.209372-hom.g11418.t1 | AqNLRD14 |
| AqNLR clade A | Contig12433 | Aqu1.209379 | AqNLRX27 |
| AqNLR clade A | Contig12433 | hom.g11429.t1 | AqNLRD15 |
| AqNLR clade A | Contig12489 | Aqu1.209557 | AqNLRX28 |
| AqNLR clade A | Contig12517 | Aqu1.209696 | AqNLRX29 |
| AqNLR clade A | Contig12522 | Aqu1.209720 | AqNLRX30 |
| AqNLR clade A | Contig12541 | Aqu1.209792 | AqNLRX31 |
| AqNLR clade A | Contig12563 | Aqu1.209880 | AqNLRX32 |
| AqNLR clade A | Contig12563 | Aqu1.209876 | AqNLRX33 |
| AqNLR clade A | Contig12595 | hom.g12176.t1 | AqNLRX34 |
| AqNLR clade A | Contig12595 | Aqu1.210033 | AqNLRD16 |
| AqNLR clade A | Contig12612 | Aqu1.210112 | AqNLRX35 |
| AqNLR clade A | Contig12676 | Aqu1.210376 | AqNLRX36 |
| AqNLR clade A | Contig12677 | Aqu1.210377 | AqNLRX37 |
| AqNLR clade C | Contig12691 | aq_ka12691x00220-12691x00230 | AqNLRD2 |
| AqNLR clade A | Contig12692 | Aqu1.210459 | AqNLRX38 |
| AqNLR clade A | Contig12704 | Aqu1.210507 | AqNLRX39 |
| AqNLR clade A | Contig12733 | Aqu1.210691 | AqNLRX40 |
| AqNLR clade A | Contig12734 | Aqu1.210692 | AqNLRX41 |
| AqNLR clade A | Contig12746 | Aqu1.210730 | AqNLRX42 |
| AqNLR clade A | Contig12746 | Aqu1.210737 | AqNLRD17 |
| AqNLR clade A | Contig12749 | Aqu1.210748-hom.g12993.t1 | AqNLRX43 |
| AqNLR clade A | Contig12812 | snap.24323-1447583 | AqNLRX44 |
| AqNLR clade A | Contig12829 | Aqu1.211218 | AqNLRX45 |
| AqNLR clade A | Contig12852 | Aqu1.211347 | AqNLRX46 |
| AqNLR clade A | Contig12852 | Aqu1.211348 | AqNLRD18 |
| AqNLR clade A | Contig12853 | Aqu1.211358 | AqNLRX47 |
| AqNLR clade A | Contig12862 | Aqu0.1447787 | AqNLRC6 |
| AqNLR clade A | Contig12862 | Aqu1.211413 | AqNLRX48 |
| AqNLR clade A | Contig12883 | Aqu1.211532 | AqNLRX49 |
| AqNLR clade A | Contig12887 | Aqu0.1447879-Aqu1.211549 | AqNLRD19 |
| AqNLR clade A | Contig12894 | Aqu1.211616-snap.25520 | AqNLRX50 |
| AqNLR clade A | Contig12897 | Aqu1.211634 | AqNLRX51 |
| AqNLR clade A | Contig12934 | Aqu1.211923 | AqNLRX52 |
| AqNLR clade A | Contig12934 | Aqu0.1448139 | AqNLRD20 |
| AqNLR clade A | Contig12950 | Aqu1.212035 | AqNLRX53 |
| AqNLR clade A | Contig12951 | Aqu0.1448222-snap.26675 | AqNLRX54 |
| AqNLR clade A | Contig12955 | Aqu1.212081-snap.26736 | AqNLRC7 |
| AqNLR clade A | Contig12956 | Aqu1.212082 | AqNLRC8 |
| AqNLR clade A | Contig12968 | Aqu1.212194-hom.g14686.t1 | AqNLRX55 |
| AqNLR clade A | Contig12968 | Aqu1.212193 | AqNLRD21 |
| AqNLR clade A | Contig12974 | Aqu1.212264 | AqNLRX56 |
| AqNLR clade A | Contig12983 | Aqu1.212346 | AqNLRD22 |
| AqNLR clade A | Contig12996 | Aqu1.212453 | AqNLRX57 |
| NACHT-WD40 | Contig13075 | hom.g15978.t1 | AqNWD40ii |
| AqNLR clade A | Contig13099 | Aqu1.213597 | AqNLRX58 |
| AqNLR clade A | Contig13103 | Aqu1.213662 | AqNLRX59 |
| AqNLR clade C | Contig13105 | aq_ka13105x00240 | AqNLRD9 |
| AqNLR clade C | Contig13105 | Contig13105:47,116-50,520 | AqNLRX2 |
| AqNLR clade C | Contig13105 | Aqu1.213698 | AqNLRX3 |
| AqNLR clade C | Contig13105 | Aqu1.213699-Aqu1.213700 | AqNLRX4 |
| AqNLR clade A | Contig13113 | Aqu1.213834 | AqNLRX60 |
| AqNLR clade A | Contig13117 | hom.g16620.t1 | AqNLRD23 |
| AqNLR clade A | Contig13133 | Aqu1.214051 | AqNLRD24 |
| AqNLR clade A | Contig13133 | hom.g16827.t1 | AqNLRD25 |
| AqNLR clade A | Contig13134 | Aqu1.214053-snap.31711 | AqNLRD26 |
| AqNLR clade A | Contig13140 | Aqu1.214168-snap.31947 | AqNLRX61 |
| AqNLR clade A | Contig13141 | Aqu1.214170 | AqNLRX62 |
| AqNLR clade A | Contig13142 | hom.g16965.t1 | AqNLRX63 |
| AqNLR clade A | Contig13142 | Aqu1.214186 | AqNLRX64 |
| AqNLR clade C | Contig13153 | aq_ka13153x00250 | AqNLRD6 |
| AqNLR clade C | Contig13153 | Aqu0.1449710 | AqNLRD7 |
| AqNLR clade C | Contig13153 | Aqu0.1449711 | AqNLRD8 |
| NACHT-WD40 | Contig13169 | hom.g17436.t1 | AqNWD40iii |
| AqNLR clade A | Contig13182 | hom.g17680.t1 | AqNLRC9 |
| AqNLR clade A | Contig13206 | Aqu1.215215 | AqNLRX65 |
| AqNLR clade A | Contig13206 | Aqu1.215210 | AqNLRD27 |
| AqNLR clade A | Contig13206 | hom.g18193.t1 | AqNLRD28 |
| AqNLR clade A | Contig13234 | hom.g18804.t1 | AqNLRX66 |
| AqNLR clade A | Contig13234 | Aqu1.215789 | AqNLRX67 |
| AqNLR clade A | Contig13234 | Aqu1.215792 | AqNLRX68 |
| AqNLR clade A | Contig13234 | Aqu1.215790 | AqNLRX69 |
| AqNLR clade A | Contig13234 | Aqu1.215785-snap.35932 | AqNLRC10 |
| AqNLR clade A | Contig13234 | Aqu1.215794 | AqNLRC11 |
| AqNLR clade A | Contig13245 | hom.g19060.t1 | AqNLRX70 |
| AqDEATH-NACHT | Contig13309 | Aqu1.217513 | AqDN8 |
| AqNLR clade A | Contig13332 | Aqu1.218194 | AqNLRX71 |
| AqNLR clade A | Contig13332 | Aqu1.218191 | AqNLRX72 |
| AqNLR clade A | Contig13332 | Aqu1.218192 | AqNLRX73 |
| AqNLR clade C | Contig13337 | Aqu1.218328 | AqNLRD3 |
| AqNLR clade A | Contig13346 | hom.g21949.t1 | AqNLRX74 |
| AqNLR clade A | Contig13346 | Aqu0.1452529-snap.42481 | AqNLRC12 |
| AqNLR clade A | Contig13346 | hom.g21925.t1-snap.42422 | AqNLRD29 |
| AqNLR clade A | Contig13354 | Aqu1.218980 | AqNLRX75 |
| AqNLR clade A | Contig13354 | Aqu1.218975 | AqNLRX76 |
| AqNLR clade A | Contig13354 | ab.g20734.t1 | AqNLRD30 |
| AqNLR clade A | Contig13358 | Aqu1.219134 | AqNLRX77 |
| AqNLR clade A | Contig13377 | Aqu1.219760 | AqNLRX78 |
| AqNLR clade A | Contig13377 | Aqu1.219777 | AqNLRX79 |
| AqNLR clade A | Contig13377 | Aqu1.219767-snap.44836 | AqNLRD31 |
| AqNLR clade A | Contig13379 | Aqu1.219814 | AqNLRX80 |
| AqNLR clade A | Contig13382 | Aqu0.1453371-hom.g23298.t1 | AqNLRC13 |
| AqNLR clade C | Contig13382 | aq_ka13382x00520 | AqNLRD4 |
| AqNLR clade C | Contig13382 | Aqu0.1453359 | AqNLRD5 |
| NACHT-WD40 | Contig13402 | hom.g24038.t1 | AqNWD40i |
| AqDEATH-NACHT | Contig13409 | Aqu1.220886 | AqDN10 |
| AqDEATH-NACHT | Contig13409 | Aqu1.220887 | AqDN11 |
| AqDEATH-NACHT | Contig13409 | Aqu1.220885 | AqDN9 |
| AqNLR clade A | Contig13412 | Aqu1.220996 | AqNLRX81 |
| AqNLR clade A | Contig13430 | hom.g25320.t1 | AqNLRX82 |
| AqNLR clade A | Contig13430 | Aqu1.221813 | AqNLRX83 |
| AqNLR clade A | Contig13430 | Aqu1.221810 | AqNLRC14 |
| AqNLR clade A | Contig13430 | hom.g25317.t1 | AqNLRC15 |
| AqNLR clade A | Contig13430 | snap.49468-Aqu1.221803 | AqNLRC16 |
| AqNLR clade A | Contig13430 | snap.49466-Aqu0.1454627 | AqNLRD32 |
| AqNLR clade B | Contig13467 | Aqu1.223871-Aqu0.1455876 | AqNLRX1 |
| AqNLR clade A | Contig13472 | Aqu1.224254 | AqNLRX84 |
| AqNLR clade A | Contig13473 | Aqu1.224303 | AqNLRX85 |
| AqNLR clade A | Contig13512 | Aqu1.228172 | AqNLRX86 |
| AqDEATH-NACHT | Contig13514 | Aqu1.228453 | AqDN12 |
| AqDEATH-NACHT | Contig13514 | Aqu1.228454 | AqDN13 |
| AqNLR clade A | Contig13518 | Aqu1.229088 | AqNLRX87 |
Note.—The list comprises 3 genes characterized by a WD40-NACHT domain combination, 13 genes characterized by a DEATH-NACHT domain combination, and 135 genes characterized by a NACHT-LRR domain combination. These latter 135 genes represent bona fide NLRs and include 48 characteristic tripartite NLR genes that also contain an N-terminal CARD or DEATH effector domain.
aGene model codes were obtained from the Joint Genome Institute Amphimedon queenslandica genome browser accessible at www.metazome.net/amphimedon (last accessed April 20, 2013) and include gene models derived from multiple different gene prediction algorithms and indicated as ab.g (Augustus ab initio); aq_ka (PASA and Augustus); snap (SNAP ab initio); hom (Augustus homology); Aqu0; Aqu1. Gene models that have been concatenated are separated by a hyphen.
bAqNLR nomenclature is based on the convention proposed by Ting, Harton et al. (2008) and accepted by the HUGO Gene Nomenclature Committee. Therefore, the name AqNLRD, for example, indicates a tripartite architecture of DEATH-NACHT-LRRs. Domain architecture: D, Death domain; C, Card domain; X, No N-terminal domain; N, NACHT domain.
cA transmembrane domain signal was detected at the N-terminus of this gene by TMHMM Server v.2.0 – CBS (available from: www.cbs.dtu.dk/services/TMHMM/, last accessed April 25, 2013).
FPhylogenetic analysis of the Amphimedon queenslandica NACHT domain-containing genes. The tree presented is a midpoint-rooted Bayesian tree, with branch lengths representing the number of substitutions per site. Posterior probabilities and ML bootstrap support values greater than 50% are indicated. AqNLRs form three discrete clusters: AqNLR clade A, AqNLR clade B, and AqNLR clade C. A small subset of the 122 AqNLR clade A genes was used in the final analysis. Summary domain architectures characteristic of each major clade are shown below the clade name. The intron/exon organizations of individual AqDEATH-NACHT clade, AqNLR clade B, and AqNLR clade C genes are depicted to the right of the tree. Exons are represented by boxes and are drawn to scale. Introns are represented as lines between exons; they range from 45 bp to 7 kbp in size but are all depicted as the same size (i.e., not to scale). Assembly gaps, represented by the line breaks, range from 386 bp to 1.975 kbp. Refer to supplementary file S3, Supplementary Material online, for the alignment used in the analyses.
FPhylogenetic analysis of the Amphimedon queenslandica NLR clade A gene expansion of 122 genes that make up the majority of the A. queenslandica NACHT domain complement. This unrooted Bayesian tree was constructed from an alignment of the A. queenslandica NACHT domains. Posterior probabilities for the major clades are indicated. Neither the presence nor the precise identity of an N-terminal domain—CARD (AqNLRC, shown in blue), DEATH (AqNLRD, shown in red), or absent (AqNLRX, shown in black)—appears to predict phylogenetic position of the gene. The alignment used to generate this phylogenetic tree is available upon request.
FArchitectures of potential NLR adaptor and signaling/effector proteins encoded by the Amphimedon queenslandica (sponge) genome, in comparison with those identified in other animal genomes. Data for mammals, sea urchin, and amphioxus have been copied directly from figure 3 in Messier-Solek et al. (2010). These proteins could be involved in NLR signaling pathways via homotypic interactions of the death-fold domains. The A. queenslandica (sponge) list is a conservative selection based on structural similarity to the mammalian ASC adaptor protein (Pyrin-CARD) and RIP2 kinase (Protein kinase-CARD). Not all A. queenslandica death-fold domain-containing gene models are depicted. The DEATH-UDP/PNP domain combination is a novel architecture identified in the A. queenslandica genome. Although proteins of this structure are not known to be involved in NLR signaling, they were included because the UDP/PNP domain has also been identified at the N-terminus of NLRs in Acropora digitifera (fig. 4) (Hamada et al. 2012).
FPhylogenetic analysis of the metazoan NLR genes constructed from an alignment of the NACHT domains (provided in supplementary file S4, Supplementary Material online). The tree presented is an unrooted Bayesian tree, with branch lengths representing the number of substitutions per site. Posterior probabilities and ML bootstrap support values greater than 50% are indicated for the clades of interest. The two major metazoan NLR clades are circled by dashed lines and are consistent in both Bayesian and ML trees (supplementary file S5, Supplementary Material online). N-terminal effector domain types are shown adjacent to the lineage in which they are observed. Amphimedon queenslandica lineage is in red; cnidarian lineages (Acropora digitifera and Nematostella vectensis) are in green; human NLRs are in blue; Capitella teleta NLRs are in orange; Strongylocentrotus purpuratus NLRs are in purple; mollusc NLRs are in dark pink; arthropod NLRs are in black. For clarity, only a subset of divergent representatives from each taxon was selected for inclusion in the alignment. The numbers to the right of the name of each taxon indicate the size of the NLR complement in that clade. Refer to supplementary file S5, Supplementary Material online, for the corresponding ML tree.