Philip J Kranzusch1. 1. Department of Microbiology, Harvard Medical School, Boston, MA 02115, USA; Department of Cancer Immunology and Virology, Dana-Farber Cancer Institute, Boston, MA 02115, USA; Parker Institute for Cancer Immunotherapy at Dana-Farber Cancer Institute, Boston, MA 02115, USA. Electronic address: philip_kranzusch@dfci.harvard.edu.
Abstract
Cyclic GMP-AMP synthase (cGAS) is a signaling enzyme in human cells that controls immune-sensing of cytosolic DNA. The recent discoveries of diverse structural homologs of cGAS in animals and bacteria reveal that cGAS-like signaling is surprisingly ancient and widespread in biology. Together with the Vibrio cholerae protein dinucleotide cyclase in Vibrio (DncV), cGAS and DncV homologs comprise a family of cGAS/DncV-like nucleotidyltransferase (CD-NTase) enzymes that synthesize noncanonical RNA signals including cyclic dinucleotides, cyclic trinucleotides, and linear oligonucleotides. Structural and biochemical breakthroughs provide a framework to understand how CD-NTase signaling allows cells to respond to changing environmental conditions. The CD-NTase family also includes uncharacterized human genes like MB21D2 and Mab21L1, highlighting emerging functions of cGAS-like signaling beyond innate immunity.
Cyclic GMP-AMP synthase (cGAS) is a signaling enzyme in human cells that controls immune-sensing of cytosolic DNA. The recent discoveries of diverse structural homologs ofcGAS in animals and bacteria reveal that cGAS-like signaling is surprisingly ancient and widespread in biology. Together with the Vibrio cholerae protein dinucleotide cyclase in Vibrio (DncV), cGAS and DncV homologs comprise a family ofcGAS/DncV-like nucleotidyltransferase (CD-NTase) enzymes that synthesize noncanonical RNA signals including cyclic dinucleotides, cyclic trinucleotides, and linear oligonucleotides. Structural and biochemical breakthroughs provide a framework to understand how CD-NTase signaling allows cells to respond to changing environmental conditions. The CD-NTase family also includes uncharacterized human genes like MB21D2 and Mab21L1, highlighting emerging functions ofcGAS-like signaling beyond innate immunity.
Cyclic GMP–AMP synthase (cGAS) is a vertebrate enzyme that catalyzes production of the nucleotide second messenger 2′–5′/3′–5′ cyclic GMP–AMP (2′3′-cGAMP) in response to cytosolic double-stranded DNA [1,2,3,4,5]. Since the discovery ofcGAS in late 2012 [4], cGAS-dependent sensing of mislocalized DNA has emerged as a central event in the cellular responses to pathogen replication, nuclear and mitochondrial stress, and DNA damage [6]. cGAS signaling is also an important component of antitumor immunity [7,8], and analogs of 2′3′-cGAMP are currently in development for cancer immunotherapy. The diverse roles for cGAS in cell biology and the potent therapeutic potential of 2′3′-cGAMP illustrate the importance of discovering and mechanistically understanding the enzymes that synthesize nucleotide signals in human cells.Recent findings overturn the idea that cGAS signaling is a specialized pathway unique to the innate immune response to cytosolic DNA and instead reveal a striking diversity ofcGAS-like enzymes in nature. Bioinformatic and structural findings identified homology between cGAS and the innate immunity enzyme oligo adenylate synthase 1 (OAS1) that senses cytosolic double-stranded RNA [3,4,9,10,11,12, 13, 14,15]. Likewise, structural analysis of the Vibrio cholerae enzyme dinucleotide cyclase in Vibrio (DncV), a 3′–5′/3′–5′ cGAMP (3′3′-cGAMP) synthase, revealed that DncV is a structural and functional homolog ofhumancGAS and that cGAS-like enzymes exist in the bacterial kingdom [16,17,18]. Thousands of functional cGAS homologs have now been identified in bacteria [19,20], revealing that cGAS and DncV are founding members of a large family ofcGAS/DncV-like nucleotidyltransferase (CD-NTase) enzymes. Bacterial CD-NTases comprise at least seven distinct protein clades including newly identified enzymes like Escherichia coli CdnE that produces 3′3′ cUMP–AMP and Enterobacter cloacae CdnD that produces the cyclic trinucleotide 3′3′3′ cAMP–AMP–GMP [19]. Eukaryotic CD-NTases include cGAS and OAS1, hundreds of metazoan enzymes, and nine proteins in the human genome of unknown function.The remarkable abundance ofCD-NTases encoded in animal and bacterial genomes provides a unique opportunity to understand the cellular roles of specialized nucleotide signals. Recent breakthroughs in biochemistry and structural biology establish a model ofcGAS and CD-NTase function, and demonstrate that CD-NTase enzymes synthesize diverse nucleotide signals including cyclic dinucleotides, cyclic trinucleotides, and oligonucleotides with an assortment of base and phosphodiester-linkage specificities. This review outlines the results that define CD-NTases as a new family of signaling enzymes, overviews conserved structural features that control CD-NTase function, and provides a framework for classification ofCD-NTase signaling pathways. Although individual CD-NTase enzymes synthesize distinct nucleotide signals and control independent downstream effector functions, conserved biochemical mechanisms unite activation and regulation of each signaling pathway. Advances in understanding ofCD-NTase biology provide an opportunity to explain the role of uncharacterized CD-NTase family members and potentially unveil new nucleotide signals in human cells.
cGAS and CD-NTase enzymes as chemical sensors
CD-NTase enzymes function as nucleic acid and chemical sensors, allowing animal and bacterial cells to respond to changing environmental conditions and rapidly initiate signaling responses. In the active state, CD-NTases catalyze multi-turnover synthesis ofcyclic and oligonucleotide signals built from common nucleotide triphosphate precursors [3,4,11,17,19]. This iterative process allows relatively few copies of an active CD-NTase enzyme to produce many nucleotide second messenger molecules and significantly amplify a signaling response (Figure 1
a). For example, activation of a few molecules ofcGAS bound to DNA results in multi-turnover production, with >750 molecules of 2′3′-cGAMP produced for each active enzyme [3]. 2′3′-cGAMP diffuses throughout the cell, and is recognized by the receptor Stimulator of Interferon Genes (STING) to initiate downstream IRF3/NF-κB immune responses and autophagy [4,6,21,22]. In addition to STING-dependent responses in the primary activated cell, 2′3′-cGAMP is highly stable [23] and can be transferred to neighboring cells in gap-junctions, released from tumors to activate immune cells, and packaged into viral particles [8,24, 25, 26]. The dramatic signal amplification following activation ofcGAS and CD-NTase enzymes is a key feature of how these pathways propagate responses.
Figure 1
CD-NTase signaling in animals and bacteria.
(a) Cartoon schematic of the human cGAS-STING signaling pathway. cGAS binds to cytosolic double-stranded DNA and is activated to produce the second messenger 2′3′-cGAMP. 2′3′-cGAMP is recognized by STING, and initiates a downstream IRF3/NF-κB-dependent immune response. (b) Overview of Type I and Type II CD-NTase Signaling. Type I CD-NTase pathways like human cGAS-STING rely on detection of an activating ligand (e.g. cytosolic DNA) depicted here as a star. In contrast, Type II CD-NTase pathways like Vibrio DncV-CapV are active in the absence of ligand, and may rely on depletion of a ligand (e.g. folate-like metabolites) that represses enzyme activation. In either case, activated CD-NTases catalyze multi-turnover synthesis of a nucleotide second messenger (e.g. 2′3′-cGAMP or 3′3′-cGAMP) to amplify signaling and initiate downstream effector responses.
CD-NTase signaling in animals and bacteria.(a) Cartoon schematic of the humancGAS-STING signaling pathway. cGAS binds to cytosolic double-stranded DNA and is activated to produce the second messenger 2′3′-cGAMP. 2′3′-cGAMP is recognized by STING, and initiates a downstream IRF3/NF-κB-dependent immune response. (b) Overview of Type I and Type II CD-NTase Signaling. Type I CD-NTase pathways like humancGAS-STING rely on detection of an activating ligand (e.g. cytosolic DNA) depicted here as a star. In contrast, Type II CD-NTase pathways like Vibrio DncV-CapV are active in the absence of ligand, and may rely on depletion of a ligand (e.g. folate-like metabolites) that represses enzyme activation. In either case, activated CD-NTases catalyze multi-turnover synthesis of a nucleotide second messenger (e.g. 2′3′-cGAMP or 3′3′-cGAMP) to amplify signaling and initiate downstream effector responses.HumancGAS and Vibrio DncV use distinct mechanisms to couple enzyme activation and ligand detection, illustrating the existence of at least two separate types ofCD-NTase pathways (Table 1
). Designated here as ‘Type I’ CD-NTase signaling, cGAS and OAS1 are inactive and unable to catalyze nucleotide signal synthesis until ligand binding induces a conformational change that remodels the active site into an enzymatically competent state (Figure 1b) [3,9,10,11,12,13,27,28,29]. In contrast, ‘Type II’ CD-NTases like Vibrio DncV and many bacterial CD-NTases are catalytically active in the absence of ligand and constitutively synthesize nucleotide signals in vitro [16,17,18,19]. Vibrio DncV is inhibited upon binding to folate-like compounds including 5-methyltetrahydrofolic acid [18], a metabolite required for nucleotide biosynthesis, suggesting that regulation of Type II enzymes can occur through ligand binding events that repress enzyme function (Figure 1b). Bacterial CD-NTase operons are frequently encoded within defense islands and have been hypothesized to be involved in phage resistance [19,20]. Although the exact function of Type II CD-NTase pathways in bacteria remains unknown [16,18,19,30], one current model is that these CD-NTases respond to conditions like phage infection by sensing depletion of intracellular metabolites or nutrient starvation.
Table 1
CD-NTase signaling pathways
CD-NTase enzyme
Classification
Ligand
Nucleotide signal
Effector
Downstream response
Signal degradation
Mammalian
cGAS
Type I
dsDNA
2′3′-cGAMP
STING
Type I interferon, NF-κB signaling
ENPP1, poxin, CdnP
OAS1/2/3
Type I
dsRNA
2′–5′ Oligoadenylate
RNase L
Translational arrest
PDE12, AKAP7, coronavirus NS2, rotavirus VP3
MB21D2
Type I?
?
?
?
Cancer resistance?
?
Mab21L1/2/3/4
Type I?
?
- - /?
?
Transcription control
?
MID49/51
?
- -
- -
DRP1
Mitochondrial fission
- - /?
Bacterial
V. cholerae DncV
Type II
Folate metabolites
3′3′-cGAMP
CapV
Nutrient sensing
cGAP1/2/3
E. coli CdnE
Type II
?
3′3′-cUMP–AMP
CapE
?
?
E. cloacae CdnD
Type II
?
3′3′-cAMP–AMP–GMP
?
?
Ring nucleases?
Summary of known CD-NTase signaling pathways including recognized ligand, product nucleotide signal, downstream receptor and effector response, and dedicated machinery for signal degradation. Undefined pathway components are designated as unknown (?) or likely to be not involved (--).
CD-NTase signaling pathwaysSummary of known CD-NTase signaling pathways including recognized ligand, product nucleotide signal, downstream receptor and effector response, and dedicated machinery for signal degradation. Undefined pathway components are designated as unknown (?) or likely to be not involved (--).
Structural anatomy of CD-NTase enzymes
All CD-NTases share a common structural architecture. The enzymatic domain is typically 250–400 amino acids (∼35–50 kDa) and adopts a caged tertiary structure formed by a long α-helix ‘spine’ that bridges an N-terminal nucleotidyltransferase (NTase) core and a C-terminal helix bundle (Figure 2
a) [3,9,10,11,17,19]. The extreme N-terminal and C-terminal residues pack against each other in the helix bundle, and the overall CD-NTase cage-like architecture creates a recessed pocket to catalyze nucleotide signal formation. The N-terminal NTase core is a common fold found in many diverse enzymes [14]. There is no one structural feature that uniquely identifies CD-NTases from classical NTase proteins, and the commonality of the core NTase fold is in part why bioinformatic identification ofCD-NTase homologs from primary sequence alone remains a challenge [20]. Whereas classical NTases like DNA polymerase-β adopt a flat, open conformation for recruitment of a nucleic acid template [31], the deep catalytic pocket ofCD-NTase enzymes allows amino-acid side chains extending from the active-site lid to control nucleotide base specificity and direct product formation (Figure 2b). Structures of the cGAS–DNA, OAS1–RNA and DncV–folate complexes reveal that both Type I and Type II CD-NTases utilize a shared ligand binding surface located behind the enzyme active site, along the α-helix spine (Figure 2c) [3,9,12,18,27,28,29]. In cGAS, a zinc-ribbon insertion within the ligand binding surface in part explains how this enzyme evolved to bind double-stranded DNA [9,10]. The shared location ofCD-NTase–ligand interactions indicates that diversification of this surface likely allows enzyme activity to be coupled to recognition of a wide variety of chemical signals beyond nucleic acids and folate-like compounds.
Figure 2
Structural anatomy of cGAS and CD-NTase enzymes.
(a) Structural overview of the human cGAS–DNA complex. CD-NTase enzymes adopt a conserved cage-like structure where a long α-helix ‘spine’ braces an N-terminal NTase core and C-terminal helix bundle. Note, a 1:1 unit of cGAS–DNA is shown for clarity, but the minimally active complex is a 2:2 unit. (b) The cage-like architecture creates a deep cavity for nucleotide substrate coordination and enzymatic catalysis. Unlike classical nucleotidyltransferases (NTases) like DNA polymerase-β that require a nucleic acid template (orange, right) to dictate nucleotide substrate specificity, amino-acid side chains in the CD-NTase active-site lid (orange, left) directly control nucleotide interactions. (c) Structural overview of the ligand binding surfaces in the cGAS–DNA, OAS–RNA, and DncV–folate complexes. Structures are shown rotated 180° from the orientation in (a). Activating and inactivating CD-NTase ligands (yellow) bind on the back of the enzyme along the α-helix spine. Insertion of a zinc-ribbon (Zn-Ribbon) along the ligand-binding surface in part explains evolution of cGAS specificity for DNA interactions. (d) CD-NTase active sites contain a highly conserved h[QT]GS, DE[h[DE]h, h[DE]h catalytic triad and distinct ‘acceptor’ and ‘donor’ nucleotide pockets. Each CD-NTase catalyzes a multi-step reaction that proceeds through a defined reaction path (e.g. the cGAS intermediate product is pppG[2′–5′]pA).
Structural anatomy ofcGAS and CD-NTase enzymes.(a) Structural overview of the humancGAS–DNA complex. CD-NTase enzymes adopt a conserved cage-like structure where a long α-helix ‘spine’ braces an N-terminal NTase core and C-terminal helix bundle. Note, a 1:1 unit ofcGAS–DNA is shown for clarity, but the minimally active complex is a 2:2 unit. (b) The cage-like architecture creates a deep cavity for nucleotide substrate coordination and enzymatic catalysis. Unlike classical nucleotidyltransferases (NTases) like DNA polymerase-β that require a nucleic acid template (orange, right) to dictate nucleotide substrate specificity, amino-acid side chains in the CD-NTase active-site lid (orange, left) directly control nucleotide interactions. (c) Structural overview of the ligand binding surfaces in the cGAS–DNA, OAS–RNA, and DncV–folate complexes. Structures are shown rotated 180° from the orientation in (a). Activating and inactivating CD-NTase ligands (yellow) bind on the back of the enzyme along the α-helix spine. Insertion of a zinc-ribbon (Zn-Ribbon) along the ligand-binding surface in part explains evolution ofcGAS specificity for DNA interactions. (d) CD-NTase active sites contain a highly conserved h[QT]GS, DE[h[DE]h, h[DE]h catalytic triad and distinct ‘acceptor’ and ‘donor’ nucleotide pockets. Each CD-NTase catalyzes a multi-step reaction that proceeds through a defined reaction path (e.g. the cGAS intermediate product is pppG[2′–5′]pA).The active site ofCD-NTase enzymes comprises a catalytic triad at the base and additional amino acids lining the lid that are required for nucleotide substrate coordination. All catalytically active CD-NTases encode a highly conserved h[QT]GS [X8–20] [DE]h[DE]h [X50–90] h[DE]h sequence (where h is a hydrophobic amino acid) that is located on a short α-helix (h[QT]GS) and two adjacent beta strands ([DE]h[DE]h, h[DE]h) of the nucleotidyltransferase core (Figure 2d) [3,9,10,11,14,17,19]. The active site ofCD-NTases contains discrete ‘donor’ and ‘acceptor’ nucleotide binding pockets. Once activated, CD-NTases bind nucleotide substrates and catalyze an SN2 nucleophilic substitution reaction that results in the transfer of the α-phosphate and nucleobase from the donor nucleotide onto the ribose 2′ or 3′ OH of the acceptor nucleotide. The reaction is divalent metal-dependent (Mg2+ or Mn2+), requiring two metals in the enzyme–substrate complex and likely a third metal transiently coordinated in the transition or product-bound states [3,9,17,32]. Metal ion A is coordinated by all three Asp/Glu side chains of the active-site triad ([DE]h[DE]h, h[DE]h; e.g. E225, D227, D319 in humancGAS) and participates in deprotonation and positioning of the 2′ or 3′ OH of the accepting nucleotide to attack the donor nucleotide α-phosphate. Metal ion B is coordinated by carboxyl groups from the first and second Asp/Glu side chains of the active-site triad ([DE]h[DE]h; e.g. E225, D227 in humancGAS) and the triphosphateof the donor nucleotide, and functions to position the donor nucleotide and help stabilize the reaction intermediate (Figure 2d) [3,9,17,18,19,27,28,29]. The donor pocket contains a highly conserved hydrophobic side chain (typically Tyr/Phe/Ile; e.g. Y436 in humancGAS) in the C-terminal helix bundle that stacks against the nucleobase, and sequence-specific hydrogen-bond interactions with the side chains in the active-site lid control donor and acceptor pocket nucleotide specificity (Figure 2d). Importantly, recent studies with cGAS have demonstrated that a space formed above the nucleotide donor pocket in CD-NTase enzymes is amenable to targeting by small-molecule therapeutics [33,34].A unique feature of the CD-NTase family is the use of a single site to catalyze nucleotide signal production through a sequential, multi-step reaction. The CD-NTase reaction is distinct from other nucleotide second messenger synthase families including GGDEF-like and DisA-like enzymes that require protein dimerization to produce 3′3′ c-di-GMP and 3′3′ c-di-AMP [35,36]. In the case ofcGAS, in the first reaction state ATP and GTP occupy the donor and acceptor nucleotide positions respectively, and AMP is transferred onto the 2′ OH ofGTP to form pppG[2′–5′]pA [3,9]. In the second reaction state, the intermediate molecule is reoriented such that the 3′ OH ofAMP attacks the guanine α-phosphate to produce a final cyclized c(G[2′–5′]pA[3′–5′]p) product 2′3′-cGAMP. Each CD-NTase catalyzes signal production in a specific reaction order. For example, Vibrio DncV catalyzes the opposite reaction path where GTP first occupies the donor pocket to produce a pppA[3′–5′]pG intermediate during 3′3′-cGAMP synthesis [17,18], while the E. coli CdnE enzyme that produces 3′3′ cUMP–AMP uses a reaction order more similar to cGAS that proceeds through a pppU[3′–5′]pA intermediate [19]. For CD-NTase enzymes like OAS1 that produce linear oligonucleotide signals, sequential transfer and repositioning of the donor nucleotide to the acceptor pocket allows extension of the oligonucleotide chain [13]. Open mechanistic questions include how repositioning ofCD-NTase reaction intermediates is controlled and how product chain-length is specified in the active site [15,37].Although CD-NTase activity requires a single monomeric active site, an emerging regulatory feature is the role of enzyme oligomerization. The minimally active cGAS–DNA complex is a 2:2 unit where two molecules ofcGAS embrace two molecules of double-stranded DNA [27,28,29]. Recent results highlight the role of further higher-order complex formation in cGAS regulation where DNA ‘laddering’ and a liquid–liquid phase separation process are required for robust enzyme activation [38,39]. Other complex oligomerization events include an enigmatic star-shaped pentameric complex formed by the human protein Male abnormal 21-Like 1 (Mab21L1) [40]. Eukaryotic CD-NTases often contain >100 amino acid residues of charged, unstructured sequence within N-terminal or C-terminal tails appended to the core enzymatic module (Figure 1a). The cGAS N-terminal tail has been implicated in DNA interactions, protein stability, localization, and higher-order complex formation, suggesting that these extensions are important to regulate enzyme activation [10,39,41, 42, 43]. In addition to oligomerization, CD-NTase function is further regulated by intrinsic species-specific substitutions that fine-tune enzyme activation [29,44], additional domains like tandem arrays of the CD-NTase module or fusion to ubiquitin-like proteins in some OAS homologs [45], and post-translational modifications [46, 47, 48].
Nucleotide signals and downstream effector functions
CD-NTase enzymes synthesize a remarkable array of nucleotide products to control downstream effector functions. Although the overall protein structure is conserved between distantly related animal and bacterial CD-NTases, a high degree of amino-acid sequence variability exists in the side chains extending from the active-site lid and in the ligand binding surface formed on the back of the enzyme behind the active site (Figure 2c,d). This variability affords CD-NTases the plasticity necessary to recognize diverse chemical ligands and to synthesize distinct nucleotide products with alternative base and phosphodiester-linkage specificities. Currently, known CD-NTase products include di-purine, di-pyrimidine, and hybrid purine–pyrimidine cyclic dinucleotides [2,3,4,9,16,19]. In addition to cyclic dinucleotides, CD-NTase enzymes also produce linear oligonucleotides and larger cyclic RNA species including cyclic trinucleotides (Table 1). Thus far, the products of bacterial CD-NTases contain canonical 3′–5′ phosphodiester linkages [16,17,18,19], while both known metazoan CD-NTase products contain noncanonical 2′–5′ linkages [1,2,3,5]. Given the simple reaction chemistry and enormous diversity ofCD-NTase enzymes in bacteria and metazoan systems, it is likely that many more CD-NTase products remain to be discovered.Once synthesized, CD-NTase products function as nucleotide signals to control downstream effector functions. In mammalian cells, 2′3′-cGAMP and 2′–5′ oligoadenylate are recognized by the protein receptors STING and RNase L to activate innate immune responses [21,49]. In bacteria, known receptors include cGAMP-activated phospholipase in Vibrio (CapV) and cUMP–AMP-activated phospholipase in E. coli (CapE), phospholipases that are selectively activated by 3′3′-cGAMP and 3′3′-cUMP–AMP [19,30]. Additionally, candidate receptors in bacteria include riboswitches [50,51], and a diverse array of nuclease, protease, and pore-forming effector proteins in CD-NTase-containing operons [20]. Interestingly, animal receptors including STING and the mouse oxidoreductase RECON recognize diverse bacterial CD-NTase products, indicating that cross-kingdom CD-NTase signaling may be an important component of host–microbe interactions [19,21,52]. Specific nuclease and phosphodiesterase enzymes have been identified that degrade CD-NTase product signals [23,53, 54, 55, 56, 57, 58], and it is likely that many CD-NTase pathways require dedicated mechanisms for signal inactivation in animal and bacterial cells (Table 1).CD-NTase family members sometimes lack the conserved catalytic triad and instead perform non-enzymatic functions. MID49 and MID51 are CD-NTases crucial for recruitment ofdynamin-related protein 1 to the mitochondria outer-membrane, and crystal structures ofMID51 bound to an adenosine diphosphate molecule demonstrate that the CD-NTase domain may have been coopted for nucleotide recognition [59,60]. Catalytically inactive OAS homologs also function as co-factors to modulate cellular RNA and DNA sensing [61], regulate mRNA translation [62], and even appear to have been co-opted as viral factors for host immune suppression [63]. Other CD-NTases, like homologs of the developmental regulatory protein Mab21L1 [40], exhibit partial loss of conserved active-site residues and it remains unclear if these factors are capable of synthesizing nucleotide signals.
Evolutionary origin and diversification of CD-NTase signaling
The conservation ofCD-NTase enzymes in both the animal and bacterial kingdoms provides a framework to understand pathway regulation and evolution. Structural and biochemical comparisons ofhumancGAS signaling with ancestrally related bacterial and primitive metazoan CD-NTase pathways have revealed key details in cGAS activation and downstream STING signaling [17,22,64, 65, 66, 67]. No CD-NTase enzymes have been identified in plants or single-cell eukaryotes, supporting an evolutionary model where animal CD-NTases may have originated through horizontal transfer (Figure 3
). Interestingly, the active-site lid ofOAS proteins contains an extended α-helix and shares structural homology with non-templated nucleotidyltransferases like CCA-adding enzyme and poly-A polymerase [45]. This extended α-helix is not conserved in the structures ofcGAS, DncV, or CdnE but is predicted to exist in the bacterial CD-NTase clade C01 [19], indicating that metazoan OAS1 and cGAS potentially represent separate evolutionary lineages ofCD-NTase enzymes. Bacterial CD-NTases are ubiquitously encoded on mobile genetic elements found in nearly every bacterial phylum, and thus may represent an ancestral reservoir of antiviral defense genes from which multiple immune-factors like humancGAS and OAS1 separately originated.
Figure 3
Conservation and evolution of the CD-NTase enzyme family.
(a, b) Overview of shared conservation of animal (top, magenta) and bacterial (bottom, green) CD-NTases. Distinct branches of individual animal CD-NTase proteins may have arisen from ancestral enzymes rooted in bacteria. Known structures are shown as schematics, and the hypothesized horizontal transfer events are indicated as a dashed line.
Conservation and evolution of the CD-NTase enzyme family.(a, b) Overview of shared conservation of animal (top, magenta) and bacterial (bottom, green) CD-NTases. Distinct branches of individual animal CD-NTase proteins may have arisen from ancestral enzymes rooted in bacteria. Known structures are shown as schematics, and the hypothesized horizontal transfer events are indicated as a dashed line.While the best understood roles for CD-NTase signaling are in innate immunity, the function of uncharacterized CD-NTases in the human genome represents an important opportunity for future discovery. Mab-21 domain-containing protein 2 (MB21D2), the human gene most closely related to cGAS, is predicted to have a complete active site and appears capable of catalytic function. MB21D2 is frequently mutated in lung cancers [68], supporting a possible role in tumor suppression and cancer resistance. Additionally, the Male abnormal 21 genes are CD-NTase family proteins conserved in nearly all metazoan animals. Human mutations in Mab21L1 and Mab21L2 cause severe birth defects including skeletal dysplasia, eye malformations, and intellectual disability [69]. This suggests possible new non-immune roles ofCD-NTase signaling pathways in early embryonic development.
Conclusion
Although only recently discovered, cGAS-dependent signaling has emerged as a key feature of many aspects ofhuman biology and disease [6]. The recognition ofcGAS as part of a diverse family ofCD-NTase enzymes in animals and bacteria now provides a foundation to define unified mechanisms that control nucleotide signaling and to address open questions in the field. What are the functions of diverse metazoan CD-NTases including uncharacterized enzymes like MB21D2 and Mab21L1 in the human genome? How can bioinformatic classification ofCD-NTase homologs be improved to predict nucleotide product signals from primary enzyme sequence? Which ligands are recognized by CD-NTases other than nucleic acid and folate-like metabolites? Are bacterial CD-NTase operons involved in defense against phage infection, and is there a direct evolutionary link between animal and bacterial antiviral immunity? Combined genetic, biochemical, and structural studies ofCD-NTases in both animal and bacterial cells provide a path to answer these questions and expand our understanding of the role of nucleotide signaling in human cells.
Conflict of interest statement
Nothing declared.
References and recommended reading
Papers of particular interest, published within the period of review, have been highlighted as:• of special interest•• of outstanding interest
Authors: Liudmila Andreeva; Björn Hiller; Dirk Kostrewa; Charlotte Lässig; Carina C de Oliveira Mann; David Jan Drexler; Andreas Maiser; Moritz Gaidt; Heinrich Leonhardt; Veit Hornung; Karl-Peter Hopfner Journal: Nature Date: 2017-09-13 Impact factor: 49.962
Authors: Wen Zhou; Aaron T Whiteley; Carina C de Oliveira Mann; Benjamin R Morehouse; Radosław P Nowak; Eric S Fischer; Nathanael S Gray; John J Mekalanos; Philip J Kranzusch Journal: Cell Date: 2018-07-12 Impact factor: 41.582
Authors: Oliver C Losón; Raymond Liu; Michael E Rome; Shuxia Meng; Jens T Kaiser; Shu-ou Shan; David C Chan Journal: Structure Date: 2014-02-06 Impact factor: 5.006
Authors: Sebastian Michalski; Carina C de Oliveira Mann; Che A Stafford; Gregor Witte; Joseph Bartho; Katja Lammens; Veit Hornung; Karl-Peter Hopfner Journal: Nature Date: 2020-09-10 Impact factor: 49.962