Literature DB >> 22661976

Evolutionary and structural perspectives of plant cyclic nucleotide-gated cation channels.

Alice K Zelman¹, Adam Dawe, Christoph Gehring, Gerald A Berkowitz.

Abstract

Ligand-gated cation channels are a frequent component of signaling cascades in eukaryotes. Eukaryotes contain numerous diverse gene families encoding ion channels, some of which are shared and some of which are unique to particular kingdoms. Among the many different types are cyclic nucleotide-gated channels (CNGCs). CNGCs are cation channels with varying degrees of ion conduction selectivity. They are implicated in numerous signaling pathways and permit diffusion of divalent and monovalent cations, including Ca(2+) and K(+). CNGCs are present in both plant and animal cells, typically in the plasma membrane; recent studies have also documented their presence in prokaryotes. All eukaryote CNGC polypeptides have a cyclic nucleotide-binding domain and a calmodulin binding domain as well as a six transmembrane/one pore tertiary structure. This review summarizes existing knowledge about the functional domains present in these cation-conducting channels, and considers the evidence indicating that plant and animal CNGCs evolved separately. Additionally, an amino acid motif that is only found in the phosphate binding cassette and hinge regions of plant CNGCs, and is present in all experimentally confirmed CNGCs but no other channels was identified. This CNGC-specific amino acid motif provides an additional diagnostic tool to identify plant CNGCs, and can increase confidence in the annotation of open reading frames in newly sequenced genomes as putative CNGCs. Conversely, the absence of the motif in some plant sequences currently identified as probable CNGCs may suggest that they are misannotated or protein fragments.

Entities: Chemical Disease Gene Species

Keywords: CNGC; Ca2+ signaling; amino acid motif; calmodulin binding domain; channel evolution; cyclic nucleotide; cyclic nucleotide-binding domain; ion channel

Year: 2012 PMID： 22661976 PMCID： PMC3362136 DOI： 10.3389/fpls.2012.00095

Source DB: PubMed Journal: Front Plant Sci ISSN： 1664-462X Impact factor: 5.753

Plant cyclic nucleotide-gated cation channels (CNGCs) comprise a large family of non-selective cation-conducting channels in plants. Of the 56 coding sequences identified at present as cation-conducting channels in the Arabidopsis thaliana genome, 20 are members of the CNGC family (Ward et al., 2009). CNGCs have been functionally characterized by expression in heterologous systems, or by analysis of cation-related phenotypes of mutant plants (typically A. thaliana) that have specific CNGC genes silenced (Talke et al., 2003; Ma et al., 2010). All relevant experimental evidence indicates that plant CNGCs are specifically localized to the plasma membrane, although CNGC20 may be targeted to the chloroplast envelope according to a putative annotation in UniProt (accessed August 28, 2011). Functional analyses of members of this channel family have associated many of them with inward K+ and Ca2+currents, and at least in several cases, Na+ conductance (Ma et al., 2010). It may be relevant to their function in plants that in one case, K+ conductance by a CNGC was restricted in the presence of millimolar concentrations of external Ca2+ (Leng et al., 2002). Ca2+ block of monovalent cation conductance is central to the function of animal CNGCs (Alam et al., 2007). CNGC function in plant biology may be more related to their ability to conduct Ca2+ rather than monovalent cations into plant cells. Abdel-Hamid et al. (2011) provide additional evidence consistent with this conjecture. Plant CNGCs were first identified as such by the presence of a cyclic nucleotide-binding domain (CNBD), and by their overall structural similarity to animal CNGCs (Talke et al., 2003). CNGCs have also recently been identified in prokaryotes (Nimigean et al., 2004; Kuo et al., 2007), although the prokaryote channel primary sequences show some notable differences in their functional domains (Cukkemane et al., 2011). The most conserved region of the CNGC CNBD is a phosphate binding cassette (PBC) which binds the sugar and phosphate moieties of the cyclic nucleotide (cNMP) ligand (Cukkemane et al., 2011). A “hinge” region adjacent to the PBC is also conserved and is thought to contribute to ligand binding efficacy and selectivity (Young and Krougliak, 2004). The hinge affects cAMP and cGMP activation by modifying the energy requirement of ligand binding and does so independently from other regions (i.e., structural motifs) of the animal CNBD (Young and Krougliak, 2004). Young and Krougliak (2004) modified residues in the hinge, affecting cNMP dependent gating of the channel without causing structural changes in other regions of the CNBD. Additional features of the CNBD of plant CNGCs will be discussed below. Like animal CNGCs, plant CNGCs are members of the “P-loop” superfamily of cation channels present in all prokaryotic and eukaryotic cells (Ward et al., 2009). P-loop channels arose early in evolution and their structure provides a basic form that has evolved to give rise to channels capable of conducting various cations, thereby fulfilling a broad range of functions in cells (Zhorov and Tikhonov, 2004). The core primary structure of a P-loop channel polypeptide (Figure 1A) can be represented by two α helices forming membrane-spanning domains (M1 and M2 in Figure 1A) surrounding a membrane re-entering pore loop (P-loop; “P” in Figure 1A). Alternatively, this core structure can also be formed by a polypeptide with six transmembrane (TM) regions (S1–S6 in Figure 1B); in this case the P-loop is present between the fifth (S5) and sixth (S6) transmembrane regions. The amino (N)- and carboxyl (C)-termini are both on the same side of the membrane, and typically cytosolic. The highly conserved P-loop contains a short α helix, a turn, and a random coil (Zhorov and Tikhonov, 2004). It dips into and then out of the membrane from the exterior side of the membrane, forming the ion conducting pathway across the membrane, and includes the amino acids that form an ion selective filter (Ward et al., 2009). The quaternary structure of a P-loop cation conduction pathway of a P-loop channel is formed by four P-loops assembling into an “inverted teepee” structure across the membrane (Figure 2). The M1–P-loop–M2 structure of the P-loop polypeptide is present in bacterial cation channels, while the six TM polypeptide is characteristic of many cation channels in eukaryotes (Hua et al., 2003b).

Figure 1

Figure 2

In animals, P-loop cation conduction pathways are known to be formed from four P-loops in an “inverted teepee” structure stretching across the membrane. Shown is an example of a pore structure generated from the corresponding regions of four A. thaliana CNGC2 polypeptides illustrating the “inverted teepee” conformation formed by four P-loops arranged symmetrically. Ca2+ ions are shown moving through the conduction pathway of the channel.

Representative structures of P-loop cation channels. Adapted from Hua et al., 2003a. (A) Representation of the core primary loop of a two transmembrane/one pore P-loop channel. (B) A six transmembrane/one pore P-loop plant CNGC channel polypeptide. P designates the P-loop of the channel. (C) A representation of the secondary structure of an animal Ca2+ channel, containing four repeats of the six transmembrane/one pore structure in one large polypeptide. In animals, P-loop cation conduction pathways are known to be formed from four P-loops in an “inverted teepee” structure stretching across the membrane. Shown is an example of a pore structure generated from the corresponding regions of four A. thaliana CNGC2 polypeptides illustrating the “inverted teepee” conformation formed by four P-loops arranged symmetrically. Ca2+ ions are shown moving through the conduction pathway of the channel. The super families of voltage-gated and ligand-gated channels share this six TM P-loop structure (Mäser et al., 2001). The fact that many channel families share this primary polypeptide structure with common folds is consistent with the concept that the number of ion channel proteins with specific functional properties is substantially larger than the possible folding patterns that could accommodate the generation of an ion-conducting pathway across membranes (Zhorov and Tikhonov, 2004). In animals, voltage-gated Na+, K+, and Ca2+-selective channels evolved from the rudimentary progenitor P-loop channels. In higher animals, voltage-gated Na+- and Ca2+-selective channels are formed by one large polypeptide that has four repeat sections; each section contains the six TM P-loop structure (Zhorov and Tikhonov, 2004). The secondary structure of a polypeptide that forms animal Ca2+-selective ion channels is shown in Figure 1C. Genes encoding canonical “four repeat P-loop” Ca2+-selective channels present in animal genomes are absent in plants. Potassium channels are formed by four separate polypeptides. In each case, four P-loops are aligned to form the ion-conducting pathway (Zheng and Zagotta, 2004). Figure 2 portrays the ion-conducting pathway formed by four P-loops of four different polypeptides (in different colors); also shown is the S6 TM domain of each of the four polypeptides. The configuration of voltage-gated channels is affected by the membrane potential (Em) such that the pore is physically occluded and therefore closed at some membrane potentials (blocking conductance), and open at other potentials. Evenly spaced, positively charged amino acids in the S4 domain of P-loop channels (represented as a “+” symbol on S4 in Figure 1B) act as a voltage sensor and provide a mechanism for the channel to sense and then respond to the Em by altering the gating of the pore (Hua et al., 2003b). Ligand-gated ion channels, including CNGCs, are also represented in the six TM P-loop superfamily. Conductance through the pore of channels can also be gated by the binding of ligands to the N- and C-terminal extensions into the cytosol (Biel, 2009). Animal CNGCs are gated allosterically by calmodulin (CaM) and cyclic nucleotides (cAMP and/or cGMP). CaM (in the presence of cytosolic Ca2+) binds to a CaM binding domain (CaMBD) of animal CNGCs at the N-terminus; CaM binding closes the CNGC channel thus preventing ion conductance. Cyclic nucleotides can bind at the C-terminus of both plant and animal CNGCs. CNMP binding to the CNBD of CNGC polypeptides activates the channel, opening it and thereby enabling cation conductance. The aforementioned S4 loop voltage sensor of voltage-gated channels is also present in plant CNGCs. Plant CNGCs have been demonstrated to function as channels with hyperpolarization-activated (i.e., inwardly rectified), voltage-dependent conductance (Hua et al., 2003b). There are two classes of animal channels whose conductance is regulated by cyclic nucleotides (Biel, 2009). Animal CNGCs are directly activated upon binding of cyclic nucleotides and their open probability does not change at varying Em. Hyperpolarization-activated cyclic nucleotide-gated channels (HCNs) conduct ions (i.e., are open) at hyperpolarizing Em, and binding of cyclic nucleotides increases the percentage of open channels at a given hyperpolarizing Em. Thus, conductance of these channels changes at varying Em as well as in response to changes in the level of cyclic nucleotides in the cytosol. In plants CNGCs may be incorrectly classified as solely “ligand-gated” channels without response to voltage when in fact they may function like animal HCNs. As indicated above, the quaternary structure of a member of the P-loop channel family is dependent on the assembly of four gene products in some cases (e.g., in animal and plant K+-selective voltage-gated channels) where only one P-loop “cassette” is encoded by the gene (i.e. S1–S6 with the P-loop between S5 and S6). It is therefore conceivable that plant CNGC channel complexes are formed by the assembly of four polypeptides, with each polypeptide corresponding to a structure similar to that shown in Figure 1B. Threading regions (S6 and pore loop, an example is shown in Figure 3 below) of plant CNGC coding sequences through the crystal structure of a quaternary P-loop channel indicates that the plant CNGC polypeptide may be capable of forming the tetrameric structure common to P-loop channels (Hua et al., 2003b). However, this model has not been verified experimentally. It should be noted that functional animal CNGC channel proteins are in all cases generated from such a tetrameric assembly of P-loop cassette polypeptides (Zheng and Zagotta, 2004) and that plant and animal CNGC polypeptides share a similar general S1–S6 topography. Thus, the aforementioned conjecture about the tetrameric structure of plant CNGCs is entirely consistent with the general concept of ion channel assembly and function.

Figure 3

Superimposed images (redrawn from Leng et al., . Threading of the A. thaliana CNGC2 S6 transmembrane helix and pore region through the KcsA structure identifies the residues “AND” as in the ion conducting pathway where the residues “GYG” (shown in pink) that form the ion selectivity filter of a K+-selective channel are positioned. This suggests that plant CNGCs may have a mechanism for cation conductance that is conserved in P-loop channels. In plants, four separate CNGC peptides are presumed to form a tetrameric structure that shares the “inverted teepee” quaternary structure of animal P-loop channels. The amino acids that line the pore selectivity filter of P-loop channels determine the relative conductance for Na+, K+, and Ca2+. This region of a plant CNGC (A. thaliana, CNGC2, isoform 2) is shown superimposed over the corresponding region of a K+-selective channel in Figure 3. The triplet “GYG” forms the selectivity filter of the K+ channel. The corresponding residues in CNGC2 are “AND,” with the negatively charged aspartic acid (D) residue positioned at the outer mouth of the pore. No other plant CNGC has a selectivity filter with an acidic D or glutamic acid (E) at this position in the pore. Animal voltage-gated Ca2+-selective channels have an “EEEE” ring at the extracellular end of the ion conducting pathway formed by the corresponding residues in each of the four P-loop repeats that form the functional channel. This E ring of Ca2+ channels has been experimentally verified to coordinate divalent Ca2+ ions as they enter the pore; and cause Ca2+ selectivity in animal channels (Tikhonov and Zhorov, 2011). The selectivity filter of plant CNGCs does not appear to be conserved among isoforms within this protein family, and is different from the conserved pore of animal CNGCs. However, some plant CNGCs have an acidic E two residues away from the D of CNGC2 (the D residue can be seen close to the outer mouth of the channel pore in Figure 3). The D in CNGC2, or alternatively the two E residues close to the outer mouth of the pore may conceivably provide a basis for Ca2+ conductance by some of the plant CNGCs. In A. thaliana, CNGCs 2, 4, 11, 12, 5, 6, and 9 have the E at the outer mouth of the pore and CNGCs 1, 2, 10, 11, 12, and 18 have been experimentally demonstrated (either directly or indirectly) to be involved in Ca2+ conductance (Leng et al., 1999; Ali et al., 2006; Frietsch et al., 2007; Urquhart et al., 2007; Guo et al., 2010). The speculative functional assignment of this acidic residue of the plant CNGC selectivity filter awaits further experimental confirmation. A defining structural aspect of plant CNGCs is their CNBD, which is unique for a number of reasons. The presence of CNGC coding sequences with CNBDs in animal and plant is intriguing from an evolutionary perspective. Unicellular fungi lack channels with such CNBDs (Talke et al., 2003). Considering that plants branched from animal progenitors on the evolutionary tree before the animal–fungus division, CNGCs might have evolved independently in these two lineages. Alternatively, fungi might have lost CNGC channel genes. Some aspects of the structural features of the plant CNBD correspond to that of their bacterial and animal cyclic-nucleotide-binding analogs. The CNBD of plant CNGCs overlaps with a CaMBD near the C-terminus, and CaM (in the presence of Ca2+) prevents cNMP activation of plant CNGCs (Hua et al., 2003a). However there is no corresponding CaMBD in bacterial CNGC sequences (Cukkemane et al., 2011). In animals the functional CaMBD is located distal of the CNBD, near the N-terminus (Ungerer et al., 2011). As mentioned above, CaM binds to and regulates the conductance of animal CNGCs and this is also the case in plant CNGCs. However, in plant CNGCs, the amino acids that form the CaM binding domain overlap with the region of the polypeptide that forms the CNBD (Figure 1B). The predicted three-dimensional structure of a plant CNGC CNBD (Figure 4A) has been generated by threading this portion of the polypeptide through the crystal structure of a bovine CNBD shown in Figure 4B (Hua et al., 2003a). Similar structures can be found in the literature (Chikayama et al., 2004; Bridges et al., 2005; Kaplan et al., 2007). The Protein Data Bank contains a solved structure for the calmodulin binding domain (CaMBD) of a small conductance Ca2+-activated potassium channel (SK2) in Rattus norvegicus (Wissmann et al., 2002; PDB record 1KKD). Threading the plant CNGC CaMBD sequences through the rat CaMBD structure may shed light on the possible convergent evolution of animal and plant CaMBDs present in ion channels.

Figure 4

Predicted three-dimensional structures of the plant CNGC CNBD. (A) A proposed three-dimensional structure of the A. thaliana CNGC2 CNBD generated by threading the sequence through the structure of the cAMP-binding domain of PDB record 1RGS, bovine cAMP-dependent protein kinase A (RIα; Su et al., 1995). (B) Three-dimensional structure of the CNBD of RIα that was used to create the model of the CNGC2 CNBD. The CNBD structures of CNGC2 and RIα share an N-terminal α-helix (αA) and a β-barrel (depicted as ribbons with arrows denoting antiparallel strands) that together bind the cNMP ligand. Republished from Hua et al. (2003a). (C) The tertiary structure of the A. thaliana CNGC6 CNBD (Chikayama et al., 2004; PDB record 1WGP). Amino acid residues comprising the hinge domain are highlighted in yellow; α-helices are depicted as purple cylinders. The β-barrel is shown as straight antiparallel purple ribbons. Two cGMP molecules are shown with the structure, one in the pocket formed between an αA helix and the β-barrel, and a second cGMP molecule (not bound) is depicted at the α-C helix adjacent to the hinge domain. It should be noted that plants contain two known domains, GAF and CNBD, which bind cyclic nucleotides. The GAF domain evolved billions of years ago and was first identified as a structural component of some cGMP phosphodiesterases and some light-sensing proteins in bacteria, yeast, humans, A. thaliana, and several species of cyanobacteria (Aravind and Ponting, 1997). Besides cAMP and cGMP, GAF domains can bind a number of other small molecules and are not specific to cNMPs (Bridges et al., 2005). The CNBD found in all plant CNGCs is also present in members of three plant K+-selective channel families; the “Shaker-like” (so named for the shaking phenotype of some animals with null mutations) KAT and AKT channel families, and the outwardly conducting channels SKOR and GORK. However, these K+-selective channels are affected by cyclic nucleotides in a very different manner than CNGCs. The voltage threshold for activation is shifted to more negative potentials in the presence of cGMP (experimentally verified for isoforms of the KAT and AKT families (Hoshi, 1995; Gaymard et al., 1996); the outward rectifiers have not been studied). This effectively means that cNMP elevation in the cytosol would reduce conductance through plant K+ channels that have CNBDs. Electrophysiological analysis of plant CNGCs has shown that the same elevation of cNMP activates the channel thereby increasing conductance (Leng et al., 1999, 2002; Lemtiri-Chlieh and Berkowitz, 2004; Ali et al., 2007). To date, no CNGC-specific motifs have been reported, however, Jackson et al. (2007) aligned different animal CNGCs, and in particular their PBC and hinge regions from which the following consensus motif can be derived: FGE-[IT]-[CIA]-LL-X(3,4)-[RK]-R-X-A-SV-X(11)-[SH]-[VRA]-[FY]-[HNQ]-X-[LV]-[LA] (the animal CNGC hinge sequence spans from the conserved serine (S) to the C-terminus of the motif). We noted that such a motif does not occur in plants but hypothesized that some of the functionally critical residues might be conserved. We therefore aligned A. thaliana CNGCs and identified putative PBCs and hinges. Within the putative PBCs we identified a conserved phenylalanine (F), a stabilizing glycine (G) and an acidic residue (either D or E) followed by two aliphatic leucines (L). We also observed that the putative hinge also contains an E, F, and one aliphatic residue much like in the animal CNGCs. The plant CNGC hinge occurs in between the CNBD and CaMBD regions (Figure 4C). We subsequently built a stringent motif ([LI]-X(2)-[GS]-X-[FYIVS]-X-G-X(0,1)-[DE]-LL-X(8,25)-[SA]-X(9)-[VLIT]-E-X-F-X-[IL]) that recognizes 20 A. thaliana CNGC proteins and no other sequences in A. thaliana. This subsequence includes the hinge domain and PBC (Figure 5). This conserved sequence differs from the animal CNBD; it occurs in between the CNBD and CaMBD while in animals the hinge occurs within the CNBD itself. Additionally it lacks, for example, the conserved proline (P) that was shown to affect gating in animal CNGCs (Jackson et al., 2007). Possible functional similarities may derive from the correspondence between the F in the center of the hinge in A. thaliana CNGCs and the F/Y in a bovine and a catfish CNGC (Young and Krougliak, 2004). The animal PBC region differs from the A. thaliana PBC conserved residues as well (Young and Krougliak, 2004). A scan of UniProt (Swiss-Prot/TrEMBL, The UniProt Consortium, 2012) including splice variants and excluding fragments using the ScanProSite tool (Sigrist et al., 2010) further revealed that the motif is restricted to land plants. Notably, the three predicted algal CNGC sequences from Chlamydomonas reinhardtii (Verret et al., 2010) lacked the motif. CNGC-related sequences matched against the 20 A. thaliana CNGCs using BLASTP (E-value <0.01) in UniProt for rice (Oryza sativa var. japonica) and the pin-cushion spikemoss (Selaginella moellendorffii) were then extracted, along with all hits for A. thaliana. Duplicates were removed and the remaining 53 sequences checked for the motif, before being used to construct a cladogram (Figure 6). The genes included in this cladogram are listed in Table A1 in Appendix. In this cladogram it can be seen that the majority of the rice and A. thaliana sequences tend to group together intraspecifically, suggesting numerous gene duplication events. The sequences across all three species partition into two clusters: a larger, more diverse cluster containing the bulk of the CNGC gene family, and a smaller cluster containing CNGC2, 4, 19, and 20. Of particular interest is that the only sequences to contain a conserved alanine (A) in place of the more frequent S in the motif are all from rice (denoted by blue diamonds) and all group together closely with CNGC19 and 20. The cladogram shown in Figure 7 is based upon only the conserved CNGC-specific motif. This phylogeny has striking congruence to previously published cladograms generated from aligning the full-length A. thaliana CNGC gene sequences (Mäser et al., 2001; Talke et al., 2003; Ward et al., 2009), as well as alignments of the pore domain amino acid sequences and the CNDB domain amino acid sequences (Kaplan et al., 2007). This supports the idea that the motif is informative for identifying and comparing putative CNGCs in other plants. Figure 7 also suggests that mutations in the motif have occurred concurrently with the branching of the CNGC gene family.

Figure 5

Figure 6

Evolutionary relationships within a conserved motif in plant CNGCs. Molecular phylogenetic analysis using the Maximum Likelihood method for 53 cyclic nucleotide-gated channel (CNGC) amino acid sequences from A. thaliana (green squares), S. moellendorffii (red triangles) and O. sativa var. japonica (blue circles and diamonds). All sequences contain a conserved S between the two motif subsequences except for the rice sequences labeled with a diamond. All A. thaliana sequences are from confirmed CNGCs while the remaining sequences from the other two species are from uncharacterized proteins unless stated otherwise as putative CNGCs. The percentage of trees in which the associated taxa clustered together is shown next to the branches. The sequences used to create the tree were obtained using BLASTP (Camacho et al., 2009) with an E-value <0.01 against the UniProt Knowledgebase (The UniProt Consortium, 2012) for Viridiplantae from the European Bioinformatics Institute website (http://www.ebi.ac.uk/Tools/sss/ncbiblast/) for all 20 annotated CNGCs from A. thaliana. All sequences from S. moellendorffii, O. sativa var. japonica, and A. thaliana were then extracted from the hits with exact duplicates (same identifiers, same sequence and different identifiers, same sequence) as well as annotated fragments being removed. The remaining sequences containing the motif ([LI]-X(2)-[GS]-X-[FYIVS]-X-G-X(0,1)-[DE]-LL-X(8,25)-[SA]-X(9)-[VLIT]-E-X-F-X-[IL]) were then aligned using MUSCLE (Edgar, 2004) and inspected for isoforms (near 100% identity) and unannotated fragments. The bootstrap consensus tree was generated using the JTT matrix-based model with discrete gamma distribution (five categories (+ G, parameter = 1.0524); Jones et al., 1992) by MEGA5 (Tamura et al., 2011) from 1000 bootstraps (Felsenstein, 1985). It is drawn to scale with branch lengths measured in the number of substitutions per site and all ambiguous positions removed for each sequence pair. It is based on 1082 positions in the final data set. Branches corresponding to less than 50% bootstrap replicates are collapsed and the percentage of replicate trees in which the associated taxa clustered together are shown next to the branches. The BioNJ method with the MCL distance matrix was used to generate the initial trees except where the common sites were less than 100 or a quarter of the total sites, in which case, maximum parsimony was used.

Table A1

Accessions for sequences used to construct Figures .

Key	Database	Species	Accession	Description/Gene Name
AT1G01340 CNGC10	TAIR	A. thaliana	AT1G01340	CNGC10
AT1G15990 CNGC7	TAIR	A. thaliana	AT1G15990	CNGC7
AT1G19780 CNGC8	TAIR	A. thaliana	AT1G19780	CNGC8
AT2G23980 CNGC6	TAIR	A. thaliana	AT2G23980	CNGC6
AT2G24610 CNGC14	TAIR	A. thaliana	AT2G24610	CNGC14
AT2G28260 CNGC15	TAIR	A. thaliana	AT2G28260	CNGC15
AT2G46430 CNGC3	TAIR	A. thaliana	AT2G46430	CNGC3
AT2G46440 CNGC11	TAIR	A. thaliana	AT2G46440	CNGC11
AT2G46450 CNGC12	TAIR	A. thaliana	AT2G46450	CNGC12
AT3G17690 CNGC19	TAIR	A. thaliana	AT3G17690	CNGC19
AT3G17700 CNGC20	TAIR	A. thaliana	AT3G17700	CNGC20
AT3G48010 CNGC16	TAIR	A. thaliana	AT3G48010	CNGC16
AT4G01010 CNGC13	TAIR	A. thaliana	AT4G01010	CNGC13
AT4G30360 CNGC17	TAIR	A. thaliana	AT4G30360	CNGC17
AT4G30560 CNGC9	TAIR	A. thaliana	AT4G30560	CNGC9
AT5G14870 CNGC18	TAIR	A. thaliana	AT5G14870	CNGC18
AT5G15410 CNGC2	TAIR	A. thaliana	AT5G15410	CNGC2
AT5G53130 CNGC1	TAIR	A. thaliana	AT5G53130	CNGC1
AT5G54250 CNGC4	TAIR	A. thaliana	AT5G54250	CNGC4
AT5G57940 CNGC5	TAIR	A. thaliana	AT5G57940	CNGC5
TRB9F3N3 ORYSJ	UniProt	O. sativa var. japonica	B9F3N3_ORYSJ	Putative uncharacterized protein
TRB9F4U7 ORYSJ	UniProt	O. sativa var. japonica	B9F4U7_ORYSJ	Putative uncharacterized protein
TRB9FKH0 ORYSJ	UniProt	O. sativa var. japonica	B9FKH0_ORYSJ	Putative uncharacterized protein
TRB9FRX2 ORYSJ	UniProt	O. sativa var. japonica	B9FRX2_ORYSJ	Putative uncharacterized protein
TRB9FTK2 ORYSJ	UniProt	O. sativa var. japonica	TRB9FTK2_ORYSJ	Putative uncharacterized protein
TRB9FTK3 ORYSJ	UniProt	O. sativa var. japonica	B9FTK3_ORYSJ	Putative uncharacterized protein
TRB9FZX0 ORYSJ	UniProt	O. sativa var. japonica	B9FZX0_ORYSJ	Putative uncharacterized protein
TRB9GC07 ORYSJ	UniProt	O. sativa var. japonica	B9GC07_ORYSJ	Putative uncharacterized protein
TRB9GD29 ORYSJ	UniProt	O. sativa var. japonica	B9GD29_ORYSJ	Putative uncharacterized protein
TRQ0DQ06 ORYSJ	UniProt	O. sativa var. japonica	Q0DQ06_ORYSJ	Os03g0646300 protein (CNGC7-like)
TRQ0DWX3 ORYSJ	UniProt	O. sativa var. japonica	Q0DWX3_ORYSJ	Os02g0789100 protein (CNGC17-like)
TRQ0IPW9 ORYSJ	UniProt	O. sativa var. japonica	Q0IPW9_ORYSJ	Os12g0163000 protein (CNGC18-like)
TRQ10G24 ORYSJ putative CNGC9	UniProt	O. sativa var. japonica	Q10G24_ORYSJ	Putative CNGC9
TRQ2QRA3 ORYSJ putative CNGC7	UniProt	O. sativa var. japonica	Q2QRA3_ORYSJ	Putative CNGC7
TRQ2QXB8 ORYSJ putative CNGC14	UniProt	O. sativa var. japonica	Q2QXB8_ORYSJ	Putative CNGC14
TRQ5ZAU3 ORYSJ	UniProt	O. sativa var. japonica	Q5ZAU3_ORYSJ	Os01g0782700 protein (CNGC4-like)
TRQ60EI8 ORYSJ	UniProt	O. sativa var. japonica	Q60EI8_ORYSJ	Os05g0502000 protein (CNGC4-like)
TRQ652Z5 ORYSJ	UniProt	O. sativa var. japonica	Q652Z5_ORYSJ	Os06g0527100 protein (CNGC1-like)
TRQ653S0 ORYSJ	UniProt	O. sativa var. japonica	Q653S0_ORYSJ	Os09g0558300 protein (ATCNGC14-like)
TRQ69KL7 ORYSJ Putative CNGC	UniProt	O. sativa var. japonica	Q69KL7_ORYSJ	Putative CNGC
TRQ6K4N3 ORYSJ CNGC-like	UniProt	O. sativa var. japonica	Q6K4N3_ORYSJ	CNGC-like
TRQ6K6P3 ORYSJ	UniProt	O. sativa var. japonica	Q6K6P3_ORYSJ	Os02g0627700 protein (CNGC15-like)
TRQ6ZG24 ORYSJ putative CNGC	UniProt	O. sativa var. japonica	Q6ZG24_ORYSJ	putative CNGC
TRQ7X641 ORYSJ	UniProt	O. sativa var. japonica	Q7X641_ORYSJ	OSJNBa0033G05.7 protein (CNGC-like)
TRA3B9H5 ORYSJ	UniProt	O. sativa var. japonica	A3B9H5_ORYSJ	Putative uncharacterized protein
TRQ69T65 ORYSJ	UniProt	O. sativa var. japonica	Q69T65_ORYSJ	Putative cyclic nucleotide-regulated ion channel
TRB9F3H5 ORYSJ	UniProt	O. sativa var. japonica	B9F3H5_ORYSJ	Putative uncharacterized protein
TRQ6ZHE3 ORYSJ	UniProt	O. sativa var. japonica	Q6ZHE3_ORYSJ	Putative cyclic nucleotide-binding transporter 1
TRD8QYM9 SELML	UniProt	S. moellendorffii	D8QYM9_SELML	Putative uncharacterized protein
TRD8RB40 SELML	UniProt	S. moellendorffii	D8RB40_SELML	Putative uncharacterized protein
TRD8SCK7 SELML	UniProt	S. moellendorffii	D8SCK7_SELML	Putative uncharacterized protein
TRD8SGT0 SELML	UniProt	S. moellendorffii	D8SGT0_SELML	Putative uncharacterized protein
TRD8T0W5 SELML	UniProt	S. moellendorffii	D8T0W5_SELML	Putative uncharacterized protein

Database accessions and gene names were derived from UniProt (.

Figure 7

Molecular phylogenetic history of the CNGC-specific conserved motif ([LI]-X(2)-[GS]-X-[FYIVS]-X-G-X(0,1)-[DE]-LL-X(8,25)-[SA]-X(9)-[VLIT]-E-X-F-X-[IL]) in . The analysis used the Maximum Likelihood method based on the JTT matrix-based model with a discrete gamma distribution [five categories (+G, parameter = 1.0852); Jones et al., 1992] in MEGA5 (Tamura et al., 2011) with 1000 bootstraps (Felsenstein, 1985). The initial heuristic trees were obtained automatically using the BioNJ method with MCL distance, unless the number of common sites was less than 100 or one quarter of the total number of sites, in which case the maximum parsimony method was used. The tree with the highest log likelihood (−771.1381) is shown, drawn to scale with branch lengths measured in the number of substitutions per site. All ambiguous positions were removed for each sequence pair with a total of 52 positions in the final dataset of 20 sequences.

The . The diagram at the top portrays three regions of plant CNGCs: the six transmembrane domains (TM), a CNBD containing a PBC and the hinge which presumably make direct contact with the cyclic nucleotide, and a calmodulin binding domain (CaMBD) toward the C-terminus. The CNGC-specific amino acid motif is shown below the cartoon. In the square brackets “[]” are the amino acids allowed in this position of the motif, “X” represents any amino acid and the round brackets “()” indicate the number of amino acids. The red amino acids in the PBC and the hinge are conserved in both animal and plant CNBDs based on comparison with Jackson et al. (2007). Below the CNGC-specific motif is an alignment of the motif regions of the 20 A. thaliana CNGCs. For each of the A. thaliana CNGCs, the residues at the N- and C-termini are indicated to the left of the motif. The green shaded box demarcates the putative PBC region of the CNBDs (corresponding to the animal CNBD PBC). The orange shaded box marks the presumed hinge region. Residues in white highlighted in blue indicate >90% identity among the 20 A. thaliana CNGCs (in this case “N” is counted as a dissimilarity) indicating a high level of conservation. A “*” above the alignment marks a position with 100% conservation between the A. thaliana sequences shown. Residues in red denote conservation with the animal CNBD alignment generated by Jackson et al. (2007). Note, there is experimental evidence identifying the residues which interact with cyclic nucleotides in animal CNBDs according to Jackson et al., but there is no empirical confirmation of which residues in plant CNBDs bind cyclic nucleotides. The alignment of the A. thaliana CNGC-specific motif was generated by the MEGA5 program and ClustalW (Saitou and Nei, 1987). Evolutionary relationships within a conserved motif in plant CNGCs. Molecular phylogenetic analysis using the Maximum Likelihood method for 53 cyclic nucleotide-gated channel (CNGC) amino acid sequences from A. thaliana (green squares), S. moellendorffii (red triangles) and O. sativa var. japonica (blue circles and diamonds). All sequences contain a conserved S between the two motif subsequences except for the rice sequences labeled with a diamond. All A. thaliana sequences are from confirmed CNGCs while the remaining sequences from the other two species are from uncharacterized proteins unless stated otherwise as putative CNGCs. The percentage of trees in which the associated taxa clustered together is shown next to the branches. The sequences used to create the tree were obtained using BLASTP (Camacho et al., 2009) with an E-value <0.01 against the UniProt Knowledgebase (The UniProt Consortium, 2012) for Viridiplantae from the European Bioinformatics Institute website (http://www.ebi.ac.uk/Tools/sss/ncbiblast/) for all 20 annotated CNGCs from A. thaliana. All sequences from S. moellendorffii, O. sativa var. japonica, and A. thaliana were then extracted from the hits with exact duplicates (same identifiers, same sequence and different identifiers, same sequence) as well as annotated fragments being removed. The remaining sequences containing the motif ([LI]-X(2)-[GS]-X-[FYIVS]-X-G-X(0,1)-[DE]-LL-X(8,25)-[SA]-X(9)-[VLIT]-E-X-F-X-[IL]) were then aligned using MUSCLE (Edgar, 2004) and inspected for isoforms (near 100% identity) and unannotated fragments. The bootstrap consensus tree was generated using the JTT matrix-based model with discrete gamma distribution (five categories (+ G, parameter = 1.0524); Jones et al., 1992) by MEGA5 (Tamura et al., 2011) from 1000 bootstraps (Felsenstein, 1985). It is drawn to scale with branch lengths measured in the number of substitutions per site and all ambiguous positions removed for each sequence pair. It is based on 1082 positions in the final data set. Branches corresponding to less than 50% bootstrap replicates are collapsed and the percentage of replicate trees in which the associated taxa clustered together are shown next to the branches. The BioNJ method with the MCL distance matrix was used to generate the initial trees except where the common sites were less than 100 or a quarter of the total sites, in which case, maximum parsimony was used. Molecular phylogenetic history of the CNGC-specific conserved motif ([LI]-X(2)-[GS]-X-[FYIVS]-X-G-X(0,1)-[DE]-LL-X(8,25)-[SA]-X(9)-[VLIT]-E-X-F-X-[IL]) in . The analysis used the Maximum Likelihood method based on the JTT matrix-based model with a discrete gamma distribution [five categories (+G, parameter = 1.0852); Jones et al., 1992] in MEGA5 (Tamura et al., 2011) with 1000 bootstraps (Felsenstein, 1985). The initial heuristic trees were obtained automatically using the BioNJ method with MCL distance, unless the number of common sites was less than 100 or one quarter of the total number of sites, in which case the maximum parsimony method was used. The tree with the highest log likelihood (−771.1381) is shown, drawn to scale with branch lengths measured in the number of substitutions per site. All ambiguous positions were removed for each sequence pair with a total of 52 positions in the final dataset of 20 sequences. Interestingly, the motif recognizes all currently annotated CNGCs in higher plants only and identifies new unannotated candidate CNGCs. Two CNGC isoforms in A. thaliana [Swiss-Prot: Q94AS9-2 (an isoform of CNGC4) and Q8GWD2-2 (an isoform of CNG12)] lack the motif. However, these sequences are not experimentally confirmed and are truncated. The few annotated CNGCs in other plants (e.g., barley and poplar) that lack the motif – even if we relax it by allowing one or several mismatches – may suggest that they are either misannotated, or fragments that show insufficient coverage and similarity to be confidently classified as CNGCs. Furthermore, the motif recognizes neither any non-plant sequences nor any algal sequences. We therefore contend that the motif may serve as a supporting method for identifying CNGCs in other land plant species. Scanning the rice and pin-cushion spikemoss genomes with the ScanProsite tool yields putative CNGC homologs and uncharacterized proteins containing the motif (Figure 6). Since CNBDs are found in other proteins besides the CNGC ion channels in the A. thaliana proteome (as noted above, i.e., the K+-selective channels), it may be useful to develop motifs that can distinguish these sequences. Several recent publications predict the number of CNGCs present in the genomes of organisms ranging from brown and red algae to green algae and multicellular plants. For example, Ward et al. (2009) predict that no CNGCs exist in the green alga C. reinhardtii. In contrast, Verret et al. (2010) predict that C. reinhardtii has three CNGCs. These three sequences do not contain the land plant-specific CNGC motif. The fact that these authors disagree on the number of CNGCs attests to the need for alternative ways to assess the likelihood that a protein for which the primary sequence is known is likely to function as a CNGC. The motif we present here does not recognize any non-plant sequences or any algal sequences, and thus lends weight to the view that the C. reinhardtii proteins identified as putative CNGCs by Verret et al. (2010) may contain algal-specific CNGC motifs and we argue that higher plants may have evolved a specific cNMP-binding domain. A functional motif-based means of recognizing CNGC proteins will also contribute to our understanding the evolutionary history of this gene family including the possible role of genome and gene duplication events (Seoighe and Gehring, 2004). Analyzing orthologous CNGCs in the new Whole Genome Shotgun sequence of Arabidopsis lyrata, a cress whose genome is approximately 50% larger than that of A. thaliana, is the next obvious step in learning more about the evolutionary effects of ion channel gene duplication and subsequent sub-functionalization and changing expression patterns. These phenomena have already been partly addressed (Mäser et al., 2001), and it was remarked that CNGC12 has a CaMBD that was the least conserved compared to the other CNGCs this group (Group I). CNGC12 was the last gene in a sequence of three syntenic CNGCs: CNGC3 and CNG11 precede it. CNGC12 is likely the least homologous indicating that gene duplication allowed the original function of the Group I gene CNGC3 to be maintained, easing selection pressure on the duplicate copies and perhaps allowing for sub-functionalization.

Concluding Remarks

The pore and CNBD sequences of the plant cyclic nucleotide-gated channels differ from the pore and CNBDs in other plant ion channel families. They are also different from the pore and CNBD regions of animal CNGCs. Phylogenies based on alignments of the pore, CNBD, or full-length CNGC sequences are similar, showing that the evolution of these functional domains preceded the expansion of plant CNGC genes that occurred after the split between green algae and higher plants, and that the split between dicots and monocots occurred after the advent of the higher plant CNGCs. A protein motif specific to plant CNGCs is located within the CNBD. An alignment of just this motif region from the CNGCs produces a cladogram that is almost identical to published phylogenies of the full-length sequences. This motif may therefore be used to refine tentative annotations and to find novel CNGCs in newly sequenced plant genomes. Whether this motif represents differences in the function or regulation of plant CNGCs has yet to be evaluated.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

42 in total

Review 1. Cyclic nucleotide-gated channels in plants.

Authors: Boaz Kaplan; Tal Sherman; Hillel Fromm
Journal: FEBS Lett Date: 2007-02-16 Impact factor: 4.124

2. The structure of the potassium channel: molecular basis of K+ conduction and selectivity.

Authors: D A Doyle; J Morais Cabral; R A Pfuetzner; A Kuo; J M Gulbis; S L Cohen; B T Chait; R MacKinnon
Journal: Science Date: 1998-04-03 Impact factor: 47.728

3. Regulation of voltage dependence of the KAT1 channel by intracellular factors.

Authors: T Hoshi
Journal: J Gen Physiol Date: 1995-03 Impact factor: 4.086

4. Death don't have no mercy and neither does calcium: Arabidopsis CYCLIC NUCLEOTIDE GATED CHANNEL2 and innate immunity.

Authors: Rashid Ali; Wei Ma; Fouad Lemtiri-Chlieh; Dimitrios Tsaltas; Qiang Leng; Susannne von Bodman; Gerald A Berkowitz
Journal: Plant Cell Date: 2007-03-23 Impact factor: 11.277

5. Distinct structural determinants of efficacy and sensitivity in the ligand-binding domain of cyclic nucleotide-gated channels.

Authors: Edgar C Young; Natalia Krougliak
Journal: J Biol Chem Date: 2003-10-31 Impact factor: 5.157

6. Structural insight into Ca2+ specificity in tetrameric cation channels.

Authors: Amer Alam; Ning Shi; Youxing Jiang
Journal: Proc Natl Acad Sci U S A Date: 2007-09-18 Impact factor: 11.205

7. A cyclic nucleotide-gated channel is essential for polarized tip growth of pollen.

Authors: Sabine Frietsch; Yong-Fei Wang; Chris Sladek; Lisbeth R Poulsen; Shawn M Romanowsky; Julian I Schroeder; Jeffrey F Harper
Journal: Proc Natl Acad Sci U S A Date: 2007-08-28 Impact factor: 11.205

8. Cyclic nucleotide binding proteins in the Arabidopsis thaliana and Oryza sativa genomes.

Authors: Dave Bridges; Marie E Fraser; Greg B G Moorhead
Journal: BMC Bioinformatics Date: 2005-01-11 Impact factor: 3.169

9. MUSCLE: a multiple sequence alignment method with reduced time and space complexity.

Authors: Robert C Edgar
Journal: BMC Bioinformatics Date: 2004-08-19 Impact factor: 3.169

10. A cyclic nucleotide modulated prokaryotic K+ channel.

Authors: Crina M Nimigean; Tania Shane; Christopher Miller
Journal: J Gen Physiol Date: 2004-09 Impact factor: 4.086

41 in total

1. Genome-wide identification and functional analysis of the cyclic nucleotide-gated channel gene family in Chinese cabbage.

Authors: Qingqing Li; Siqiang Yang; Jie Ren; Xueling Ye; Xin Jiang; Zhiyong Liu
Journal: 3 Biotech Date: 2019-03-01 Impact factor: 2.406

Review 2. Evolution and Structural Characteristics of Plant Voltage-Gated K⁺ Channels.

Authors: Timothy Jegla; Gregory Busey; Sarah M Assmann
Journal: Plant Cell Date: 2018-11-01 Impact factor: 11.277

3. Phytosulfokine Regulates Growth in Arabidopsis through a Response Module at the Plasma Membrane That Includes CYCLIC NUCLEOTIDE-GATED CHANNEL17, H+-ATPase, and BAK1.

Authors: Friederike Ladwig; Renate I Dahlke; Nils Stührwohldt; Jens Hartmann; Klaus Harter; Margret Sauter
Journal: Plant Cell Date: 2015-06-12 Impact factor: 11.277

4. Cyclic nucleotide gated channel genes (CNGCs) in Rosaceae: genome-wide annotation, evolution and the roles on Valsa canker resistance.

Authors: Xia Mao; Chao Wang; Qianqian Lv; Yuzhen Tian; Dongdong Wang; Baihong Chen; Juan Mao; Wenfang Li; Mingyu Chu; Cunwu Zuo
Journal: Plant Cell Rep Date: 2021-09-04 Impact factor: 4.570

5. A cyclic nucleotide-gated channel (CNGC16) in pollen is critical for stress tolerance in pollen reproductive development.

Authors: Meral Tunc-Ozdemir; Chong Tang; Maryam Rahmati Ishka; Elizabeth Brown; Norman R Groves; Candace T Myers; Claudia Rato; Lisbeth R Poulsen; Stephen McDowell; Gad Miller; Ron Mittler; Jeffrey F Harper
Journal: Plant Physiol Date: 2012-12-12 Impact factor: 8.340

6. Multiple Calmodulin-Binding Sites Positively and Negatively Regulate Arabidopsis CYCLIC NUCLEOTIDE-GATED CHANNEL12.

Authors: Thomas A DeFalco; Christopher B Marshall; Kim Munro; Hong-Gu Kang; Wolfgang Moeder; Mitsuhiko Ikura; Wayne A Snedden; Keiko Yoshioka
Journal: Plant Cell Date: 2016-06-22 Impact factor: 11.277

7. The Arabidopsis thaliana proteome harbors undiscovered multi-domain molecules with functional guanylyl cyclase catalytic centers.

Authors: Aloysius Wong; Chris Gehring
Journal: Cell Commun Signal Date: 2013-07-08 Impact factor: 5.712

Evolutionary and structural perspectives of plant cyclic nucleotide-gated cation channels.

Concluding Remarks

Conflict of Interest Statement

Review 1. Cyclic nucleotide-gated channels in plants.

2. The structure of the potassium channel: molecular basis of K+ conduction and selectivity.

3. Regulation of voltage dependence of the KAT1 channel by intracellular factors.

4. Death don't have no mercy and neither does calcium: Arabidopsis CYCLIC NUCLEOTIDE GATED CHANNEL2 and innate immunity.

5. Distinct structural determinants of efficacy and sensitivity in the ligand-binding domain of cyclic nucleotide-gated channels.

6. Structural insight into Ca2+ specificity in tetrameric cation channels.

7. A cyclic nucleotide-gated channel is essential for polarized tip growth of pollen.

8. Cyclic nucleotide binding proteins in the Arabidopsis thaliana and Oryza sativa genomes.

9. MUSCLE: a multiple sequence alignment method with reduced time and space complexity.

10. A cyclic nucleotide modulated prokaryotic K+ channel.

1. Genome-wide identification and functional analysis of the cyclic nucleotide-gated channel gene family in Chinese cabbage.

Review 2. Evolution and Structural Characteristics of Plant Voltage-Gated K⁺ Channels.

3. Phytosulfokine Regulates Growth in Arabidopsis through a Response Module at the Plasma Membrane That Includes CYCLIC NUCLEOTIDE-GATED CHANNEL17, H+-ATPase, and BAK1.

4. Cyclic nucleotide gated channel genes (CNGCs) in Rosaceae: genome-wide annotation, evolution and the roles on Valsa canker resistance.

5. A cyclic nucleotide-gated channel (CNGC16) in pollen is critical for stress tolerance in pollen reproductive development.

6. Multiple Calmodulin-Binding Sites Positively and Negatively Regulate Arabidopsis CYCLIC NUCLEOTIDE-GATED CHANNEL12.

7. The Arabidopsis thaliana proteome harbors undiscovered multi-domain molecules with functional guanylyl cyclase catalytic centers.

Review 8. Conserved Functional Motifs and Homology Modeling to Predict Hidden Moonlighting Functional Sites.

9. Molecular Targets and Biological Functions of cAMP Signaling in Arabidopsis.

10. Ion and lipid signaling in apical growth-a dynamic machinery responding to extracellular cues.

Concluding Remarks

Conflict of Interest Statement

Review 1. Cyclic nucleotide-gated channels in plants.

Review 2. Evolution and Structural Characteristics of Plant Voltage-Gated K+ Channels.

Review 8. Conserved Functional Motifs and Homology Modeling to Predict Hidden Moonlighting Functional Sites.

Review 2. Evolution and Structural Characteristics of Plant Voltage-Gated K⁺ Channels.