Literature DB >> 34764137

The evolution of metazoan shelterin.

Logan R Myler1, Charles G Kinzig1, Nanda K Sasi1, George Zakusilo1, Sarah W Cai1, Titia de Lange1.   

Abstract

The mammalian telomeric shelterin complex-comprised of TRF1, TRF2, Rap1, TIN2, TPP1, and POT1-blocks the DNA damage response at chromosome ends and interacts with telomerase and the CST complex to regulate telomere length. The evolutionary origins of shelterin are unclear, partly because unicellular organisms have distinct telomeric proteins. Here, we describe the evolution of metazoan shelterin, showing that TRF1 emerged in vertebrates upon duplication of a TRF2-like ancestor. TRF1 and TRF2 diverged rapidly during vertebrate evolution through the acquisition of new domains and interacting factors. Vertebrate shelterin is also distinguished by the presence of an HJRL domain in the split C-terminal OB fold of POT1, whereas invertebrate POT1s carry inserts of variable nature. Importantly, the data reveal that, apart from the primate and rodent POT1 orthologs, all metazoan POT1s are predicted to have a fourth OB fold at their N termini. Therefore, we propose that POT1 arose from a four-OB-fold ancestor, most likely an RPA70-like protein. This analysis provides insights into the biology of shelterin and its evolution from ancestral telomeric DNA-binding proteins.
© 2021 Myler et al.; Published by Cold Spring Harbor Laboratory Press.

Entities:  

Keywords:  Apollo; MCPH1; NBS1; PNUTS; POT1; Rap1; TIN2; TPP1; TRF1; TRF2; evolution; shelterin; telomerase; telomere

Mesh:

Substances:

Year:  2021        PMID: 34764137      PMCID: PMC8653790          DOI: 10.1101/gad.348835.121

Source DB:  PubMed          Journal:  Genes Dev        ISSN: 0890-9369            Impact factor:   11.361


Telomeres have been studied in a wide variety of eukaryotes, revealing that some aspects of telomere biology are highly conserved, whereas others have diverged considerably (de Lange 2009; Lue 2018). A highly conserved aspect of telomere biology is the telomerase reverse transcriptase (TERT), which is similar in all eukaryotes (Wu et al. 2017; Nguyen et al. 2019) and evolved from the reverse transcriptases of ancient retroelements (Nakamura et al. 1997; de Lange 2015). Likewise, telomerase template RNAs, which dictate the synthesis of TTAGGG-like repeats onto the 3′ ends of metazoan chromosomes, share several conserved structural elements in highly divergent eukaryotes (Wu et al. 2017; Nguyen et al. 2019; Ghanim et al. 2021; He et al. 2021). Finally, the Ctc1–Stn1–Ten1 (CST) complex responsible for polymerase α/primase-mediated fill-in at the 5′ ends of telomeres is also highly conserved and probably evolved from a single-stranded (ss) DNA-binding complex related to replication protein A (RPA) (Lue 2018; Lim et al. 2020). In contrast, the proteins that associate with the duplex telomeric repeat array have diversified more, despite the presence of TTAGGG-like repeats (including variants such as TTGGGG, TTTTGGGG, and TTAGG) at the telomeres of most eukaryotes. Telomeric proteins have been identified in mammals, plants, fungi, ciliates, trypanosomes, and other species. Apart from two common DNA-binding domains (the oligonucleotide/oligosaccharide-binding [OB] fold and the Myb/SANT domain, referred to here as Myb), the telomere-specific TRFH (telomeric repeat factor homology) domain, and the BRCT (BRCA1 C-terminal) domain (de Lange 2018), the telomeric protein complexes appear to lack conserved features. Because the species studied are from disparate clades, it is difficult to discern how telomeric proteins evolved from common ancestors. Therefore, we set out to trace the evolution of shelterin, first identified in mammals (de Lange 2005), through the vertebrate and invertebrate metazoan lineages. Shelterin binds to the TTAGGG repeats that denote the telomeres of most metazoans, where it ensures that telomere ends are not recognized and/or processed by pathways that act on double-strand breaks (DSBs) (de Lange 2018). Shelterin also regulates the extension of telomeric sequences by telomerase and facilitates the semiconservative replication of telomeric DNA (Hockemeyer and Collins 2015). Despite its myriad functions, shelterin is comprised of just six distinct proteins: TRF1, TRF2, Rap1, TIN2, TPP1, and POT1. The architecture of shelterin and the function of its subunits have been studied extensively in human and mouse cells (O'Sullivan and Karlseder 2010; Lazzerini-Denchi and Sfeir 2016; de Lange 2018). In these organisms, shelterin contains subunits that bind to the double-stranded (ds) telomeric DNA and subunits that engage telomeric DNA in single-stranded (ss) form, a conserved feature of telomere ends. Two paralogous homodimeric DNA-binding proteins—TRF1 and TRF2—anchor shelterin onto the ds telomeric TTAGGG repeats. Both TRF1 and TRF2 bind TIN2, which promotes the stability of TRF1 and TRF2 at telomeres. TRF2 (but not TRF1) binds to Rap1. The ss telomeric repeats are bound by POT1, which forms a heterodimer with TPP1. TIN2 binds to TPP1, thereby connecting POT1/TPP1 to TRF1 and TRF2. Repression of the DNA damage response at telomeres is primarily achieved through TRF2 and POT1. TRF2 prevents ATM kinase activation at chromosome ends and ensures that the nonhomologous end joining pathway does not ligate one telomere to another (van Steensel et al. 1998; Karlseder et al. 1999; Smogorzewska et al. 2002; Celli et al. 2006). TRF2 blocks these pathways through the formation of the t-loop structure, in which the 3′ telomeric overhang strand invades the duplex part of the telomere (Griffith et al. 1999; Doksani et al. 2013). Although TRF1 is a paralog of TRF2, it is not directly involved in the protection of telomeres. Rather, TRF1 facilitates the duplication of telomeres by preventing the stalling/arrest of the replication fork in the TTAGGG repeat array (Martínez et al. 2009; Sfeir et al. 2009). ATR signaling, the second DNA damage signaling pathway that endangers telomeres, is repressed by POT1, which excludes the ssDNA sensor RPA from telomeric DNA (Denchi and de Lange 2007; Guo et al. 2007; Gong and de Lange 2010; Kratz and de Lange 2018). Shelterin has a dual role in the maintenance of telomeric DNA. Telomerase-mediated telomere elongation requires the recruitment of the enzyme by TPP1, which interacts directly with the TERT component (Nandakumar et al. 2012; Zhong et al. 2012; Hockemeyer and Collins 2015; Lim and Cech 2021). In addition, shelterin regulates telomere length homeostasis in a process that is thought to involve a negative feedback loop whereby telomerase is inhibited at hyperelongated telomeres (Hockemeyer and Collins 2015). The main players in this negative feedback loop are TRF1, TIN2, POT1, and the CST complex, but the mechanism of telomerase inhibition is unclear. Human and mouse shelterin complexes are similar in many respects, allowing cross-species extrapolation of functional studies. However, some differences are notable. For instance, mice and rats have two POT1 proteins (POT1a and POT1b) rather than the single POT1 protein present in primate shelterin (Hockemeyer et al. 2006; Wu et al. 2006). Mouse and human shelterin also have distinct shelterin accessory factors, which make an important contribution to shelterin function (Diotti and Loayza 2011). Here, we describe how metazoan shelterin evolved. We report that the shelterin complexes of invertebrates contain only one telomeric repeat binding factor (TRF) with features of TRF2, including sequences related to the iDDR (inhibitor of the DNA damage response) (Okamoto et al. 2013). Therefore, we infer that TRF1 was formed through a duplication of a TRF2-like gene near the emergence of the vertebrate lineage. Possibly, the replication function specific to vertebrate TRF1 is executed by the TRF2-like shelterin subunit of invertebrates. Once duplicated, TRF1 and TRF2 diverged rapidly during vertebrate evolution, resulting in their carrying distinct N termini and having different interacting partners. Another major change occurred in the C-terminal OB fold of POT1, which is split by a Holliday junction resolvase-like (HJRL) domain in vertebrates but has a different insert in other metazoans. Finally, we report that the three-OB-fold architecture of primate and rodent POT1 proteins is an exception. Most metazoans, including most mammals, have POT1 orthologs with an additional N-terminal domain that is predicted to be an OB fold, which we term OB-N. This suggests that POT1 first evolved as a four-OB-fold protein, potentially from an RPA70-like ancestor. These findings should lend impetus to unraveling the perplexing evolution of telomeric protein complexes in other eukaryotes, potentially revealing how early eukaryotes solved the problems inherent to linear chromosomes.

Results

TRF1 arose in vertebrates

The TRF1 and TRF2 subunits of shelterin (Supplemental Fig. S1A) are encoded by paralogous genes that are annotated in most vertebrate genomes, but the presence of invertebrate TRF1/2 orthologs, defined as proteins containing a TRFH domain paired with a C-terminal TRF1/2-like Myb domain, has not been examined. To catalog TRF1 and/or TRF2 paralogs in all branches of metazoans, we identified genes annotated as TRF1- or TRF2-like. Additional candidate genes were uncovered through targeted genome BLASTp searches with the TRFH and Myb domains of human TRF2 and TRF1. Only genes with a C-terminal Myb domain paired with a TRFH domain were considered TRF1/2 candidates (Supplemental Fig. S1A) because TRFH-mediated homodimerization is required for the telomere binding activity of TRF1 and TRF2 (Bianchi et al. 1997; van Steensel and de Lange 1997; van Steensel et al. 1998). Although our analysis was limited by the availability of high-quality genome sequences, we identified TRF genes in 16 invertebrates and compared these with the TRF orthologs of 11 vertebrates (Supplemental Figs. S1, S2). We excluded arthropods and nematodes because they have noncanonical telomeric repeat DNA, often lack telomerase, and have a distinct set of telomeric DNA-binding proteins (Fulcher et al. 2014). A sequence identities and similarities (SIAS) tool (http://imed.med.ucm.es/Tools/sias.html) indicated that the Myb domains of invertebrate TRF orthologs had, on average, 50% ± 10% amino acid identity to the Myb domains of vertebrate TRF1/2 orthologs. The TRFH domains of invertebrate TRFs appeared to be less conserved, showing 36% ± 10% identity compared with vertebrate TRF1/2. The Myb domains of all candidate TRF1/2 proteins have the conserved K-D-R/K-W/Y-R motif in helix 3, which is critical for recognition of the telomeric TTAGGG repeats (Supplemental Fig. S1B; Court et al. 2005). Because it has been reported that TRF1 evolved the ability to bind to telomeric DNA in the therian lineage (Kappei et al. 2017), we determined whether two TRF1 proteins from nontherian vertebrates (the lizard Anolis carolinensis and the platypus Ornithorhynchus anatinus) have the ability to localize to telomeres (Supplemental Fig. S3A,B). Both proteins localized to telomeres in mouse embryonic fibroblasts (MEFs), and their Myb domains allowed mouse TRF1 (missing its Myb domain) to localize telomeres (Supplemental Fig. S3B). Importantly, the telomeric localization of the lizard and platypus TRF1s was abolished by the equivalent of the R425V mutation, which inactivates DNA binding by the human Myb domain (Supplemental Fig. S3B; Fairall et al. 2001). Based on these data, we consider it likely that all vertebrate TRF1 proteins can bind telomeric DNA. Each invertebrate species examined contained only a single candidate TRF gene. The absence of a second TRF gene in each invertebrate was further supported by BLASTp searches with the Myb domain of the TRF gene from that species. Bayesian phylogenetic analysis segregated the TRFH domains of all TRF genes according to phylum, as expected (Fig. 1A). However, the TRF1 and TRF2 orthologs of vertebrates clustered together independently of the phylogenetic tree, indicating that TRF1 and TRF2 diverged from a common ancestor within the vertebrate lineage (Fig. 1A, arrowhead). This implies that the ancestral gene of TRF1 and TRF2 was duplicated around the time that vertebrates diverged from the cephalochordates. Because we have been unable to find TRF orthologs in tunicates, the exact point in evolution when the duplication took place remains to be determined.
Figure 1.

TRF1 arose in vertebrates from a duplication of a TRF2-like gene. (A) Phylogenetic tree of the TRFH domains of TRF orthologs from representative metazoans, including invertebrate TRF orthologs (black) and vertebrate TRF2 (red) and TRF1 (blue) orthologs. The arrowhead indicates the predicted gene duplication event. The scale bar indicates branch length in substitutions per sequence site. Node probabilities are indicated. (B) MUSCLE alignment of iDDR domains from the indicated TRF2 (red) or TRF (black) orthologs. Conserved residues from the iDDR domain are indicated in black (hydrophobic or aromatic residues), red (acidic residues), blue (basic residues), and green (V/S/T). (*) Identical residues, (:) conserved substitutions. (C) Sequence logo of conserved residues in the iDDR domains from B. “N” represents sequence position starting from the left. Bits indicate relative frequency of each amino acid at that position. (D) Comparison of the 60 amino acids immediately N-terminal of the Myb domains in the indicated TRF1/TRF2 orthologs. (Red) Acidic residues, (black) tryptophan (W), (green) V/S/T positions that approximate the motif in C. (E) Graphs showing the local pI of the sequence immediately upstream of the Myb domain of the indicated TRF1/2 orthologs. (F) Model of vertebrate and invertebrate shelterin highlighting the TRF2-like nature of the single TRF in invertebrate shelterin.

TRF1 arose in vertebrates from a duplication of a TRF2-like gene. (A) Phylogenetic tree of the TRFH domains of TRF orthologs from representative metazoans, including invertebrate TRF orthologs (black) and vertebrate TRF2 (red) and TRF1 (blue) orthologs. The arrowhead indicates the predicted gene duplication event. The scale bar indicates branch length in substitutions per sequence site. Node probabilities are indicated. (B) MUSCLE alignment of iDDR domains from the indicated TRF2 (red) or TRF (black) orthologs. Conserved residues from the iDDR domain are indicated in black (hydrophobic or aromatic residues), red (acidic residues), blue (basic residues), and green (V/S/T). (*) Identical residues, (:) conserved substitutions. (C) Sequence logo of conserved residues in the iDDR domains from B. “N” represents sequence position starting from the left. Bits indicate relative frequency of each amino acid at that position. (D) Comparison of the 60 amino acids immediately N-terminal of the Myb domains in the indicated TRF1/TRF2 orthologs. (Red) Acidic residues, (black) tryptophan (W), (green) V/S/T positions that approximate the motif in C. (E) Graphs showing the local pI of the sequence immediately upstream of the Myb domain of the indicated TRF1/2 orthologs. (F) Model of vertebrate and invertebrate shelterin highlighting the TRF2-like nature of the single TRF in invertebrate shelterin. To determine whether the common ancestor of TRF1 and TRF2 was more like TRF1 or more like TRF2, we examined the invertebrate orthologs for features that distinguish the two vertebrate paralogs. TRF1 and TRF2 differ in their N terminus, which is acidic for TRF1 and basic for TRF2. However, as detailed below (see Fig. 2A,B), these domains evolved rapidly in vertebrates, making them unsuitable for this analysis. Instead, we focused on another discriminating feature: the TRF2-specific iDDR domain, which resides immediately upstream of the Myb domain (Okamoto et al. 2013). The iDDR of human TRF2 opposes RNF168-mediated histone ubiquitination to block the accumulation of the DNA damage factor 53BP1 at dysfunctional telomeres (Okamoto et al. 2013). Alignment of the iDDRs of vertebrate TRF2 proteins revealed a short motif rich in acidic residues surrounding a conserved tryptophan (Fig. 1B,C). This iDDR motif was also present in the cephalochordates, suggesting that their single TRF protein is similar to TRF2 (Fig. 1B).
Figure 2.

Emergence of the N-terminal domains in vertebrate TRFs and the Y/H-x-L-x-P motifs in human TRF2-binding partners. (A) Right-aligned Basic/GAR domains of vertebrate TRF2 orthologs, with glycines (G) and arginines (R) indicated in blue. (*) LANA-related motif residues. (B) Right-aligned acidic domains of vertebrate TRF1 orthologs with acidic (D/E) residues indicated in red. (C) Cladogram showing the acquisition of the Y/H-x-L-x-P motif in shelterin accessory proteins that interact with the TRFH domain of TRF2.

Emergence of the N-terminal domains in vertebrate TRFs and the Y/H-x-L-x-P motifs in human TRF2-binding partners. (A) Right-aligned Basic/GAR domains of vertebrate TRF2 orthologs, with glycines (G) and arginines (R) indicated in blue. (*) LANA-related motif residues. (B) Right-aligned acidic domains of vertebrate TRF1 orthologs with acidic (D/E) residues indicated in red. (C) Cladogram showing the acquisition of the Y/H-x-L-x-P motif in shelterin accessory proteins that interact with the TRFH domain of TRF2. The iDDR sequence motif was not sufficiently conserved to definitively determine whether it was present in the achordate TRF proteins (Fig. 1D). However, all vertebrate TRF2 orthologs were distinct from TRF1 in that they possessed clusters of glutamate and aspartate residues within 60 amino acids upstream of the Myb domain (Fig. 1D). Calculation of the local isoelectric point (pI) upstream of the Myb domains of the vertebrate TRFs revealed a markedly low local pI immediately upstream of the Myb domain in TRF2 but not in TRF1 (Fig. 1E). The iDDR motif and the low local pI were found in the cephalochordates’ TRFs, further arguing that their TRF is TRF2-like (Fig. 1C,E). Consistent with the invertebrate TRFs displaying TRF2-like features, the cnidarians had an iDDR-like region upstream of the Myb domain of their TRFs, featuring clusters of acidic residues flanking a tryptophan residue and a pronounced drop in the local pI (Fig. 1D,E). The echinoderms also had these features, although some lacked the tryptophan associated with acidic residues, and the low local pI segment was farther upstream of the Myb domain. Only the mollusks’ TRFs did not have an obvious acidic region upstream of the Myb domain, suggesting that the iDDR-like feature may have been lost or repositioned in this clade. Based on the presence of an iDDR-like region upstream of the Myb domain of the cephalochordates, cnidarians, and echinoderms, we propose that the single TRF gene in invertebrates resembles TRF2 more than TRF1 (Fig. 1F). Consistent with this view, these TRFs also feature a highly conserved proline near the end of the TRFH domain that is specific to vertebrate TRF2 paralogs (Supplemental Fig. S2A, arrow). Finally, vertebrate TRF1 and TRF2 proteins differ significantly in length, primarily due to TRF1 having a shorter hinge region between the TRFH and Myb domains (Supplemental Fig. S2B). In agreement with their similarity to TRF2, the invertebrate TRFs are significantly longer than the vertebrate TRF1 orthologs and have a longer hinge region (Supplemental Fig. S2B). In conclusion, except for the vertebrates, all metazoans have a shelterin that contains a single double-stranded (ds) telomeric DNA-binding protein with features of TRF2. In vertebrates, shelterin gained a second ds telomeric DNA-binding protein, TRF1, that likely arose through the duplication of the gene encoding the ancestral metazoan TRF2-like protein. TRF1 presumably lost the iDDR-like region and the Rap1- and TIN2-binding sites from the hinge region, possibly resulting in its smaller size. We assume that, as in mammals, the TTAGGG repeats in invertebrate telomeres challenge replication forks, potentially requiring a shelterin-based solution to this problem. It is possible that the single TRF of invertebrate shelterin has this function and that, in vertebrates, TRF1 retained the ability to promote the duplication of the telomeric DNA while TRF2 lost this feature.

TRF1 and TRF2 evolved distinct N-terminal domains in mammals

The single TRF of cephalochordates possesses neither of the N-terminal domains typical of mammalian TRF1 and TRF2 (Fig. 2A), indicating that the N termini of TRF1 and TRF2 evolved during vertebrate evolution, as others have suggested (Poulet et al. 2012). The basic N terminus of mammalian TRF2 is rich in arginine and glycine residues and is referred to as the Basic or the GAR (glycine arginine-rich) domain (Supplemental Fig. S1A). The TRF2 paralogs of placental mammals express two forms of TRF2 that differ in the length of the GAR/Basic domain (Fig. 2A; Timashev and de Lange 2020). Marsupials and monotremes lack the long form of TRF2 but have a GAR/Basic domain very similar to that of the placental mammals (Fig. 2A). Each of these GAR/Basic domain sequences features the LANA-related motif that is thought to allow the N terminus of TRF2 to bind to the acidic patch of the nucleosome (Konishi et al. 2016). However, most other vertebrate TRF2 paralogs lack a region that can be qualified as a GAR or Basic domain, or the N-terminal domain is very short (Fig. 2A). The acidic domain of TRF1 is also a mammalian idiosyncrasy (Fig. 2B). With the exception of a few amphibians, nonmammalian vertebrates contain TRF1 orthologs with an N terminus that either is very short or lacks the preponderance of acidic amino acids typical of mammalian TRF1 orthologs. This is consistent with the previously noted lack of an acidic N terminus in the TRF1 orthologs of Gallus gallus, Xenopus laevis, and Danio rerio (Bassham et al. 1998; De Rycker et al. 2003; Xie et al. 2011). Thus, the acidic and basic nature of the TRF1 and TRF2 N termini, respectively, likely evolved around the time that mammals diverged from other vertebrates.

Emergence of TRF2 TRFH-binding partners

The TRFH domain of human TRF2 has a binding site for several shelterin accessory factors, including Apollo/SNM1B, SLX4, and NBS1 (Demuth et al. 2004; Lenain et al. 2006; van Overbeek and de Lange 2006; Chen et al. 2008). These factors use a Y/H-x-L-x-P motif to bind to the same site in TRF2 (surrounding F162 in human TRF2), which is present only in vertebrates (Supplemental Fig. S2A). TRF1 has a similar binding site surrounding F142 that is required for its interaction with TIN2 (Supplemental Fig. S2A; Chen et al. 2008). The TRF2-interacting partners evolved their Y/H-x-L-x-P motifs at different points in vertebrate evolution (Fig. 2C; Supplemental Fig. S4). In the PNUTS phosphatase (Kim et al. 2009) and Apollo/SNM1B orthologs, this motif is present in amniotes; in PNUTS, the acquisition of the TRF2-binding motif may have occurred earlier, since some amphibians and fish have a closely related Y-x-I-x-P sequence (Supplemental Fig. S4). The second addition to the TRF2 arsenal is NBS1, which evolved its TRF2-binding motif in placental mammals. The Y/H-x-L-x-P motif in MCPH1 evolved in the primates, with the TRF2-binding motif in SLX4 being the last to appear, arising in the haplorrhines (simians and tarsiers) (Fig. 2C). Since detailed information on TRF2-binding proteins largely derives from studies of mouse and human TRF2, it is possible that other TRF2 interactions evolved in nonmammalian vertebrates through acquisition of Y/H-x-L-x-P motifs in proteins that are distinct from the mammalian TRF2 interactors.

Evolution of the TRF2–Rap1 interface

Rap1 has long been known to be one of the most conserved shelterin subunits, and it is the only shelterin subunit found at the telomeres of budding yeast (Shore 1994; Kabir et al. 2010, 2014). The most conserved regions of Rap1 are the N-terminal BRCT domain (pfam16589) and the central Myb domain (pfam08914), which are found not only in all metazoans but also in many unicellular organisms (Fig. 3A). The Rap1 C-terminal (RCT) domain, which interacts with TRF2, is much less conserved, yet structural analysis has revealed that this region folds into a similar six-helix bundle in mammals and fungi (Fig. 3B,C; Chen et al. 2011). The interaction of the RCT domain with its binding partners is primarily mediated by helices 1 and 2 (Fig. 3B,C; Chen et al. 2011). Consistent with the conservation of the RCT in vertebrate Rap1 orthologs, the Rap1 binding site in the hinge region of TRF2 is conserved in mammals, birds, most reptiles, and some amphibians (Fig. 3C). The sequence conservation of the Rap1-binding site is accompanied by the structural conservation of the two helices involved in Rap1 binding (as predicted by RaptorX) (Fig. 3C). In contrast, most TRF2 orthologs in bony and cartilaginous fish appear to lack the Rap1-binding site, and it is not found in invertebrates (Fig. 3C; data not shown). Although there is some sequence similarity between the vertebrate Rap1 binding site and the sequences in the TRF2 hinge domain of Branchiostoma belcheri and Acanthaster planci, this is unlikely to represent a Rap1 binding site, since these regions are not predicted to fold into the two helices required for binding to the Rap1 RCT (data not shown).
Figure 3.

Evolution of the TRF2–Rap1 interface. (A) Domain architecture of human TRF2 and Rap1 with the indicated RBM–RCT interaction (arrow) and the structure of the TRF2 RBM (yellow) with the Rap1 RCT (green) (from PDB 3K6G). (B) Alignment of the RCT of Rap1. Helices are indicated by green boxes. (C) Alignment of the RBM of TRF2. Helices are indicated by yellow boxes. The helices in the presumed RBM of Callorhinchus milii are predicted by RaptorX. All alignments were performed with MUSCLE and are shown with ClustalX coloring.

Evolution of the TRF2–Rap1 interface. (A) Domain architecture of human TRF2 and Rap1 with the indicated RBM–RCT interaction (arrow) and the structure of the TRF2 RBM (yellow) with the Rap1 RCT (green) (from PDB 3K6G). (B) Alignment of the RCT of Rap1. Helices are indicated by green boxes. (C) Alignment of the RBM of TRF2. Helices are indicated by yellow boxes. The helices in the presumed RBM of Callorhinchus milii are predicted by RaptorX. All alignments were performed with MUSCLE and are shown with ClustalX coloring. These data do not imply, however, that the Rap1 orthologs from fish and invertebrates do not bind to their cognate TRF2 proteins. In this regard, Gaullier et al. (2016) proposed that a Y-x-L-x-P motif in the BRCT domain of Rap1 binds to the TRF2 F162 site. As this motif in Rap1 is absent from many vertebrate Rap1 orthologs (Supplemental Fig. S5), it is unlikely to represent an ancestral Rap1–TRF2 interface. An alternative Rap1–TRF binding mode, involving the Myb domain of Rap1, was noted in the unicellular parasite Trypanosoma brucei (Afrin et al. 2020). Whether this represents an ancestral state remains to be determined.

Only the TRFH domain of TIN2 is highly conserved

The TRFH domain of TIN2 is conserved in all metazoans (23% ± 9% identity) (Fig. 4A,B; http://imed.med.ucm.es/Tools/sias.html), consistent with the ancient origin of the TRFH domain (Hu et al. 2017; Xue et al. 2017). In contrast, the F-x-L-x-P TRF1-binding motif (cd11741), C-terminal to the TRFH domain of TIN2, is found only in mammals (Fig. 4C; data not shown). Nonetheless, in Xenopus, TRF1 has been shown to interact with TIN2 (Vizlin-Hodzic et al. 2009). The short region where dyskeratosis congenita (DC) (Savage 2018) mutations cluster is conserved in mammals, but the homology is strongly reduced in reptiles and amphibians (Fig. 4C). No significant homology with the DC cluster was detected in other vertebrates (Fig. 4C; data not shown).
Figure 4.

Conservation of the TIN2 TRFH domain and its interaction motifs in TRF2 and TPP1. (A) Domain architecture of human TIN2. Sites of interaction with other shelterin subunits are indicated. (B) Alignment of the TIN2 TRFH domains with conserved residues is indicated. Helices are indicated by teal boxes. (C) Alignments of the TBM and the DC patch. Arrows indicate DC mutations (Q269X, Q271X, K280E, K280fs, R282H, R282S, P283S, P283A, P283H, T284A, T284fs, L287P, P289S, R291G, and Q298fs). (D) Alignment of the TIN2-binding motif of TRF2. (E) Alignment of the TIN2-binding motif of TPP1. Helices from the human structure are noted with the white boxes above the sequences. Predicted helices (RaptorX) in TPP1 orthologs from cartilaginous fish are shown in white boxes below the sequences. Alignments in B–E were done with MUSCLE with ClustalX coloring.

Conservation of the TIN2 TRFH domain and its interaction motifs in TRF2 and TPP1. (A) Domain architecture of human TIN2. Sites of interaction with other shelterin subunits are indicated. (B) Alignment of the TIN2 TRFH domains with conserved residues is indicated. Helices are indicated by teal boxes. (C) Alignments of the TBM and the DC patch. Arrows indicate DC mutations (Q269X, Q271X, K280E, K280fs, R282H, R282S, P283S, P283A, P283H, T284A, T284fs, L287P, P289S, R291G, and Q298fs). (D) Alignment of the TIN2-binding motif of TRF2. (E) Alignment of the TIN2-binding motif of TPP1. Helices from the human structure are noted with the white boxes above the sequences. Predicted helices (RaptorX) in TPP1 orthologs from cartilaginous fish are shown in white boxes below the sequences. Alignments in B–E were done with MUSCLE with ClustalX coloring. Consistent with the conservation of the TRFH domain, the regions in TRF2 and TPP1 that interact with the TRFH domain of TIN2 are conserved in all vertebrates (33% ± 18% identity for the TIN2-binding motif [TBM] in TPP1 and 46% ± 24% identity for the TBM in TRF2) (Fig. 4D,E; http://imed.med.ucm.es/Tools/sias.html). However, the TBM in TRF2 could not be identified in 12 invertebrates examined by alignment to the relevant sequences from vertebrate TRF2s, indicating a low level of conservation (Fig. 4D; data not shown). The helical motif that allows TPP1 to bind to TIN2 was also absent from most invertebrates, except for the echinoderms Acanthaster planci and Asterias rubens (Fig. 4E; data not shown). In these two starfish, the C terminus of TPP1 is predicted to be α-helical and has sequence similarity to the TBM in vertebrate TPP1s (Fig. 4E).

Conserved motifs of TPP1 and its highly variable linker

The TIN2 interaction motif (Fig. 4E) in the C terminus of TPP1 is separated from two other functional domains: the N-terminal OB fold and the adjacent recruitment domain (RD), which interacts with POT1 (Fig. 5A). The OB fold and RD are separated from the TIN2-binding site by linkers of variable length, ranging from 50 to 700 amino acids (Fig. 5A,B), which is the least conserved part of TPP1 (Supplemental Fig. S6A). While the linker in the vertebrate TPP1 orthologs has conserved sequences (Supplemental Fig. S6B), this conservation does not extend to invertebrates (data not shown). In the vertebrates, the linker is predicted to lack secondary structure, and it has a large number of serine and proline residues with low helical propensity, suggesting that it is intrinsically disordered (Supplemental Fig. S6C).
Figure 5.

Conserved motifs and the variable linker of TPP1. (A) Domain architecture of the main isoform of human TPP1. (B) Plot showing the variability in the length of the TPP1 linker. (C) Alignment of two telomerase recruitment motifs. (Left) The N-terminal OB (NOB) motif is indicated above. Helix 1 of the human TPP1 OB fold is indicated with a white box. (Right) The telomerase-binding TEL patch is indicated with arrows pointing to residues (E81, E82, E84, R93, L96, L125, and E128) critical for the TPP1–telomerase interaction. (D) Structure (PDB 5UN7) of the OB3/HJRL domain of POT1 (gray) showing the three helices of TPP1 critical for POT1 binding (orange). (E) Alignment of the RD/POT1-binding domain of TPP1. Positions of the three helices in the human TPP1 sequence are indicated in orange below. Alignments in C and E were done with MUSCLE with ClustalX coloring.

Conserved motifs and the variable linker of TPP1. (A) Domain architecture of the main isoform of human TPP1. (B) Plot showing the variability in the length of the TPP1 linker. (C) Alignment of two telomerase recruitment motifs. (Left) The N-terminal OB (NOB) motif is indicated above. Helix 1 of the human TPP1 OB fold is indicated with a white box. (Right) The telomerase-binding TEL patch is indicated with arrows pointing to residues (E81, E82, E84, R93, L96, L125, and E128) critical for the TPP1–telomerase interaction. (D) Structure (PDB 5UN7) of the OB3/HJRL domain of POT1 (gray) showing the three helices of TPP1 critical for POT1 binding (orange). (E) Alignment of the RD/POT1-binding domain of TPP1. Positions of the three helices in the human TPP1 sequence are indicated in orange below. Alignments in C and E were done with MUSCLE with ClustalX coloring. The OB fold of TPP1 is highly conserved in all metazoans as well as in a number of unicellular organisms (pfam10341). In mammals, the OB fold recruits telomerase to telomeres using the most N-terminal sequences (NOB) and the centrally located TEL patch, which bind to the TEN domain of TERT (Nandakumar et al. 2012; Zhong et al. 2012; Sexton et al. 2014; Grill et al. 2018, 2021). As previously noted, the TEL patch and the NOB are conserved in mammals (Fig. 5C; Grill et al. 2018, 2021). However, the residues known to contribute to the interaction of TPP1 with TERT in human cells are not conserved in other vertebrates (Fig. 5C). It remains to be addressed whether and how TPP1 brings telomerase to telomeres in nonmammalian animals. Mammalian TPP1 binds POT1 through the interaction of its central RD domain with the HJRL domain of POT1, which interrupts the third OB fold in vertebrates (Figs. 5D,E, 6; Chen et al. 2017; Rice et al. 2017). The sequence of the RD domain of TPP1 is conserved in vertebrates but less so in invertebrates, consistent with the HJRL domain being absent from invertebrate POT1 orthologs (see Fig. 6). Nonetheless, the invertebrate RD domains are predicted to contain the second helix and the third (split) helix that provide the POT1-binding interface in vertebrates (Fig. 5E).
Figure 6.

A predicted fourth OB fold in most metazoan POT1 orthologs. (A) Domain architecture of human POT1, with the N-terminal OB fold (OB-N), which is absent in rodents and primates, indicated by a dotted box. Pfam database numbers for OB1 and OB2 of POT1 are indicated. The HJRL domain, specific to vertebrates, is depicted in orange. (B) MUSCLE alignment with ClustalX coloring of the N-terminal OB fold region (OB-N) of POT1 from representative species. (C) Predicted OB fold structures of the N-terminal POT1 sequences of the three indicated species. Templates were selected by alignment by SWISS-MODEL, and the sequence of the input proteins was modeled against the known template structure. All three were modeled against OB-A of human RPA70 (PDB 1FGU; shown at the left) with a Qmean score greater than −4.00, indicating a high-quality estimation. Structures are colored in PyMOL using rainbow spectrum coloring to indicate the position along the sequence from the N (red) to the C terminus (violet). (D) Structure of the human POT1 OB3/HJRL in complex with the RD/POT1-binding domain (violet) of TPP1 (PDB 5UN7). Helices of TPP1 are indicated. POT1 is colored by sequence similarity according to the BLOSUM62 matrix as indicated in the legend. The four cysteine residues that coordinate a single Zn2+ ion (gray) are shown as sticks. (E) MUSCLE alignment with ClustalX coloring of the POT1 OB3/HJRL C termini of representative species colored as in D. The split OB3 (red) and HJRL insert (orange) are indicated above. Residues that coordinate Zn2+ are indicated. (F) Model for the evolution of POT1 from an RPA70-like protein with four OB folds. (G) Template-modeled structure of the full-length Capsaspora owczarzaki POT1 generated using intensive PHYRE2 modeling. Four distinct OB fold-like domains are indicated in red (OB-N), green (OB1), yellow (OB2), and violet (OB3). The C-x-x-C motif in OB3 is indicated.

A predicted fourth OB fold in most metazoan POT1 orthologs. (A) Domain architecture of human POT1, with the N-terminal OB fold (OB-N), which is absent in rodents and primates, indicated by a dotted box. Pfam database numbers for OB1 and OB2 of POT1 are indicated. The HJRL domain, specific to vertebrates, is depicted in orange. (B) MUSCLE alignment with ClustalX coloring of the N-terminal OB fold region (OB-N) of POT1 from representative species. (C) Predicted OB fold structures of the N-terminal POT1 sequences of the three indicated species. Templates were selected by alignment by SWISS-MODEL, and the sequence of the input proteins was modeled against the known template structure. All three were modeled against OB-A of human RPA70 (PDB 1FGU; shown at the left) with a Qmean score greater than −4.00, indicating a high-quality estimation. Structures are colored in PyMOL using rainbow spectrum coloring to indicate the position along the sequence from the N (red) to the C terminus (violet). (D) Structure of the human POT1 OB3/HJRL in complex with the RD/POT1-binding domain (violet) of TPP1 (PDB 5UN7). Helices of TPP1 are indicated. POT1 is colored by sequence similarity according to the BLOSUM62 matrix as indicated in the legend. The four cysteine residues that coordinate a single Zn2+ ion (gray) are shown as sticks. (E) MUSCLE alignment with ClustalX coloring of the POT1 OB3/HJRL C termini of representative species colored as in D. The split OB3 (red) and HJRL insert (orange) are indicated above. Residues that coordinate Zn2+ are indicated. (F) Model for the evolution of POT1 from an RPA70-like protein with four OB folds. (G) Template-modeled structure of the full-length Capsaspora owczarzaki POT1 generated using intensive PHYRE2 modeling. Four distinct OB fold-like domains are indicated in red (OB-N), green (OB1), yellow (OB2), and violet (OB3). The C-x-x-C motif in OB3 is indicated.

Most metazoan POT1 proteins have a predicted fourth OB fold

The ssDNA–binding domain of POT1 contains two highly conserved OB folds (OB1 and OB2; pfam 02765 and 16686, respectively), the first of which allowed the identification of both fission yeast and human POT1 orthologs based on sequence similarity to the ciliate telomere-binding protein TEBPα (Fig. 6A; Baumann and Cech 2001). Crystallography revealed the presence of a third OB fold in human POT1, which is split by an HJRL domain (Fig. 6A; Chen et al. 2017; Rice et al. 2017). The function of the HJRL has not been established. Interestingly, our analysis indicates that the three-OB-fold architecture of POT1 in rodents and primates differs from what is seen in all other animals (Fig. 6B). Most vertebrate POT1 proteins have a conserved N-terminal extension that is also present in the POT1 proteins of invertebrates (Fig. 6B). SWISS-MODEL analysis predicted that in each case, this N-terminal extension folds into a canonical OB fold (Fig. 6C). We conclude that most metazoan POT1 proteins likely have a fourth OB fold at their N termini, which we call OB-N. The loss of OB-N occurred during mammalian evolution ∼100 million years ago when the Euarchontoglires (which include primates, rodents, and lagomorphs) diverged from Laurasiatherian mammals (which include carnivores, insectivores, and bats) (Supplemental Fig. S7A). Analysis of the exon–intron structure of the human POT1 gene indicates that the absence of OB-N is due to the loss of a small exon (e.g., exon 3 in the Felis catus POT1 gene), which results in a frameshift and premature termination of the OB-N reading frame (Supplemental Fig. S7B,C). Downstream from this stop codon, a second ATG allows expression of the three-OB-fold version of POT1. This second ATG is also present in the feline POT1 mRNA, raising the possibility that cats, and perhaps other mammals, express a three- as well as a four-OB-fold POT1 isoform. Among the Euarchontoglires, we identified one species (the arctic ground squirrel, Urocitellus parryii) that has restored the four-OB-fold architecture of POT1, presumably by mutations that revive the lost exon. The C-terminal OB fold (OB3) of all metazoan POT1 orthologs is well conserved and often interrupted by a region with a much lower level of conservation (Fig. 6D,E). In vertebrate POT1 orthologs, OB3 contains a conserved HJRL domain, whereas the insert is highly variable in sequence and length in invertebrates (Fig. 6E,F; Supplemental Fig. S8A,B). For instance, the insert in several corals is predicted to be an OB fold, raising the odd possibility of an OB fold within an OB fold (Supplemental Fig. S8B). Structural analysis may therefore reveal a matryoshka doll version of POT1 in these animals. Despite the variability of the insert in the C-terminal OB fold, it always contains cysteine residues that coordinate Zn2+ in conjunction with cysteine residues in the N-terminal half of the third OB fold (Fig. 6E). Based on these findings, we propose that POT1 arose as a four-OB-fold protein, containing OB-N, OB1, OB2, and the split OB3 (Fig. 6F). Most likely, the original eukaryotic POT1 evolved from a duplication of a precursor similar to RPA70, which has four OB folds and is part of an ssDNA-binding complex (Fig. 6F; Caldwell and Spies 2020). The four-OB-fold POT1 proteins have the same domain architecture as RPA70, and in both proteins, the second and the third OB fold have high-affinity ssDNA-binding activity. Furthermore, the fourth (C-terminal) OB folds of both proteins have a Zn2+-binding motif and provide binding sites for interacting partners (TPP1 for POT1, and RPA32 and RPA14 for RPA70). Consistent with the idea that POT1 arose from a four-OB-fold RPA70-like protein, there are four predicted OB folds in the POT1 of Capsaspora owczarzaki (Fig. 6G), a protist that is thought to be one of the closest unicellular relatives of animals (Ferrer-Bonet and Ruiz-Trillo 2017). Furthermore, there are four OB folds in POT1 ortholog TEB1, which is a component of the Tetrahymena telomerase holoenzyme (He et al. 2021). If the original eukaryotic POT1 indeed had four OB folds, the N-terminal OB fold must have been lost in some fungi and ciliates, since there are only three OB folds in Schizosaccharomyces pombe pot1 and Oxytricha nova TEBPα (Horvath et al. 1998; Lloyd et al. 2016).

Duplication of POT1 in the rodent lineage

Rats and mice have two POT1 proteins at their telomeres, POT1a and POT1b, whose functions have diverged. POT1b recruits the CST complex to telomeres while POT1a serves as the primary repressor of ATR signaling (Hockemeyer et al. 2006; Denchi and de Lange 2007; Wu et al. 2012). Our analysis of the POT1 genes in rodents shows that this duplication occurred during the divergence of Muridae (e.g., rats, mice, and gerbils) and Cricetidae (e.g., voles, lemmings, and hamsters) from Spalacidae (mole rats) and Dipodidae (jerboas) (Supplemental Fig. S9A,B). POT1b binds to CST with two sequence motifs that are distinct from POT1a (Supplemental Fig. S9C; Wu et al. 2012). In the rodents with a single POT1, this protein has the C-terminal motif that allows POT1b to bind CST. Therefore, it is likely that the CST binding ability was the ancestral state and that POT1a lost this feature. Another difference between shelterin in mice and rats versus humans is that these rodents lack a tankyrase 1-binding site in the N terminus of TRF1 (Smith et al. 1998; Donigian and de Lange 2007). Remarkably, the presence of a tankyrase 1-binding site in rodent TRF1 orthologs is closely correlated with the presence of a single POT1 gene (Supplemental Fig. S9D). Thus, during the evolution of the Muridae and Cricetidae, the interaction between TRF1 and tankyrase 1 was lost, and POT1 was duplicated. Whether these two changes are functionally related remains to be determined.

Discussion

The data presented here illuminate how shelterin evolved in metazoans, revealing unanticipated modifications of the six shelterin subunits (summarized in Fig. 7). Although it remains to be determined whether the invertebrate shelterin subunits identified here actually bind to and function at telomeres, our findings indicate that in invertebrates, shelterin contains a single TRF. These invertebrate TRFs are more like TRF2 than TRF1, arguing that the original shelterin was built on a TRF2-like factor. In addition to the TRFH and Myb domains that define the TRF proteins, most metazoan TRFs contain an iDDR-like segment immediately upstream of the Myb domain. Given the conservation of this feature, it will be of interest to understand the full spectrum of functions associated with the iDDR. It will also be important to understand whether the role of TRF1 in promoting the replication of telomeric DNA is a feature it shares with the ancestral TRF or a novelty.
Figure 7.

Summary of the evolution of metazoan shelterin. (Top) Schematic of the five-subunit invertebrate shelterin complex and its conserved domains. (Bottom) Schematic of the six-subunit vertebrate shelterin, highlighting the domains absent from the invertebrate shelterin subunits. The indicated interactions are observed in human and mouse shelterin and have not been verified in most other vertebrates. The colors of the domains are clarified in the color legend.

Summary of the evolution of metazoan shelterin. (Top) Schematic of the five-subunit invertebrate shelterin complex and its conserved domains. (Bottom) Schematic of the six-subunit vertebrate shelterin, highlighting the domains absent from the invertebrate shelterin subunits. The indicated interactions are observed in human and mouse shelterin and have not been verified in most other vertebrates. The colors of the domains are clarified in the color legend. The duplication that gave rise to TRF1 and TRF2 occurred during the divergence of vertebrates. Following this duplication, several nonshelterin proteins (e.g., PNUTS, Apollo, and SLX4) evolved the Y/H-x-L-x-P motif, which interacts with the common protein interaction site in the TRFH domain of TRF2. PNUTS was the first and SLX4 was the last protein to become shelterin accessory factors by gaining the TRF2-binding site. The distinct N-terminal domains of TRF1 and TRF2, which are characterized by acidic and basic residues, respectively, did not appear until the divergence of mammals. Whereas the function of the acidic domain of TRF1 is not clear, the Basic/GAR domain of TRF2 is critical for the protection of the t-loop from inappropriate processing by branch migration and Holliday junction resolution (Wang et al. 2004; Saint-Léger et al. 2014; Rai et al. 2016; Schmutz et al. 2017). Possibly, this protective function is mediated by a TRF2-interacting protein in nonmammalian vertebrates. TIN2 is present in all metazoan shelterin complexes, and its TRFH domain, where TRF2 and TPP1 interact, is highly conserved. In contrast, the C-terminal half of TIN2, including the TRF1-binding site and the neighboring cluster of amino acids mutated in DC, is poorly conserved outside mammals. However, the interaction of Xenopus TIN2 and TRF1 suggests that TRF1 can bind to TIN2 in the absence of the TRF1-binding motif defined in mammals (Vizlin-Hodzic et al. 2009). TPP1 has three functional domains that are well conserved in all metazoans: the N-terminal OB fold where telomerase binds, the RD where POT1 binds, and the TIN2-binding site at its C terminus. The region between the POT1- and TIN2-binding sites is highly divergent in sequence and shows remarkable variation in length. This part of TPP1 may represent a flexible linker allowing POT1 and telomerase to be positioned in different orientations vis-à-vis the core of shelterin. Finally, the data reveal important aspects of metazoan POT1. Our data indicate that most metazoans have POT1 proteins with four rather than three OB folds, suggesting that POT1 evolved from a precursor that had an additional OB fold (OB-N) at its N terminus. We propose that POT1 evolved from a protein related to RPA70, the largest subunit of the RPA ssDNA-binding complex. RPA70 has four OB folds and, like POT1, interacts with its binding partners through a C-terminal OB fold with a Zn2+-binding site. OB-N was lost in primates and rodents, and it will be of interest to determine the function of OB-N in other vertebrates. The most C-terminal OB fold (OB3), which is split by the insertion of an HJRL domain in vertebrates, is also split in invertebrates. While the nature of the inserted sequence varies, the Zn2+-binding feature is conserved. This suggests that it is the split nature of OB3 and Zn2+ binding, rather than the added sequence, that is functionally significant. These data show that the basic architecture of shelterin—including the presence of duplex and single-stranded DNA-binding factors and subunits that allow the DNA-binding proteins to form a complex—is highly conserved in metazoans. Nonetheless, shelterin underwent major innovations, especially during vertebrate evolution, where one subunit was duplicated and others lost or gained entire domains. The only shelterin subunit that changed little during metazoan evolution is Rap1. Although Rap1 is the most conserved shelterin subunit and has a counterpart at telomeres of unicellular organisms, it makes relatively little contribution to shelterin function in mammals. It will be of interest to further trace the evolution of shelterin subunits in unicellular organisms to understand how this complex emerged from proteins that provided its basic building blocks: Myb domains, TRFH domains, and OB folds.

Materials and methods

Identification of genes encoding shelterin subunits

All accession numbers used in this study are listed in Supplemental Table S1. Metazoan shelterin subunits were identified using BLASTp and PSI-BLAST searches of the nonredundant protein sequence (nr) database. A core set of 11 vertebrate species (representing mammals, reptiles, birds, amphibians, and fish) that contained all six shelterin subunits was identified and used as a starting point for alignments. Invertebrate TRF orthologs were identified based on searches against the TRFH and Myb domains of TRF1 and TRF2. Other shelterin subunits were identified in these organisms by searching for sequences homologous to the BRCT, Myb, and/or RCT domains of human Rap1; the TRFH domain of human TIN2; the OB fold of human TPP1; the OB1–OB2 (ssDNA-binding) domains of human POT1; and the respective full-length proteins. An E-value threshold of 0.005 was used as a cutoff for alignment. Shelterin subunits from additional species in certain clades (e.g., primates and rodents) were identified in representative species chosen by intermediate evolutionary distance and high annotation quality. DNA sequences of POT1 exons 1–6 of Homo sapiens and Felis catus were identified from the EMBL-EBI Ensembl database and compared using SnapGene.

Alignments

Protein sequences obtained from BLAST were aligned using multiple sequence comparison by log expectation (MUSCLE) in either SnapGene (GSL Biotech) or Jalview (Waterhouse et al. 2009). To define individual domains, the full-length sequences were aligned, and then sections of the alignment were realigned to refine segments misaligned due to gaps. Alignments were colored using ClustalX unless otherwise indicated. Domains were identified by alignment to the corresponding Homo sapiens sequences. Some alignments were confirmed using secondary structure prediction using JPred4 (Drozdetskiy et al. 2015) or RaptorX (Källberg et al. 2014). For visualizing the variable linker region of TPP1, the NCBI Multiple Sequence Alignment (MSA) viewer was used by importing a MUSCLE alignment of the TPP1 proteins and set to color by “conservation” in Jalview. Helix propensity was visualized in Jalview using original values described by Chou and Fasman (1978). JPred secondary structure prediction of the TPP1 linker was performed in Jalview.

Phylogenetic tree inference and evolutionary time-scale analysis

Phylogenetic trees were inferred using MrBayes version 3.2 (Ronquist et al. 2012). Alignments were created using MUSCLE and formatted as a NEXUS file. MrBayes sampled across fixed amino acid rate matrices using one to three chains per analysis. After 105–106 generations, the models converged with a low standard deviation and were summarized into a tree format file. Trees were visualized in FigTree. For species-level analysis, TimeTree (Hedges et al. 2015; Kumar et al. 2017) was used.

iDDR logo and local pI analysis

An iDDR sequence logo was generated using WebLogo (Crooks et al. 2004). A custom script for analyzing the local pI of a protein sequence was written in R. A 20-amino-acid sliding window was used to calculate the pI at a given position using the EMBOSS pK scale for each amino acid.

Structure viewing and model building

Published structures were downloaded from the Protein Data Bank and visualized in PyMOL. The following PDB structures were used: 3K6G (TRF2-Rap1), 5UN7 (POT1-TPP1), and 1FGU [RPA70(OB-A)]. Protein modeling was run in PHYRE2 (Kelley et al. 2015) or SWISS-MODEL (Bordoli et al. 2009). For the Capsaspora owczarzaki model in Figure 6F and for verification of other four-OB-fold POT1 species (data not shown), PHYRE2 modeling was run in “intensive” mode to model the full protein structure rather than simply the well-aligned part. Identification of HJRL insert sequences was performed by first aligning the sequences of OB3 and then modeling the insert sequence in PHYRE2.

Mapping conservation of the POT1 OB3-HJRL onto the structure

The POT1 OB3-HJRL was extracted from the POT1 OB3-HJRL/TPP1 RD(PBM) structure (PDB 5UN7), and an alignment matching the sequence was generated in Jalview. A custom Perl script calculated the sequence similarity scores at each position using the BLOSUM62 matrix. These similarity scores were mapped onto the structure using the indicated cutoff values for each color. The colors were further mapped onto the sequence alignment using a custom script to output color commands and then visualized on the alignment using AlScript (Barton 1993).

Cell culture and expression constructs

MEFs were cultured in Dulbecco's modified Eagle medium (DMEM; Corning) supplemented with 15% fetal bovine serum (FBS; Gibco), nonessential amino acids (Gibco), 2 mM L-glutamine (Gibco), 100 U/mL penicillin, and 100 μg/mL streptomycin (Gibco). Phoenix cells were cultured in DMEM supplemented with 10% bovine calf serum (BCS). The doxycycline-inducible transactivator rtTA was delivered into MEFs using pQCXIB (Clontech) retroviral vector. Nontherian TRF1 cDNAs (Anolis carolinensis ENSACAT00000014506.4 and Ornithorhynchus anatinus ENSOANT00000002746.3) were synthesized as gBlocks gene fragments (IDT) and cloned (alongside mouse TRF1) into pRetroX-TRE3G (Takara) using Gibson assembly adding an N-terminal Myc tag. The TRF1-expressing retroviral constructs were delivered into rtTA-MEFs by retroviral transduction. Protein expression was induced with 2 μg/mL doxycycline for 24 h.

IF-FISH and immunoblotting

IF-FISH with Myc (CST 9B11, #2276) was performed as described previously (Dimitrova et al. 2008). Digital images were acquired on a DeltaVision system (Applied Precision) equipped with a cooled charge-coupled device camera (CoolSnap QE, Photometrics), a PlanApo 60× 1.40 NA objective (Olympus America, Inc.), and SoftWoRx software. For immunoblotting, whole-cell lysates were prepared using RIPA buffer (150 mM NaCl, 1% NP-40, 0.5% sodium deoxycholate, 0.1% SDS, 50 mM Tris-HCl at pH 8) containing protease inhibitors (100 μM PMSF, 1 mM benzamidine, 2.5 μg/mL pepstatin A, 10 μg/mL leupeptin, 10 μg/mL aprotinin) and phosphatase inhibitors (1 mM each NaF, Na3VO4, Na2P2O7). Proteins were fractionated by SDS/PAGE and blotted onto membranes. Membranes were blocked in 5% milk/0.1% Tween-20 in TBS (TBST), incubated in primary antibody for 1 h at room temperature, washed three times in TBST, and incubated for 1 h at room temperature with secondary antibodies. After three washes in TBST, membranes were developed using enhanced chemiluminescence (Amersham).
  89 in total

1.  Ku70 stimulates fusion of dysfunctional telomeres yet protects chromosome ends from homologous recombination.

Authors:  Giulia B Celli; Eros Lazzerini Denchi; Titia de Lange
Journal:  Nat Cell Biol       Date:  2006-07-16       Impact factor: 28.824

Review 2.  Telomerase Mechanism of Telomere Synthesis.

Authors:  R Alex Wu; Heather E Upton; Jacob M Vogan; Kathleen Collins
Journal:  Annu Rev Biochem       Date:  2017-01-30       Impact factor: 23.643

3.  Crystal structure of the Oxytricha nova telomere end binding protein complexed with single strand DNA.

Authors:  M P Horvath; V L Schweiker; J M Bevilacqua; J A Ruggles; S C Schultz
Journal:  Cell       Date:  1998-12-23       Impact factor: 41.582

Review 4.  Shaping human telomeres: from shelterin and CST complexes to telomeric chromatin organization.

Authors:  Ci Ji Lim; Thomas R Cech
Journal:  Nat Rev Mol Cell Biol       Date:  2021-02-09       Impact factor: 94.444

5.  TALEN gene knockouts reveal no requirement for the conserved human shelterin protein Rap1 in telomere protection and length regulation.

Authors:  Shaheen Kabir; Dirk Hockemeyer; Titia de Lange
Journal:  Cell Rep       Date:  2014-11-06       Impact factor: 9.423

6.  How the human telomeric proteins TRF1 and TRF2 recognize telomeric DNA: a view from high-resolution crystal structures.

Authors:  Robert Court; Lynda Chapman; Louise Fairall; Daniela Rhodes
Journal:  EMBO Rep       Date:  2005-01       Impact factor: 8.807

7.  Tankyrase, a poly(ADP-ribose) polymerase at human telomeres.

Authors:  S Smith; I Giriat; A Schmitt; T de Lange
Journal:  Science       Date:  1998-11-20       Impact factor: 47.728

Review 8.  Shelterin-Mediated Telomere Protection.

Authors:  Titia de Lange
Journal:  Annu Rev Genet       Date:  2018-09-12       Impact factor: 16.830

9.  Tree of life reveals clock-like speciation and diversification.

Authors:  S Blair Hedges; Julie Marin; Michael Suleski; Madeline Paymer; Sudhir Kumar
Journal:  Mol Biol Evol       Date:  2015-03-03       Impact factor: 16.240

10.  Dysfunctional telomeres activate an ATM-ATR-dependent DNA damage response to suppress tumorigenesis.

Authors:  Xiaolan Guo; Yibin Deng; Yahong Lin; Wilfredo Cosme-Blanco; Suzanne Chan; Hua He; Guohua Yuan; Eric J Brown; Sandy Chang
Journal:  EMBO J       Date:  2007-10-18       Impact factor: 14.012

View more
  6 in total

Review 1.  The E3 Ubiquitin Ligase Fbxo4 Functions as a Tumor Suppressor: Its Biological Importance and Therapeutic Perspectives.

Authors:  Shuo Qie
Journal:  Cancers (Basel)       Date:  2022-04-25       Impact factor: 6.575

2.  Ustilago maydis telomere protein Pot1 harbors an extra N-terminal OB fold and regulates homology-directed DNA repair factors in a dichotomous and context-dependent manner.

Authors:  Syed Zahid; Sarah Aloe; Jeanette H Sutherland; William K Holloman; Neal F Lue
Journal:  PLoS Genet       Date:  2022-05-19       Impact factor: 6.020

3.  The non-telomeric evolutionary trajectory of TRF2 in zebrafish reveals its specific roles in neurodevelopment and aging.

Authors:  Yilin Ying; Xuefei Hu; Peng Han; Aaron Mendez-Bermudez; Serge Bauwens; Rita Eid; Li Tan; Mélanie Pousse; Marie-Joseph Giraud-Panis; Yiming Lu; Eric Gilson; Jing Ye
Journal:  Nucleic Acids Res       Date:  2022-02-28       Impact factor: 16.971

4.  Shelterin is a dimeric complex with extensive structural heterogeneity.

Authors:  John C Zinder; Paul Dominic B Olinares; Vladimir Svetlov; Martin W Bush; Evgeny Nudler; Brian T Chait; Thomas Walz; Titia de Lange
Journal:  Proc Natl Acad Sci U S A       Date:  2022-07-26       Impact factor: 12.779

Review 5.  Connecting telomere maintenance and regulation to the developmental origin and differentiation states of neuroblastoma tumor cells.

Authors:  Eun Young Yu; Nai-Kong V Cheung; Neal F Lue
Journal:  J Hematol Oncol       Date:  2022-08-27       Impact factor: 23.168

Review 6.  Aging conundrum: A perspective for ovarian aging.

Authors:  Jiachen Wu; Yang Liu; Yinhua Song; Lingjuan Wang; Jihui Ai; Kezhen Li
Journal:  Front Endocrinol (Lausanne)       Date:  2022-08-19       Impact factor: 6.055

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.