| Literature DB >> 28176777 |
Dennis Kappei1,2, Marion Scheibe3,4, Maciej Paszkowski-Rogacz2, Alina Bluhm3, Toni Ingolf Gossmann5, Sabrina Dietz3, Mario Dejung3, Holger Herlyn6, Frank Buchholz2,7,8,9,10, Matthias Mann4, Falk Butter3.
Abstract
Molecular phylogenomics investigates evolutionary relationships based on genomic data. However, despite genomic sequence conservation, changes in protein interactions can occur relatively rapidly and may cause strong functional diversification. To investigate such functional evolution, we here combine phylogenomics with interaction proteomics. We develop this concept by investigating the molecular evolution of the shelterin complex, which protects telomeres, across 16 vertebrate species from zebrafish to humans covering 450 million years of evolution. Our phylointeractomics screen discovers previously unknown telomere-associated proteins and reveals how homologous proteins undergo functional evolution. For instance, we show that TERF1 evolved as a telomere-binding protein in the common stem lineage of marsupial and placental mammals. Phylointeractomics is a versatile and scalable approach to investigate evolutionary changes in protein function and thus can provide experimental evidence for phylogenomic relationships.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28176777 PMCID: PMC5309834 DOI: 10.1038/ncomms14334
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Figure 1Phylointeractomics screen for telomere binders in 16 vertebrate species.
(a) Phylogenetic tree of vertebrate species analysed in this study—all higher ranked vertebrate taxa apart from monotremata are represented. (b) Quantitative label-free DNA interaction screen with DNA oligonucleotides containing either the telomeric repeat sequence (TTAGGG) or a control sequence (GTGAGT). Specific interaction partners are differentiated from background binders by a ratio significantly different from 1:1. All pull-downs were performed in biological replicates (n=4). (c) Volcano plots for all tested vertebrate species. Specifically enriched proteins (red circles) are distinguished from background binders (blue circles) by a two-dimensional cutoff with S0=0.6 and P<0.05 (Welch's t-test). Detected members of the shelterin complex (TERF1, TERF2, TIN2, TPP1, RAP1 and POT1) are highlighted (filled orange dots).
Figure 2Telomere-binding proteins are identified by cross-species validation.
(a) Heat map of identified proteins with specific enrichment at TTAGGG repeats in at least 5 out of the 16 vertebrate species. Species (rows) are arranged according to their phylogenetic relations while proteins (columns) are clustered based on their binding pattern across all pull-downs. Colour gradient represents relative enrichment from TTAGGG binding (red) to equal enrichment on the telomeric and control sequence (blue). Only events that were identified as hits according to the criteria in Fig. 1c are shown. Gene names are based on the human versions, and in the occurrence of paralogues, both enrichment values are displayed side by side in the species concerned (for example, POT1 in mouse). The presence of telomerase activity is indicated on the right. (+) indicates cell lines with activity at the detection limit. IMR90 (human)47 and NIH3T3 cells (mouse)48 are known telomerase-negative and -positive cells, respectively. Quantifications can be found in Supplementary Fig. 2. (b) Heat map of protein sequence identity for all candidate telomere binders from a across 15 vertebrate species. Axolotl was excluded from this analysis as there is currently no published whole-genome annotation available. The protein sequence identity is displayed as a colour gradient relative to the human sequences with the percentage value displayed in each square. Grey squares represent absent genes based on ENSEMBL genome assemblies. Grey values represent non-annotated homologues based on ENSEMBL genome assemblies for which we could identify homologues by reciprocal BLAST search.
Figure 3TERF1 acquired ability to bind telomeric DNA in the therian stem lineage.
(a) Sequence alignment of the TERF1 DBD of several species. Residues involved in DNA binding are marked by a black rectangle. Asterisks (*) indicate positions with fully conserved residues; colons (:) indicate exchanges with biochemical similar and periods (.) with related amino acid. Below each residue, a quantitative representation of the Naive Empirical Bayesian class probability derived from the branch-site modeling in c for selective constraints in therians (red; also in the background of these residues in the sequence alignment) and constraints in vertebrates (blue). Coloured residues refer to results of binding tests of amino-acid exchange variants from d with unchanged binding (blue), reduced binding (orange) or no binding (red). (b) Pull-down of the TERF1 DBD of various different vertebrate species with either telomeric repeats or a control oligonucleotide. (c) Phylogenetic relationship of the species used for substitution rate analysis. The colours denote the branch classifications, with red representing selective constraints across therians and blue representing selective constraints across vertebrates. Substitution rates were calculated using PAML45 to obtain the non-synonymous to synonymous substitution rate ratio (dN/dS=ω). ω values <1, =1 and >1 indicate purifying selection, neutral evolution and diversifying (positive) selection, respectively. A branch-site model (model D) was applied and compared with a homogeneous site model (discrete Model M3) and with a Model D that assumes neutral evolution for a predefined set of branches, representing our null hypothesis (ω=1). (d) Sequence-specific pull-down of single amino-acid exchange variants of the opossum TERF1 DBD. All seven residues exclusively found in platypus but not in therians were tested. (e) Sequence-specific pull-down of platypus TERF1 DBD variants using purified TERF1 DBDs. Combinations of the four identified residues from d were mutated to the opossum sequence to test whether their substitution can attribute TERF1 the capacity of directly binding to telomeric dsDNA. (f) Quantification of western blotting intensities from e (n=3) shows similar binding affinity for platypus variants 3 and 4 compared with wild-type opossum TERF1-DBD. Mean with s.d. error bars.