
FOXP in Tetrapoda: Intrinsically Disordered Regions, Short Linear Motifs and their evolutionary significance.

Lucas Henriques Viscardi, Luciana Tovo-Rodrigues, Pamela Paré, Nelson Jurandi Rosa Fagundes, Francisco Mauro Salzano, Vanessa Rodrigues Paixão-Côrtes, Claiton Henrique Dotto Bau, Maria Cátira Bortolini.

Abstract

The FOXP subfamily is probably the most extensively characterized subfamily of the forkhead superfamily, playing important roles in development and homeostasis in vertebrates. Intrinsically disordered protein regions (IDRs) are protein segments that exhibit multiple physical interactions and play critical roles in various biological processes, including regulation and signaling. IDRs in proteins may play an important role in the evolvability of genetic systems. In this study, we analyzed 77 orthologous FOXP genes/proteins from Tetrapoda, regarding protein disorder content and evolutionary rate. We also predicted the number and type of short linear motifs (SLIMs) in the IDRs. Similar levels of protein disorder (approximately 70%) were found for FOXP1, FOXP2, and FOXP4. However, for FOXP3, which is shorter in length and has a more specific function, the disordered content was lower (30%). Mammals showed higher protein disorder for FOXP1 and FOXP4 than non-mammals. Specific analyses of linear motifs in the four genes also showed a clear differentiation between FOXPs in mammals and non-mammals. We predicted for the first time the role of IDRs and SLIMs in the FOXP gene family associated with possible adaptive novelties within Tetrapoda. For instance, we found gain and loss of important phosphorylation sites in the Homo sapiens FOXP2 IDRs, with possible implications for the evolution of human speech.


Year:  2017        PMID: 28257525      PMCID: PMC5409772          DOI: 10.1590/1678-4685-GMB-2016-0115

Source DB:  PubMed          Journal:  Genet Mol Biol        ISSN: 1415-4757            Impact factor:   1.771


Introduction

Members of the Forkhead box (FOX) gene superfamily have been widely associated with organismal development and are identified by their evolutionarily conserved forkhead DNA-binding domain (Lam ; Morris and Fanucchi, 2016). The FOXP subfamily is probably the most extensively characterized subfamily of the forkhead superfamily. The four FOXP genes (FOXP1, FOXP2, FOXP3, and FOXP4) emerged by duplication events during the origin of vertebrates (Santos ; Song ). Since the duplication events, paralogues FOXP1, FOXP2, and FOXP4 have played an important role in brain, lung, heart, and jaw development in vertebrates, while FOXP3 has been associated with the development and homeostasis of the immune system, since it is described as a master-regulator of CD4+ and CD25+ T-cells (Coffer and Burgering, 2004; Akbar ; Takahashi ; Benayoun ; Andersen ; Lam ; Cesario ). Undoubtedly, the most widely known member of the FOXP subfamily is FOXP2, as it has attracted the attention of the scientific community and the general media because of its role in the evolution of speech and vocalization in mammals (Zhang ; Li ), especially because mutations in this gene cause severe impairment of articulation and grammar in humans (Enard ; Schön ; Enard, 2011; Bowers and Konopka, 2012). FOXP2 is expressed primarily in the brain, where it plays an important role in synapse formation and cell adhesion, as well as in the specification and differentiation of the lung epithelium and gastrointestinal and cardiovascular tissues (Song ). Evolutionary studies have been successively improved by incorporating new methodological approaches. Analysis of intrinsically disordered regions (IDRs), which is routinely used in medical and structural biology studies, can also be applied in evolutionary studies because of the possible role of IDRs in the evolvability (evolutionary capacity; Pigliucci, 2008; Xue , 2013) of genetic systems (Neduva and Russell, 2005).
IDRs are protein segments rich in hydrophilic, polar, and charged amino acids (glutamine, serine, glutamic acid, arginine, and lysine), as well as glycine, proline, and alanine (Iakoucheva ; Liu ). IDRs are prevalent in proteins that exhibit multiple physical interactions and play critical roles in various biological processes, including regulation and signaling (Dunker ; Nguyen Ba ; Forman-Kay and Mittag, 2013). The conformational flexibility of IDRs facilitates exposure of specific residues for modification and binding to other proteins and molecules (Huang and Sarai, 2012; Liu and Huang, 2014). Thus, intrinsically disordered proteins (IDPs) are characterized by a high IDR content and the absence of stable well-folded three-dimensional structures in solutions (Forman-Kay and Mittag, 2013). Short linear motifs (SLIMs) are short stretches in protein sequences that mediate protein-protein interactions. SLIMs are typically 2–10 amino acids long; however, only two or three amino acids are essential for interaction with other molecules. SLIMs are common elements in IDRs, and they probably play a significant role in the functioning of these disordered regions (Wagner and Lynch, 2008; Huang and Sarai, 2012; Nguyen Ba ; Forman-Kay and Mittag, 2013; Liu and Huang, 2014). The presence of a great number of these motifs in such regions probably confers functional flexibility to this class of proteins (Gould ; Disfani ; Dinkel , 2014). Furthermore, SLIMs are particularly evolvable because they are poorly conserved between lineages and can appear and disappear through small changes (Wagner and Lynch, 2008). Therefore, changes in SLIMs significantly impact complex regulatory networks (Neduva and Russell, 2005). Thus, analysis of these changes enables the assessment of their importance in the evolutionary trajectory of animals. 
In addition to the forkhead, leucine-zipper, and zinc-finger domains, other molecular elements such as IDRs may play crucial roles in the function of FOXP proteins. However, these structures have not been studied extensively. Thus, the present investigation asks how FOXP structural forms changed throughout Tetrapoda evolution with regard to linear motif composition and disorder content. Furthermore, as FOXP3 is the only member of the FOXP family known to play a role in the immune system, we investigated whether it shows a higher evolutionary rate than the other FOXPs, and whether such a rate could be related to a higher disorder content.

Material and Methods

Seventy-seven orthologous FOXP genes/proteins from tetrapods (Table S1) were considered in the present study. FOXP nucleotide sequences were retrieved from the NCBI database using BLASTN with 20,000 Max target sequences. We also used the Ensembl genome database (http://ensembl.org/) for sequence retrieval. The Neanderthal exome (Castellano ; http://cdna.eva.mpg.de/) was consulted to verify possible specific changes within the genus Homo. One protein-coding gene may encode more than one isoform. The presence of many isoforms in the FOX genes, caused by alternative splicing, was handled conservatively by choosing only isoforms that clearly resemble the canonical form identified in humans by using UniProt (http://www.uniprot.org/). Incomplete sequences were removed from the analysis. Subsequently, the sequences were aligned using the MAFFT algorithm (standard settings) implemented in the Guidance web server (http://guidance.tau.ac.il/). The alignments are available in the Supplementary Material. Phylogenetic trees were drawn using FigTree 1.4.2 (http://tree.bio.ed.ac.uk/software/figtree/) according to the literature (Meredith ; Perelman ; Song ). Importantly, while both FOXP2 and FOXP4 went through a standard NSsites site-model analysis, for FOXP3 and FOXP1 we had to use modified data sets. Because of the absence of several base pairs in Xenopus laevis FOXP1, we excluded this species. For FOXP3, only the mammalian sequences were used because reptilian and amphibian FOXP3 are shorter and very different, while in birds, FOXP3 is completely absent (Andersen ). In addition, we removed from the analysis a residual N-terminal part of FOXP3 present only in the mammals Nomascus leucogenys, Papio anubis, Chlorocebus sabaeus, Callithrix jacchus, Cricetulus griseus, Panthera tigris, Myotis brandtii, Pteropus alecto, Chrysochloris asiatica, and Dasypus novemcinctus, as it does not align with or resemble other orthologs and known isoforms.
We predicted disordered regions by using the PONDR-FIT metapredictor (Xue ). Additionally, the MobiDB server (Potenza ) was consulted to check consensus predictions of disorder content, as provided by a variety of disorder predictors. SLIMs were predicted using the ELM web server (Dinkel , 2014), considering only the cell nucleus as the cell compartment for the biochemical interaction context of FOXP proteins. Given that the linear motifs predicted by ELM can present a high rate of false positives, we considered only ELM motifs located in IDRs and validated such predictions by analyzing the literature on the interactions between linear motifs and their ligands with other transcription factors. Therefore, we considered only linear motifs with confirmed experimental data and/or certainty in the ELM reliability annotation. All information regarding the linear motifs was retrieved from the ELM server and from the literature. The ELM server classifies SLIMs into the following four types: protease cleavage sites, protein motif interaction/binding sites, posttranslational modification sites, and subcellular targeting signals (Dinkel ). Linear motifs present in the forkhead, leucine-zipper, and zinc-finger domains were not considered because they can represent false positives. Statistical tests comparing sites under purifying selection and/or positive selection inside and outside disordered regions were performed using WinPepi and SPSS 2.0. To estimate the molecular evolutionary patterns of FOXP1, FOXP2, FOXP3, and FOXP4, we applied phylogeny-based maximum likelihood analysis of ω (non-synonymous/synonymous rate ratio or dN/dS) implemented in the PAML 4.7 package (Yang, 2007). This approach allows the ω ratio to vary among sites while considering several different codon substitution models. A value of ω < 1 indicates potential negative selection, while ω = 1 indicates neutrality, and ω > 1 indicates positive selection.
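The disorder-content and motif-filtering steps described above can be sketched as follows. This is a minimal illustration, not the pipeline used in the study: the per-residue scores, the motif coordinates, and the function names are hypothetical, and the 0.5 cutoff for calling a residue disordered is an assumption made here.

```python
from typing import List, Tuple

# Assumption: a residue counts as disordered when its predicted score is >= 0.5.
DISORDER_THRESHOLD = 0.5

Motif = Tuple[str, int, int]  # (motif name, 1-based start, 1-based end)


def disorder_fraction(scores: List[float], threshold: float = DISORDER_THRESHOLD) -> float:
    """Fraction of residues whose per-residue score meets the threshold."""
    if not scores:
        raise ValueError("empty score list")
    return sum(s >= threshold for s in scores) / len(scores)


def disordered_segments(scores: List[float],
                        threshold: float = DISORDER_THRESHOLD) -> List[Tuple[int, int]]:
    """Return 1-based inclusive (start, end) runs of consecutive disordered residues."""
    segments = []
    start = None
    for i, s in enumerate(scores, start=1):
        if s >= threshold and start is None:
            start = i
        elif s < threshold and start is not None:
            segments.append((start, i - 1))
            start = None
    if start is not None:
        segments.append((start, len(scores)))
    return segments


def motifs_in_idrs(motifs: List[Motif], segments: List[Tuple[int, int]]) -> List[Motif]:
    """Keep only motifs that lie entirely inside one disordered segment."""
    return [m for m in motifs
            if any(a <= m[1] and m[2] <= b for a, b in segments)]


# Hypothetical toy data, for illustration only.
scores = [0.9, 0.8, 0.7, 0.2, 0.1, 0.3, 0.6, 0.9, 0.95, 0.4]
motifs = [("MOD_CK1_1", 1, 3), ("MOD_GSK3_1", 3, 6), ("DOC_USP7_1", 7, 9)]
segments = disordered_segments(scores)
kept = motifs_in_idrs(motifs, segments)
```

Excluding motifs that overlap the forkhead, leucine-zipper, or zinc-finger domains would be one more interval filter of the same kind.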
For the NSsites codon substitution models, likelihood ratio tests (LRTs) were performed between neutral models (M1a, nearly neutral; M8a, Beta and ω = 1) and models that allow positive selection and/or relaxation of functional constraints (M2a, positive selection; M8, Beta + Selection). Using the log-likelihood values from models M1a, M2a, M8a, and M8, we applied an LRT using HyPhy 2.2.0. The Branch Site Model was also used to detect whether different linear motif compositions and disorder scores are reflected in different evolutionary rates among Tetrapoda. The phylogeny was a priori divided into two clades, and an LRT was used to evaluate divergences in selective pressures between them, as indicated by different ω ratios. We employed the clade model type D that assumes two site classes, which was compared with the neutral model M1a by an LRT with two degrees of freedom. A Bayes empirical Bayes (BEB) approach was considered using CODEML in PAML 4.7 to verify which sites could be under neutral, purifying, or positive selection. The phylogenetic trees used to construct the PAML 4.7 input files were revised as described previously (Meredith ; Perelman ; Song ).
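The likelihood ratio test described above compares twice the difference in log-likelihood between nested models against a chi-square distribution. The sketch below shows that arithmetic only; the log-likelihood values are hypothetical, the function names are ours, and it does not reproduce the HyPhy or PAML implementations.

```python
import math


def lrt_statistic(lnl_null: float, lnl_alt: float) -> float:
    """Likelihood ratio statistic 2 * (lnL_alt - lnL_null), clipped at zero."""
    return max(0.0, 2.0 * (lnl_alt - lnl_null))


def chi2_sf(x: float, df: int) -> float:
    """Upper-tail chi-square probability using closed forms for df = 1 and df = 2."""
    if x < 0:
        raise ValueError("statistic must be non-negative")
    if df == 1:
        return math.erfc(math.sqrt(x / 2.0))
    if df == 2:
        return math.exp(-x / 2.0)
    raise ValueError("this sketch only supports df = 1 or df = 2")


# Hypothetical log-likelihoods for a null model (e.g. M1a) and an alternative (e.g. M2a).
lnl_m1a = -5000.0
lnl_m2a = -4990.0
stat = lrt_statistic(lnl_m1a, lnl_m2a)
p_value = chi2_sf(stat, df=2)  # two degrees of freedom, as in the comparison above
```

Only the one- and two-degree-of-freedom cases are covered because they have simple closed forms; a general implementation would need the regularized incomplete gamma function.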

Results and Discussion

FOXP1, FOXP2, FOXP3 and FOXP4 structures and their intrinsic protein disorder content

Our analyses revealed that the three paralogous proteins with similar functions and tissue expression, FOXP1, FOXP2, and FOXP4, had high and similar disorder contents (~70%). In contrast, FOXP3, which plays a role in immune system regulation, presented a lower degree of disorder (~30%) relative to its paralogs (Tables 1, S2-S5), according to PONDR-FIT. The patterns of the disordered and ordered regions, as well as the disorder proportion of orthologous proteins, are relatively conserved among taxonomic groups (Tables 1, S2-S5). However, mammals presented a higher degree of protein disorder than all other organisms for FOXP1 and FOXP4 (P < 0.001, Table 1). In particular, amphibians presented a lower degree of disorder for FOXP2 (~64%, Tables 1 and S3.1) than the other classes (P < 0.01, Table 1). These FOXP disorder prediction values are, in general, higher than those obtained by other authors (Andersen ), but they used only partial proteins and fewer species. Notably, the larger mammalian sample compared to non-mammals may have contributed to these statistical differences in the protein disorder content analysis.
Table 1

Mean disorder proportion for FOXP proteins by class (1).

Class        FOXP2    FOXP4    FOXP1    FOXP3 (2)
Mammals      0.7011   0.7321   0.6915   0.3065
Birds        0.7039   0.6858   0.6782
Reptiles     0.6984   0.6827   0.6713
Amphibians   0.6305   0.7068   NA

(1) Mammals showed significantly higher proportions than the other groups, as assessed by the Kruskal-Wallis test, for FOXP1 and FOXP4 (P < 0.001). Additionally, according to the same test, amphibians presented a lower degree of disorder for FOXP2 (P < 0.01).

(2) Only mammalian genes were used for the FOXP3 analysis.

NA: Not available. Since several base pairs in the Xenopus laevis FOXP1 sequence are missing, we excluded it from the analysis.

Interestingly, our data reveal that mammals present significantly higher FOXP1 and FOXP4 disorder degrees than the other groups. This finding may be associated with the more complex interaction networks present in mammals, as already proposed for other genetic systems (Disfani ), and with a positive correlation between the number of binding partners and disorder scores (Dunker ). Thus, it is reasonable to speculate that mammalian FOXP1 and FOXP4 have a larger number of binding partners than the other orthologues investigated here.
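The between-class comparison of disorder proportions relies on the Kruskal-Wallis test. As a rough sketch of how its H statistic is obtained from ranks, the code below uses hypothetical per-species disorder proportions (not the values behind Table 1), average ranks for ties, and no tie correction; the function name is ours.

```python
from typing import Dict, List


def kruskal_wallis_h(groups: Dict[str, List[float]]) -> float:
    """Kruskal-Wallis H statistic using average ranks for ties (no tie correction)."""
    pooled = sorted((value, name) for name, values in groups.items() for value in values)
    n_total = len(pooled)
    rank_sums = {name: 0.0 for name in groups}
    i = 0
    while i < n_total:
        j = i
        # Find the run of tied values starting at position i.
        while j < n_total and pooled[j][0] == pooled[i][0]:
            j += 1
        avg_rank = (i + 1 + j) / 2.0  # mean of the 1-based ranks i+1 .. j
        for k in range(i, j):
            rank_sums[pooled[k][1]] += avg_rank
        i = j
    weighted = sum(rank_sums[name] ** 2 / len(values) for name, values in groups.items())
    return 12.0 / (n_total * (n_total + 1)) * weighted - 3.0 * (n_total + 1)


# Hypothetical disorder proportions per species, grouped by class.
example = {
    "mammals": [0.70, 0.72, 0.74],
    "birds": [0.66, 0.68],
    "reptiles": [0.65, 0.67],
}
h_stat = kruskal_wallis_h(example)
```

With k groups, H is compared against a chi-square distribution with k - 1 degrees of freedom; statistical packages also apply a tie correction that this sketch omits.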

FOXP1, FOXP2, FOXP3, and FOXP4 and their interaction sites

Usually, intrinsically disordered proteins are enriched with SLIMs, which play crucial roles in their interaction with other proteins (Tables S6.1-6.4). Here we will briefly describe some selected representative results of the SLIMs compositional analysis. For FOXP1, some of our findings include a Polo-like kinase 1 (PLK) phosphorylation site at position 33 (MOD_PLK), which differentiates Sauropsida (reptiles and birds) from mammals (Table 2). PLK is involved in cell cycle events (Nakajima ; Murakami ), suggesting some differences in the FOXP1 phosphorylation pattern during the cell cycle between mammals and Sauropsida.
Table 2

Linear motif changes in representative species of Tetrapoda, as predicted by ELM.

Species columns, in order: Homo sapiens, Pan troglodytes, Pan paniscus, Mus musculus, Taeniopygia guttata, Serinus canaria, Anolis carolinensis, Xenopus laevis.

Gene    Aligned position   Nucleotide    Amino acid    Grantham score   Species states
FOXP1   33                 GGT -> AGT    Gly -> Ser    56               00 *
                           GGT -> GCA    Gly -> Ala    60               1
                           GGT -> AGC    Gly -> Ser    56               1p,r,v 1p,r,v
                           GGT -> GGC    Gly -> Gly    Syn              1p,r,v
FOXP2   314                GCA -> GCG    Ala -> Ala    Syn              000
                           GCG -> CCA    Ala -> Pro    27
                           GCG -> TCT    Ala -> Ser    99               1d 1d 1d 1d,s
                           GCG -> CCC    Ala -> Pro    27
        368                AAC -> ACC    Asn -> Thr    65               0o4,q3 1o4,q4 1o4,q4 1o4,q4 1o4,q4 1o4,q4 1o4,q4
        390                AGT -> AAT    Ser -> Asn    46               0o,q 111110o
FOXP4   408                CCG -> CCA    Pro -> Pro    Syn              001
                           CCG -> CTG    Pro -> Leu    98               1i 1i 1i
                           CCG -> TTG    Pro -> Leu    98               1i
        689                TCG -> TCA    Ser -> Ser    Syn              0d 0d
                           TCG -> TTG    Ser -> Leu    145
                           TCG -> GTG    Ser -> Val    124              1
                           TCG -> ACA    Ser -> Thr    58               1k
                           TCG -> ACG    Ser -> Thr    58               1k 1k
                           TCG -> GTC    Ser -> Val    124              1j

* indicates gap.

Syn = synonymous changes.

Zero (0) indicates the amino acid present in the Homo sapiens reference sequence, whereas 1 indicates a variant amino acid. Subscript letters indicate the predicted presence of specific Eukaryotic Linear Motifs (see code shown in Table S9). Subscript numbers give the number of times each SLIM appeared.

The direction of the changes does not represent an ancestor-descendant relationship. Grantham scores are classified as conservative (0-50), moderately conservative (51-100), moderately radical (101-150), or radical (≥ 151).
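The Grantham score classes used in the table notes can be expressed as a small lookup. This is a sketch only: the function name is ours, and treating every score above 150 as radical is an assumption about the upper class boundary.

```python
def grantham_class(score: int) -> str:
    """Map a Grantham score to the four classes used in the table notes.

    Assumption: any score above 150 is treated as radical.
    """
    if score < 0:
        raise ValueError("Grantham scores are non-negative")
    if score <= 50:
        return "conservative"
    if score <= 100:
        return "moderately conservative"
    if score <= 150:
        return "moderately radical"
    return "radical"


# Scores taken from Table 2: Gly -> Ser (56), Ala -> Pro (27), Ser -> Leu (145).
table2_examples = {56: grantham_class(56), 27: grantham_class(27), 145: grantham_class(145)}
```

Under this scheme, none of the substitutions listed in Table 2 falls in the radical class.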

In the case of FOXP2 (Table 2), mammals have lost one DOC_USP7_1 motif at position 314 due to a serine to alanine change; this motif, which interacts with the deubiquitinating enzyme USP7/HAUSP (herpes virus-associated ubiquitin-specific protease), is present in all non-mammals. Previous studies have demonstrated that the interaction of USP7 with FOX members regulates oxidative stress responses through ubiquitination (van der Horst ). Thus, the possible loss of DOC_USP7_1 in mammals could have a functional implication related to the response to oxidative stress. Two known non-synonymous substitutions between humans (Homo sapiens and Neanderthals) and chimpanzees (FOXP2 Asn325Ser and Thr303Asn) deserve additional attention, since they have been related to human speech (Enard , Krause ). One of them (Asn325Ser) promotes the gain of two motifs, MOD_CK1_1 and MOD_GSK3_1, in humans due to the presence of a serine at aligned position 390 (Table 2). Both motifs are sites of phosphorylation by kinases. Interestingly, carnivores also have a serine at this FOXP2 ortholog position (Zhang ), which represents a convergent emergence of the MOD_CK1_1 and MOD_GSK3_1 motifs observed in humans. Cooper (2006) suggested that phosphorylation by kinase C in this FOXP2 region may be related to human behavioral traits such as language.
However, the other Homo-specific substitution at aligned position 368 (Thr303Asn) led to the loss of a phosphorylation site. Changes in phosphorylation patterns can modulate the regulation of transcription factors and their binding affinity to co-activators and DNA. These changes can in turn alter gene expression, cell growth, and differentiation (Iakoucheva ). Thus, our results have one very relevant implication: the loss of this phosphorylation site at position 368/303 may have been as important as the gain of the phosphorylation site at position 390/325 for the evolution of human speech. The phenotypic implication of the presence of these SLIMs in carnivores is unknown. For FOXP3, which was only investigated in mammals (see Material and Methods section), a CK1 phosphorylation site (MOD_CK1_1) is predicted at position 194 (Table 3) for several mammal species, except New World (NW) monkeys (Saimiri boliviensis and Callithrix jacchus) and Tarsius syrichta. Interestingly, these primates present four other linear motifs in this region: MOD_GSK3_1, MOD_ProDKin_1, DEG_SCF_FBW7_1, and DOC_WW_Pin1_4. Therefore, we identified the presence of the same SLIMs in two distinct branches of primates (New World monkeys and Tarsiidae) that live in somewhat similar rainforest environments. As mentioned before, FOXP3 is the only FOXP member playing a role in the immune system, suggesting that at least one of these motifs is associated with the immune response, indicating adaptation through convergence or the maintenance of a primate ancestral state.
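The gain or loss of a phosphorylation motif through a single amino acid substitution, as discussed above for FOXP2, can be illustrated with regular expressions. The patterns and peptides below are hypothetical and deliberately simplified; they are not the ELM motif definitions and not FOXP sequences.

```python
import re
from typing import Dict, List, Set

# Hypothetical, simplified patterns for illustration only (NOT the ELM definitions).
PATTERNS: Dict[str, str] = {
    "primed_site": r"S..[ST]",       # a Ser/Thr three residues after a serine
    "pro_directed_site": r"[ST]P",   # a Ser/Thr immediately followed by proline
}


def motif_hits(seq: str, patterns: Dict[str, str]) -> Set[str]:
    """Names of the patterns that match anywhere in the sequence."""
    return {name for name, rx in patterns.items() if re.search(rx, seq)}


def substitute(seq: str, pos: int, new: str) -> str:
    """Replace the residue at a 1-based position with a new residue."""
    if not 1 <= pos <= len(seq):
        raise ValueError("position outside sequence")
    return seq[:pos - 1] + new + seq[pos:]


def motif_changes(seq: str, pos: int, new: str,
                  patterns: Dict[str, str] = PATTERNS) -> Dict[str, List[str]]:
    """Compare motif content before and after a single substitution."""
    before = motif_hits(seq, patterns)
    after = motif_hits(substitute(seq, pos, new), patterns)
    return {"gained": sorted(after - before), "lost": sorted(before - after)}


# Hypothetical peptides: an Asn -> Ser change creates a site, a Thr -> Asn change removes one.
gain_example = motif_changes("QSPANQTPA", 5, "S")
loss_example = motif_changes("AATPQA", 3, "N")
```

The same comparison applied per species along an alignment column would give presence and absence calls of the kind tabulated for each motif.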
Table 3

FOXP3 linear motif changes in mammals, as predicted by ELM.

Aligned position: 194
Nucleotide:       GTG -> ATG   GTG -> ACA   GTG -> TTG   GTG -> GGG   GTG -> GCA   GTG -> GCG   GTG -> ACG
Amino acid:       Val -> Met   Val -> Thr   Val -> Leu   Val -> Gly   Val -> Ala   Val -> Ala   Val -> Thr
Grantham score:   21           69           32           109          64           64           69
Homo sapiens 0o
Pan troglodytes 0o
Pan paniscus 0o
Gorilla gorilla 0o
Pongo abelii 0o
Pongo pygmaeus 0o
Hylobates lar 0o
Nomascus 0o
Macaca mulatta 1o
Papio anubis 1o
Chlorocebus sabaeus 1o
Saimiri boliviensis 1b e o q2 w
Callithrix jacchus 1b e o q2 w
Galeopterus variegatus 0o
Tarsius syrichta 1b e o q2 w
Tupaia chinensis 1m3 j o
Sorex araneus 1o
Mus musculus 1c
Cricetulus griseus 1c
Rattus norvegicus 1c
Octodon degus 1o
Oryctolagus cuniculus 1o
Ochotona princeps 1o
Physeter catodon 1o
Orcinus orca 1o
Camelus ferus 0o
Bos taurus 0o
Equus caballus 1o
Ailuropoda melanoleuca 0o
Felis catus 0o
Canis lupus familiaris 0
Vicugna pacos 0o
Panthera tigris 0o
Mustela putorius furo 0o
Odobenus rosmarus 0o
Leptonychotes weddellii 0o
Ceratotherium simum 1b e o q2 w
Eptesicus fuscus 0o
Myotis brandtii 0o
Pteropus alecto 1s
Condylura cristata 1o
Chrysochloris asiatica 1c
Erinaceus europaeus 1
Echinops telfairi 1b e o q2 w
Orycteropus afer afer 1b e o q2 w
Loxodonta africana 1f t v
Trichechus manatus 1e w
Dasypus novemcinctus 1o2 s

* indicates gap. Syn = synonymous change. Zero (0) indicates the amino acid present in the Homo sapiens reference sequence, whereas 1 indicates a variant amino acid. Subscript letters indicate the predicted presence of specific Eukaryotic Linear Motifs (see code shown in Table S9). The direction of the changes does not represent an ancestor-descendant relationship. Grantham scores are classified as conservative (0-50), moderately conservative (51-100), moderately radical (101-150), or radical (≥ 151).

Another interesting finding is the sharing of the linear motif LIG_PTAP_UEV_1 between Neanderthals and modern humans due to the Gly175Ser (human position) mutation (Table 4). It has also been suggested that linear motifs mediate interactions between viruses and their hosts (Hagai ). In fact, LIG_PTAP_UEV_1 mediates the binding of several cellular and viral proteins to the UEV domain of the class E vacuolar sorting protein Tsg101 (Göttlinger ), and it is essential for the efficient egress of viral particles from many enveloped RNA viruses (Bieniasz, 2006). Our results indicate that this motif may have played an important role in the immune defense of Homo during the Pleistocene.
Table 4

FOXP3-specific changes in primates.

Organisms                Aligned position   Human position   AA change    Motifs (1)
Neanderthal and Humans   140                132              Pro -> Thr   (+2) DEG_SCF_FBW7_1
                         183                175              Gly -> Ser   (+) LIG_PTAP_UEV_1
Neanderthal              192                184              Ser -> Leu   (-) MOD_CK1_1, (+) DOC_MAPK_1
Catarrhini               278                270              Pro -> Ser   (+) MOD_GSK3_1
Haplorhini (2)           82                 74               Val -> Leu   (-) DOC_WW_Pin1_4, (-) MOD_ProDKin_1
                         97                 89               Ser -> Leu
                         129                121              Arg -> His
                         132                124              Asp -> Glu
                         181                173              Ser -> Asn   (-) DOC_WW_Pin1_4, (-) MOD_ProDKin_1
                         246                238              Val -> Met
                         262                254              Gly -> Ser
                         338                325              Phe -> Leu
                         424                411              Phe -> Leu

(1) +: change causes motif gain; -: change causes motif loss.

(2) Excluding Tarsius syrichta.

Regarding FOXP4, a striking difference between mammals and Sauropsida (birds and reptiles) was also found (Table 2). For instance, the loss of LIG_CtBP_PxDLS_1 in mammals is due to the replacement of leucine by proline at aligned position 408, probably after the divergence of Synapsida and Sauropsida. Mendoza showed that the presence of the CtBP binding region in the bird Taeniopygia guttata has been associated with the potential FOXP4 regulation capacity. This finding for CtBP interaction may be associated with an enhanced potential for transcriptional repression by FOXP4, as known for FOXP1 and FOXP2 (Mendoza ). At aligned position 689, almost all non-mammals present a motif that interacts with FHA (LIG_FHA_1 or LIG_FHA_2), while mammals present a DOC_USP7_1 motif. To better understand the role of SLIMs in evolution, we additionally compared members within the FOXP family to verify the number of unique linear motifs in each paralog (Table 5). The number of predicted types of SLIMs ranges from 28 (FOXP2) to 39 (FOXP4). Furthermore, FOXP3 presents three unique motifs (DOC_PP2B_1, TRG_NLS_MonoCore_2, and TRG_NLS_MonoExtN_4), FOXP1 presents four (DEG_SCF_FBW7_2, LIG_PCNA_PIPBox_1, LIG_WD40_WDR5_1, and TRG_NES_CRM1_1), while FOXP2 presents no unique SLIM. FOXP4 presents six unique motifs, among which three (DOC_PP1_RVXF_1, LIG_BRCT_BRCA1_2, and TRG_NLS_MonoExtC_3) are common to almost all species investigated in the current study.
Table 5

Number of shared and unique short linear motifs (SLIMs) among Tetrapoda FOXPs.

Protein   Total types of SLIMs   Number of unique SLIMs   Total SLIMs in Homo sapiens   Total SLIMs in Pan sp.   Total SLIMs in Serinus canaria (1)   Total of species compared
FOXP1     34                     4 (2)                    132                           132                      135                                  50
FOXP2     28                     0                        143                           142                      140                                  54
FOXP3     32                     3 (3)                    69                            62                       -                                    57
FOXP4     39                     6 (4)                    142                           143                      160                                  65

(1) Bird, representing Sauropsida.

(2) DEG_SCF_FBW7_2, LIG_PCNA_PIPBox_1, LIG_WD40_WDR5_1, and TRG_NES_CRM1_1.

(3) DOC_PP2B_1, TRG_NLS_MonoCore_2, and TRG_NLS_MonoExtN_4.

(4) FOXP4 presents six motifs, among which three (DOC_PP1_RVXF_1, LIG_BRCT_BRCA1_2, and TRG_NLS_MonoExtC_3) are common to almost all species investigated in the current study.
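Counting the motifs unique to each paralog amounts to a set difference between one protein's motif types and the union of the others. The sketch below uses a small hypothetical subset of motif names per protein, not the full inventories summarized in Table 5, and the function name is ours.

```python
from typing import Dict, Set


def unique_motifs(motifs_by_protein: Dict[str, Set[str]]) -> Dict[str, Set[str]]:
    """For each protein, return the motif types found in no other protein."""
    unique = {}
    for protein, motifs in motifs_by_protein.items():
        others = set().union(
            *(m for p, m in motifs_by_protein.items() if p != protein)
        )
        unique[protein] = motifs - others
    return unique


# Hypothetical toy inventories (a few motif names per paralog, for illustration).
toy_inventory = {
    "FOXP1": {"MOD_CK1_1", "MOD_GSK3_1", "TRG_NES_CRM1_1"},
    "FOXP2": {"MOD_CK1_1", "MOD_GSK3_1"},
    "FOXP3": {"MOD_CK1_1", "DOC_PP2B_1"},
    "FOXP4": {"MOD_GSK3_1", "DOC_PP1_RVXF_1"},
}
toy_unique = unique_motifs(toy_inventory)
```

In this toy example FOXP2 ends up with an empty set, mirroring the situation of a paralog whose motif types are all shared with other family members.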


Molecular evolutionary patterns

Evolutionary tests for FOXP1, FOXP2, FOXP3, and FOXP4 considering all the tetrapod species investigated in this study indicated that the best log-likelihood model is M1a, which assumes site classes under purifying selection (ω < 1) and neutral evolution (ω = 1). For FOXP1, FOXP2, and FOXP4, more than 95% of the sites have ω = 0.03066, 0.01965, and 0.02778, respectively (Table S7), indicating a strong role for purifying selection. In FOXP3, 10% of the sites have ω equal to 1, which indicates neutral molecular evolution and/or relaxation of functional constraints. Additionally, we used the results from the Bayes Empirical Bayes (BEB) test to calculate the posterior probabilities that each codon is under positive selection (Yang, 2007). The BEB values are only meaningful for M2a and M8 (which include such selection); therefore, this last strategy was only adopted to detect potential functional sites. Such analysis showed four sites in mammals with ω > 1 and probability > 91%, but the p value was not significant (Table S7). Regardless, it is important to highlight that one of the sites inferred with ω = 1.06 (probability = 98.9%) is located at position 194 of FOXP3 (Table 3). This position presents differences in SLIM prediction (MOD_GSK3_1, MOD_ProDKin_1, MOD_CK1_1, DEG_SCF_FBW7_1, and DOC_WW_Pin1_4) in Saimiri boliviensis, Callithrix jacchus, and Tarsius syrichta when compared with the other species. Saimiri boliviensis and Callithrix jacchus probably share the same linear motifs because of their clear and relatively recent common origin, but Tarsius syrichta, which is phylogenetically more distant, may present them because of convergent evolution (Tables S6.3 and 3). MOD_ProDKin_1 is a post-translational modification site phosphorylated by a MAP kinase, while DEG_SCF_FBW7_1 is a degradation site mediated by an important protein complex (Skp, Cullin, F-box containing complex or SCF) that plays a role in checkpoints during the cell cycle (Nguyen Ba ).
DOC_WW_Pin1_4 interacts with the enzyme Pin1, whose function is also associated with the cell cycle, among others. Additionally, Pin1 regulates the immune response (Gavva ; Wulf ; Wijchers ; Saxena ), which is a known function of FOXP3. Again, as we identified the presence of the same SLIMs in two distinct primate branches that live in similar environments (rainforest), this allows us to infer that a simple neutral model is insufficient to explain this scenario. In the case of FOXP4 (Table S8), the Branch Site model indicated that mammals have an ω value 3.7 times higher than non-mammals (0.66102 versus 0.18012), a result compatible with relaxation of evolutionary pressures. This striking difference (p < 0.001) may be attributed to certain changes, such as the absence of the interaction site for CtBP (LIG_CtBP_PxDLS_1) in all mammals (except Sus scrofa). Another structural/functional change that can explain the distinct ω values observed between mammals and non-mammals is the presence of a glutamine-rich region in mammalian FOXP4, associated with its repression ability.

Conclusion

Our study reveals some important general and more specific findings. For instance, a disorder content of approximately 70% has been retained in FOXP1, FOXP2, and FOXP4 orthologs. Some of the results obtained can be associated with taxa-specific conditions, while others may represent molecular convergence. In fact, we found changes at FOXP3 sites with possible functional implications in the primate branch, including the genus Homo. Finally, the FOXP1 and FOXP4 results show intriguing differences between mammals and non-mammals, suggesting their role in the emergence of adaptive novelty within the taxon Tetrapoda. Our results indicate that part of the FOXP evolutionary “stability” over a long evolutionary period may be attributed to the maintenance of a similar proportion of disordered regions, but not to amino acid content or linear motifs. Moreover, some of the changes can be interpreted as indicating taxa-specific adaptations, since they are probably functional.
References

1. David L Cooper. Broca's arrow: evolution, prediction, and language in the brain. Anat Rec B New Anat. 2006-01. Review.

2. Julie D Forman-Kay; Tanja Mittag. From sequence and forces to structure, function, and evolution of intrinsically disordered proteins. Structure. 2013-09-03.

3. Sergi Castellano; Genís Parra; Federico A Sánchez-Quinto; Fernando Racimo; Martin Kuhlwilm; Martin Kircher; Susanna Sawyer; Qiaomei Fu; Anja Heinze; Birgit Nickel; Jesse Dabney; Michael Siebauer; Louise White; Hernán A Burbano; Gabriel Renaud; Udo Stenzel; Carles Lalueza-Fox; Marco de la Rasilla; Antonio Rosas; Pavao Rudan; Dejana Brajković; Željko Kucan; Ivan Gušic; Michael V Shunkov; Anatoli P Derevianko; Bence Viola; Matthias Meyer; Janet Kelso; Aida M Andrés; Svante Pääbo. Patterns of coding variation in the complete exomes of three Neandertals. Proc Natl Acad Sci U S A. 2014-04-21.

4. Robert W Meredith; Jan E Janečka; John Gatesy; Oliver A Ryder; Colleen A Fisher; Emma C Teeling; Alisha Goodbla; Eduardo Eizirik; Taiz L L Simão; Tanja Stadler; Daniel L Rabosky; Rodney L Honeycutt; John J Flynn; Colleen M Ingram; Cynthia Steiner; Tiffani L Williams; Terence J Robinson; Angela Burk-Herrick; Michael Westerman; Nadia A Ayoub; Mark S Springer; William J Murphy. Impacts of the Cretaceous Terrestrial Revolution and KPg extinction on mammal diversification. Science. 2011-09-22.

5. Zhirong Liu; Yongqi Huang. Advantages of proteins being disordered. Protein Sci. 2014-03-17. Review.

6. Gerburg M Wulf; Yih-Cherng Liou; Akihide Ryo; Sam W Lee; Kun Ping Lu. Role of Pin1 in the regulation of p53 stability and p21 transactivation, and cell cycle checkpoints in response to DNA damage. J Biol Chem. 2002-10-17.

7. Sen Song; Liang Liu; Scott V Edwards; Shaoyuan Wu. Resolving conflict in eutherian mammal phylogeny using phylogenomics and the multispecies coalescent model. Proc Natl Acad Sci U S A. 2012-08-28.

8. Hiroshi Murakami; Hirofumi Aiba; Makoto Nakanishi; Yuko Murakami-Tonami. Regulation of yeast forkhead transcription factors and FoxM1 by cyclin-dependent and polo-like kinases. Cell Cycle. 2010-08-02. Review.

9. Bin Xue; Robert W Williams; Christopher J Oldfield; A Keith Dunker; Vladimir N Uversky. Archaic chaos: intrinsically disordered proteins in Archaea. BMC Syst Biol. 2010-05-28.

10. Johannes Krause; Carles Lalueza-Fox; Ludovic Orlando; Wolfgang Enard; Richard E Green; Hernán A Burbano; Jean-Jacques Hublin; Catherine Hänni; Javier Fortea; Marco de la Rasilla; Jaume Bertranpetit; Antonio Rosas; Svante Pääbo. The derived FOXP2 variant of modern humans was shared with Neandertals. Curr Biol. 2007-10-18.
