| Literature DB >> 28257525 |
Lucas Henriques Viscardi1, Luciana Tovo-Rodrigues2, Pamela Paré1, Nelson Jurandi Rosa Fagundes1, Francisco Mauro Salzano1, Vanessa Rodrigues Paixão-Côrtes3, Claiton Henrique Dotto Bau1, Maria Cátira Bortolini1.
Abstract
The FOXP subfamily is probably the most extensively characterized subfamily of the forkhead superfamily, playing important roles in development and homeostasis in vertebrates. Intrinsically disorder protein regions (IDRs) are protein segments that exhibit multiple physical interactions and play critical roles in various biological processes, including regulation and signaling. IDRs in proteins may play an important role in the evolvability of genetic systems. In this study, we analyzed 77 orthologous FOXP genes/proteins from Tetrapoda, regarding protein disorder content and evolutionary rate. We also predicted the number and type of short linear motifs (SLIMs) in the IDRs. Similar levels of protein disorder (approximately 70%) were found for FOXP1, FOXP2, and FOXP4. However, for FOXP3, which is shorter in length and has a more specific function, the disordered content was lower (30%). Mammals showed higher protein disorders for FOXP1 and FOXP4 than non-mammals. Specific analyses related to linear motifs in the four genes showed also a clear differentiation between FOXPs in mammals and non-mammals. We predicted for the first time the role of IDRs and SLIMs in the FOXP gene family associated with possible adaptive novelties within Tetrapoda. For instance, we found gain and loss of important phosphorylation sites in the Homo sapiens FOXP2 IDR regions, with possible implication for the evolution of human speech.Entities:
Year: 2017 PMID: 28257525 PMCID: PMC5409772 DOI: 10.1590/1678-4685-GMB-2016-0115
Source DB: PubMed Journal: Genet Mol Biol ISSN: 1415-4757 Impact factor: 1.771
Mean disorder proportion for FOXP proteins by class1.
| Class | FOXP2 | FOXP4 | FOXP1 | FOXP3 |
|---|---|---|---|---|
| Mammals | 0.7011 | 0.7321 | 0.6915 | 0,3065 |
| Birds | 0.7039 | 0.6858 | 0.6782 | |
| Reptiles | 0.6984 | 0.6827 | 0.6713 | |
| Amphibians | 0.6305 | 0.7068 | NA |
Mammals showed significant higher proportions than the other groups, as assessed by the Kruskal-Wallis test, for FOXP1and FOXP4 (P < 0.001). Additionally, according to the same test, amphibians presented a lower degree of disorder for FOXP2 (P < 0.01).
Only mammalian genes were used for the FOXP3 analysis.
NA: Not available. Since several base pairs in Xenopus laevis FOXP1 sequence are missing, we excluded it from the analysis.
Linear motifs changes in representative species of Tetrapoda, as predicted by ELM.
| Aligned Position | Nucleotide | Amino acid | Grantham Score |
|
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| FOXP1 | 33 | GGT- > AGT | Gly- > Ser | 56 | 0 | 0 |
| |||||
| GGT- > GCA | Gly- > Ala | 60 | 1 | |||||||||
| GGT- > AGC | Gly- > Ser | 56 | 1p,r,v | 1p,r,v | ||||||||
| GGT- > GGC | Gly- > Gly | Syn | 1p,r,v | |||||||||
| FOXP2 | 314 | GCA- > GCG | Ala- > Ala | Syn | 0 | 0 | 0 | |||||
| GCG- > CCA | Ala- > Pro | 27 | ||||||||||
| GCG- > TCT | Ala- > Ser | 99 | 1d | 1d | 1d | 1d,s | ||||||
| GCG- > CCC | Ala- > Pro | 27 | ||||||||||
| 368 | AAC- > ACC | Asn- > Thr | 65 | 0o4,q3 | 1o4,q4 | 1o4,q4 | 1o4,q4 | 1o4,q4 | 1o4,q4 | 1o4,q4 | ||
| 390 | AGT- > AAT | Ser- > Asn | 46 | 0o,q | 1 | 1 | 1 | 1 | 1 | 0o | ||
| FOXP4 | 408 | CCG- > CCA | Pro- > Pro | Syn | 0 | 0 | 1 | |||||
| CCG- > CTG | Pro- > Leu | 98 | 1i | 1i | 1i | |||||||
| CCG- > TTG | Pro- > Leu | 98 | 1i | |||||||||
| 689 | TCG- > TCA | Ser- > Ser | Syn | 0d | 0d | |||||||
| TCG- > TTG | Ser- > Leu | 145 | ||||||||||
| TCG- > GTG | Ser- > Val | 124 | 1 | |||||||||
| TCG- > ACA | Ser- > Thr | 58 | 1k | |||||||||
| TCG- > ACG | Ser- > Thr | 58 | 1k | 1k | ||||||||
| TCG- > GTC | Ser- > Val | 124 | 1j | |||||||||
indicates gap.
Syn = synonymous changes.
Zero (0) indicates the amino acid present in the Homo sapiens reference sequence, whereas 1 indicates a variant amino acid .Subscribed letters indicate the predicted presence of specific Eukaryotic Linear Motifs (see code shown in Table S9). Subscribed numbers are the number of times that each SLIMs appeared.
The nature of modification is not representing an ancestry and descendant relationship. Grantham scores predicted as conservative (0-50), moderately conservative (51-100), moderately radical (101-150) or radical (> 151).
FOXP3 Linear motifs changes in Mammals, as predicted by ELM.
| Aligned Position | 194 | ||||||
|---|---|---|---|---|---|---|---|
| Nucleotide | GTG- > ATG | GTG- > ACA | GTG- > TTG | GTG- > GGG | GTG- > GCA | GTG- > GCG | GTG- > ACG |
| Amino acid | Val- > Met | Val- > Thr | Val- > Leu | Val- > Gly | Val- > Ala | Val- > Ala | Val- > Thr |
| Grantham Score | 21 | 69 | 32 | 109 | 64 | 64 | 69 |
|
| 0o | ||||||
|
| 0o | ||||||
|
| 0o | ||||||
|
| 0o | ||||||
|
| 0o | ||||||
|
| 0o | ||||||
|
| 0o | ||||||
|
| 0o | ||||||
|
| 1o | ||||||
|
| 1o | ||||||
|
| 1o | ||||||
|
| 1b e o q2 w | ||||||
|
| 1b e o q2 w | ||||||
|
| 0o | ||||||
|
| 1b e o q2 w | ||||||
|
| 1m3 j o | ||||||
|
| 1o | ||||||
|
| 1c | ||||||
|
| 1c | ||||||
|
| 1c | ||||||
|
| 1o | ||||||
|
| 1o | ||||||
|
| 1o | ||||||
|
| 1o | ||||||
|
| 1o | ||||||
|
| 0o | ||||||
|
| 0o | ||||||
|
| 1o | ||||||
|
| 0o | ||||||
|
| 0o | ||||||
|
| 0 | ||||||
|
| 0o | ||||||
|
| 0o | ||||||
|
| 0o | ||||||
|
| 0o | ||||||
|
| 0o | ||||||
|
| 1b e o q2 w | ||||||
|
| 0o | ||||||
|
| 0o | ||||||
|
| 1s | ||||||
|
| 1o | ||||||
|
| 1c | ||||||
|
| 1 | ||||||
|
| 1b e o q2 w | ||||||
|
| 1b e o q2 w | ||||||
|
| 1f t v | ||||||
|
| 1e w | ||||||
|
| 1o2 s | ||||||
indicates gap. Syn = synonymous change. Zero (0) indicates the amino acid present in the Homo sapiens reference sequence whereas 1 indicates a variant amino acid .Subscribed letters indicate the predicted presence of specific Eukaryotic Linear Motifs (see code shown in Table S9). The nature of modification is not representing an ancestry and descendant relationship. Grantham scores predicted as conservative (0-50) moderately conservative (51-100) moderately radical (101-150) or radical (> 151).
FOXP3-specific changes in primates.
| Organisms | Aligned position | Human position | AA Change | Motifs |
|---|---|---|---|---|
| Neanderthal and Humans | 140 | 132 | Pro- > Thr | (+2) DEG_SCF_FBW7_1 |
| 183 | 175 | Gly- > Ser | (+) LIG_PTAP_UEV_1 | |
| Neanderthal | 192 | 184 | Ser- > Leu | (-) MOD_CK1_1, (+) DOC_MAPK_1 |
| Catarrhini | 278 | 270 | Pro- > Ser | (+) MOD_GSK3_1 |
| Haplorhini | 82 | 74 | Val- > Leu | (-) DOC_WW_Pin1_4, (-)MOD_ProDKin_1 |
| 97 | 89 | Ser- > Leu | ||
| 129 | 121 | Arg- > His | ||
| 132 | 124 | Asp- > Glu | ||
| 181 | 173 | Ser- > Asn | (-)DOC_WW_Pin1_4, (-) MOD_ProDKin_1 | |
| 246 | 238 | Val- > Met | ||
| 262 | 254 | Gly- > Ser | ||
| 338 | 325 | Phe- > Leu | ||
| 424 | 411 | Phe- > Leu |
+: change causes motif gain; -: change causes motif loss.
Excluding Tarsius syrichta.
Number of shared and unique short linear motifs (SLIMs) among Tetrapoda FOXPs.
| Protein | Total type of SLIMs | Number of unique SLIMs | Total SLIMs in | Total SLIMs in | Total SLMIs in | Total of species compared |
|---|---|---|---|---|---|---|
| FOXP1 | 34 | 4 | 132 | 132 | 135 | 50 |
| FOXP2 | 28 | 0 | 143 | 142 | 140 | 54 |
| FOXP3 | 32 | 3 | 69 | 62 | - | 57 |
| FOXP4 | 39 | 6 | 142 | 143 | 160 | 65 |
Bird, representing Sauropsida.
DEG_SCF_FBW7_2, LIG_PCNA_PIPBox_1, LIG_WD40_WDR5_1, and TRG_NES_CRM1_1);
DOC_PP2B_1, TRG_NLS_MonoCore_2, and TRG_NLS_MonoExtN_4;
FOXP4 presents six motifs, among which three (DOC_PP1_RVXF_1, LIG_BRCT_BRAC1_2, and TRG_NLS_MonoExtC_3) are common to almost all species investigated in the current study.