| Literature DB >> 29177034 |
Hafiz Ishfaq Ahmad1, Guiqiong Liu1, Xunping Jiang1, Shishay Girmay Edallew1, Teketay Wassie1, Birhanu Tesema1, Yu Yun1, Liu Pan1, Chenhui Liu1, Yuqing Chong1, Zhao Jia Yu1, Han Jilong1.
Abstract
Bone morphogenetic proteins (BMPs) and the growth factors (GDFs) play an important role in ovarian folliculogenesis and essential regulator of processes of numerous granulosa cells. BMP15 gene variations linked to various ovarian phenotypic consequences subject to the species, from infertility to improved prolificacy in sheep, primary ovarian insufficiency in women or associated with minor subfertility in mouse. To study the evolving role of BMP15 and GDF9, a phylogenetic analysis was performed. To find out the candidate gene associated with prolificacy in mammals, the nucleotide sequence of BMP15 and GDF9 genes was recognized under positive selection in various mammalian species. Maximum-likelihood approaches used on BMP15 and GDF9 genes exhibited a robust divergence and a prompted evolution as compared to other TGFβ family members. Furthermore, among 32 mammalian species, we identified positive selection signals in the hominidae clade resulting to 132D, 147E, 163Y, 191W, and 236P codon sites of BMP15 and 162F, 188K, 206R, 240A, 244L, 246H, 248S, 251D, 253L, 254F and other codon sites of GDF9. The positively selected amino acid sites such as Alanine, Lucien, Arginine, and lysine are important for signaling. In conclusion, this study evidences that GDF9 and BMP15 genes have rapid evolution than other TGFß family members and was subjected to positive selection in the mammalian clade. Selected sites under the positive selection are of remarkable significance for the particular functioning of the protein and consequently for female fertility.Entities:
Keywords: BMP15; GDF9; evolution; mammals; maximum likelihood; positive selection
Year: 2017 PMID: 29177034 PMCID: PMC5689494 DOI: 10.1002/ece3.3336
Source DB: PubMed Journal: Ecol Evol ISSN: 2045-7758 Impact factor: 2.912
Log likelihood values and test statistics for PAML site models of positive selection
| Gene | No. of residues | No. of species | Model | Parameter estimates |
|
|
| PAML | FEL | REL | SLAC | % of sites |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| BMP15 | 1234 | 37 | M7 |
| −19,737.9 | −19,329.1 | 17.69 |
| 17, 126, 144, 148, | 45, 95, 126, 144, | 45, 384, 386 | 0.53 |
| M8 | P0 = 0.94697, | 200, 203, 258, 289, | 248, 363 | 363, 384 | ||||||||
| P1 = 0.05303, | 297, 315, 323, 365 | |||||||||||
| GDF9 | 1371 | 32 | M7 |
| −13,717.5 | −13,716.2 | 2.59* | 30, 186, | 21, 30, 34, 139, 186 | 21, 30, 186, 299, | 21, 30, 186, 299, | 0.78 |
| M8 | P0 = 0. 99219, |
| 299, 304, 339 | 304, 339 | 304 | |||||||
| P1 = 0.00781, |
The proportion of sites under positive selection (p1), or under selective constraint (p0), and parameters p and q for the beta distribution. Parameters indicating positive selection are in bold. p: significant at 5% level; p: significant at 1% level. Sites potentially under positive selection identified under model M8 are listed according to the human sequence numbering. Positively selected sites with posterior probability 0.9 are italicized, 0.8–0.9 in bold, and 0.5–0.7 in plain text. The test statistic 2Δl is compared to a χ2 distribution with 2 df, critical values 5.99, 9.21, and 13.82 at 5%, 1%, and 0.1% significance, respectively.
**Significant at 1% level; *Significant at 5% level.
Positively selected sites under different PAML site models using bayes empirical bayes analysis
| Gene | Model | Codon | Amino acid | Posterior probability | Post mean ± |
|---|---|---|---|---|---|
| BMP15 | M8: selection, beta+ ω | 40 | V | 0.846 | 1.407 ± 0.22 |
| 49 | I | 0.678 | 1.303 ± 0.29 | ||
| 75 | Q | 0.755 | 1.350 ± 0.27 | ||
| 90 | R | 0.923 | 1.455 ± 0.16 | ||
| 109 | A | 0.824 | 1.393 ± 0.24 | ||
| 133 | V | 0.888 | 1.434 ± 0.19 | ||
| 176 | H | 0.55 | 1.218 ± 0.32 | ||
| 191 | P | 0.703 | 1.315 ± 0.29 | ||
| 199 | G | 0.567 | 1.224 ± 0.32 | ||
| 200 | R | 0.651 | 1.281 ± 0.31 | ||
| 203 | P | 0.607 | 1.249 ± 0.32 | ||
| 258 | W | 0.572 | 1.227 ± 0.32 | ||
| 289 | K | 0.609 | 1.253 ± 0.32 | ||
| 297 | K | 0.765 | 1.358 ± 0.26 | ||
| 315 | A | 0.727 | 1.331 ± 0.28 | ||
| 323 | R | 0.775 | 1.362 ± 0.26 | ||
| 365 | V | 0.578 | 1.238 ± 0.31 | ||
| GDF‐9 | M8: selection, beta+ ω | 30 | G | 0.774 | 1.576 ± 0.39 |
| 186 | F | 0.581 | 1.270 ± 0.34 | ||
| 245 | L | 0.814 | 1.185 ± 0.34 | ||
| 254 | L | 0.722 | 1.242 ± 0.34 | ||
| 292 | G | 0.824 | 1.203 ± 0.34 | ||
| 302 | Y | 0.531 | 1.350 ± 0.36 | ||
| 304 | V | 0.958 | 1.271 ± 0.33 |
Posterior probabilities >90%.
Figure 1(a) Location of positively selected amino acid sites identified BMP15 gene. The crystal structure of human BMP as a reference and positively selected sites were drawn onto the crystal structure using Phyre tool (http://www.sbg.bio.ic.ac.uk/phyre2/html). All the residues identified as under selection fall in the domain containing the ligand binding site. The sites which fall in the region identified as the ligand binding site and another cluster in a region immediately following the signal sequence. (b) Location of positively selected amino acid sites identified GDF9 gene. The crystal structure of human GDF9 as a reference and positively selected sites was drawn onto the crystal structure using Phyre tool (http://www.sbg.bio.ic.ac.uk/phyre2/html). All the residues identified as under selection fall in the domain containing the ligand binding site. The sites which fall in the region identified as the ligand binding site and another cluster in a region immediately following the signal sequence
Figure 2Selection pressures among goat BMP15 gene sequences using mechanistic empirical combination (MEC) model of selecton online tool. Yellow and brown highlights represent positive selection, gray and white highlights represent neutral selection, and purple highlight represents negative selection on codons
Figure 3Selection pressures among goat GDF9 gene sequences using mechanistic empirical combination (MEC) model of selecton online tool. Yellow and brown highlights represent positive selection, gray and white highlights represent neutral selection, and purple highlight represents negative selection on codons
Figure 4The protein–protein interaction (PPI) network built by STRING database for BMP15 and GDF genes. Gray and Red circles characterize downregulated and upregulated genes, respectively. Line thickness indicates the strength of the interaction. Dash and solid edge mean negative and positive correlation coefficient. Network nodes denote proteins' post‐transcriptional modifications or splice isoforms, and each node represents all the proteins produced by a single, protein‐coding gene locus