| Literature DB >> 30744572 |
Carrie A Whittle1, Cassandra G Extavour2,3.
Abstract
BACKGROUND: Germ lines are the cell lineages that give rise to the sperm and eggs in animals. The germ lines first arise from primordial germ cells (PGCs) during embryogenesis: these form from either a presumed derived mode of preformed germ plasm (inheritance) or from an ancestral mechanism of inductive cell-cell signalling (induction). Numerous genes involved in germ line specification and development have been identified and functionally studied. However, little is known about the molecular evolutionary dynamics of germ line genes in metazoan model systems.Entities:
Keywords: Apis; Caenorhabditis; Codon usage; Drosophila; Germ line genes; Metazoans; Molecular evolution; Primordial germ cells; Protein divergence; Specification mode
Mesh:
Year: 2019 PMID: 30744572 PMCID: PMC6371493 DOI: 10.1186/s12862-019-1363-x
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
The 34 germ line genes studied in Drosophila. For this study, genes have been classified based on known roles in PGC-specification in established models. However, many genes (excluding LSI genes) have been reported to play roles in both germ line specification and at some stage(s) of germ line development or maintenance across model systems and specification modes [1, 3, 4, 13–16].
| Gene | Gene Name | FlyBase ID | Length (codons) | Established PGC or Germ Line Role in Animal Models | References |
|---|---|---|---|---|---|
| Category 1: Lineage-Specific Inheritance (LSI) ( | |||||
|
|
| FBgn0003015 | 607 | Assembly of germ plasm components | [ |
|
|
| FBgn0016053 | 72 | Localized to germ plasm; transcriptional repression in pole cells (DM) | [ |
| Category 2: Inheritance ( | |||||
|
|
| FBgn0041164 | 1189 | [ | |
|
|
| FBgn0000114 | 811 | [ | |
|
|
| FBgn0000256 | 1362 | Interacts with | [ |
|
|
| FBgn0000392 | 1118 | [ | |
|
|
| FBgn0000405 | 531 | Cell cycle; mRNA localized to germ plasm (DM) | [ |
|
|
| FBgn0005695 | 570 | Localized to germ plasm; required for PGC specification (DM) | [ |
|
|
| FBgn0002736 | 148 | Required for PGC specification; localization of | [ |
|
|
| FBgn0004882 | 916 | Localized to germ plasm; | [ |
|
|
| FBgn0263102 | 1124 | Required for germ plasm formation | [ |
|
|
| FBgn0003475 | 1021 | Products are component of germ plasm (DM), interacts with | [ |
|
|
| FBgn0003520 | 1027 | Localization to germ plasm, PGC specification | [ |
|
|
| FBgn0003891 | 2516 | Component of germ plasm, germ cell formation (DM) | [ |
|
|
| FBgn0003978 | 368 | Component of germ plasm, involved in assembly (DM) | [ |
| Category 3: Induction ( | |||||
|
|
| FBgn0035625 | 1217 | BMP signalling pathway for PGC specification, represses somatic expression (MM, GB) | [ |
|
|
| FBgn0014135 | 771 | FGF protein; mitogen for PGCs (MM) | [ |
|
|
| FBgn0005592 | 1053 | FGFR (FGF receptor), linked to PGCs (MM) | [ |
|
|
| FBgn0011723 | 698 | Activation of | [ |
|
|
| FBgn0000490 | 589 | BMP2/4 ortholog; involved in BMP-BLIMP1 signalling for PGC specification (MM, GB) | [ |
|
|
| FBgn0024234 | 456 | BMP5/7/8 ortholog; involved in PGC specification (MM, GB) | [ |
|
|
| FBgn0011648 | 526 | Ortholog to MM | [ |
|
|
| FBgn0011655 | 772 | Ortholog to MM | [ |
|
|
| FBgn0003169 | 521 | BMP receptor; putative ortholog in MM Bmpr2 (note; or Acvr2); associated with PGCs in GB | [ |
|
|
| FBgn0003317 | 583 | BMP receptor; putative mammalian ortholog Bmpr1a/b (note: or Acvr) involved PGC specification in birds | [ |
|
|
| FBgn0025800 | 487 | Orthologs are | [ |
|
|
| FBgn0003463 | 1039 | MM ortholog CHRD involved in BMP/CHRD signalling pathway, which relates to PGCs; Regulates DPP (DM) | [ |
|
|
| FBgn0003716 | 576 | BMP receptor; mammalian ortholog (Bmpr1a/b) involved PGC specification in birds, GB | [ |
|
|
| FBgn0284084 | 469 | Ortholog to MM | [ |
|
|
| FBgn0024179 | 914 | BMP receptor; putative ortholog to MM Bmpr2 (similar to punt), GB-punt is linked to PGCs in GB | [ |
| Category 4: Inh/Ind ( | |||||
|
|
| FBgn0002962 | 402 | Germ plasm component (DM), regulates mRNA, associated with PGCs in MM; common germ cell across metazoans | [ |
|
|
| FBgn0004872 | 844 | Component of germ plasm (DM); germ line essential across metazoans (DM, MM) | [ |
|
|
| FBgn0003165 | 1534 | Regulate mRNA in PGCs (DM), conserved germ cell role in humans | [ |
|
|
| FBgn0283442 | 662 | mRNA and protein localizes to germ plasm (DM, zebrafish, | [ |
NOTE: Gene identifiers are from the reference species D. melanogaster. Genes were placed in one of four categories based on their role in PGC-specification. Experimental or cytological evidence linking genes to lineage-specific inheritance (LSI), to Inheritance or Induction modes, or Inh/Ind are shown. This gene list was used as a reference to identify orthologs in five other Drosophila species from the melanogaster group, and in C. elegans and A. mellifera using reciprocal BLASTX. Length is for the full CDS per gene. Species abbreviations in cited evidence are Drosophila melanogaster (DM), Gryllus bimaculatus (GB), or Mus musculus (MM).
/ and mENC’ across the phylogeny of six species of the reference model Drosophila
| Gene | Mean mENC' | SE | Alignment Length (codons) | |
|---|---|---|---|---|
| Lineage-specific Inheritance | ||||
|
| 0.1573 | 52.18 | 0.29 | 580 |
|
| 0.0933 | 37.25 | 0.66 | 71 |
| Inheritance | ||||
|
| 0.1189 | 54.26 | 0.35 | 1007 |
|
| 0.1102 | 54.09 | 0.25 | 789 |
|
| 0.0885 | 53.59 | 0.19 | 990 |
|
| 0.0878 | 54.23 | 0.55 | 1048 |
|
| 0.0810 | 51.02 | 0.44 | 367 |
|
| 0.0773 | 54.28 | 0.44 | 888 |
|
| 0.0751 | 54.34 | 0.17 | 1001 |
|
| 0.0716 | 53.77 | 0.38 | 1446 |
|
| 0.0692 | 50.23 | 0.41 | 509 |
|
| 0.0392 | 55.13 | 0.20 | 568 |
|
| 0.0369 | 51.39 | 0.19 | 723 |
|
| 0.0297 | 49.71 | 0.27 | 469 |
|
| 0.0001 | 42.29 | 0.72 | 147 |
| Induction | ||||
|
| 0.1303 | 53.17 | 0.43 | 494 |
|
| 0.0935 | 53.99 | 0.63 | 1040 |
|
| 0.0833 | 47.48 | 0.95 | 261 |
|
| 0.0734 | 50.42 | 0.32 | 561 |
|
| 0.0575 | 49.71 | 0.23 | 651 |
|
| 0.0572 | 51.06 | 0.44 | 1110 |
|
| 0.0517 | 51.64 | 0.24 | 430 |
|
| 0.0462 | 53.52 | 0.39 | 897 |
|
| 0.0422 | 52.47 | 0.57 | 759 |
|
| 0.0402 | 50.88 | 0.80 | 565 |
|
| 0.0305 | 49.45 | 0.48 | 919 |
|
| 0.0303 | 49.58 | 0.72 | 451 |
|
| 0.0248 | 49.75 | 0.54 | 510 |
|
| 0.0232 | 52.42 | 0.31 | 433 |
|
| 0.0135 | 50.54 | 0.17 | 468 |
| Inh/Ind | ||||
|
| 0.1145 | 50.51 | 0.38 | 389 |
|
| 0.0545 | 50.59 | 0.70 | 572 |
|
| 0.0446 | 54.91 | 0.76 | 711 |
|
| 0.0320 | 50.53 | 0.38 | 798 |
NOTE. Results are shown for each of the 34 genes under study a measured using codeml in PAML [55]. Genes are classified based on their role in PGC specification and ranked by / within each group. The mean dN and mean dS values are provided in Table S4. SE=standard error for mENC’.
/ and mENC’ across the phylogeny of four species of Caenorhabditis
| CE Gene | DM Gene | Mean mENC' | SE | |
|---|---|---|---|---|
| Lineage-specific Inheritance | ||||
|
| - | 0.1619 | 46.13 | 0.42 |
|
| - | 0.1553 | 50.29 | 0.45 |
| Inheritance | ||||
|
|
| 0.1127 | 45.01 | 0.68 |
|
|
| 0.0833 | 51.56 | 1.40 |
|
|
| 0.0719 | 52.78 | 0.58 |
|
|
| 0.0570 | 51.68 | 0.58 |
|
|
| 0.0499 | 52.63 | 0.80 |
|
|
| 0.0433 | 51.35 | 0.46 |
|
|
| 0.0386 | 50.54 | 1.52 |
|
|
| 0.0081 | 38.41 | 1.79 |
| Induction | ||||
|
|
| 0.0963 | 51.18 | 0.80 |
|
|
| 0.0748 | 49.36 | 1.09 |
|
|
| 0.0675 | 50.00 | 1.30 |
|
|
| 0.0639 | 52.74 | 0.37 |
|
|
| 0.0575 | 52.56 | 0.41 |
|
|
| 0.0548 | 52.34 | 0.67 |
|
|
| 0.0522 | 50.24 | 1.15 |
|
|
| 0.0395 | 47.73 | 0.45 |
|
|
| 0.0264 | 49.69 | 0.42 |
|
|
| 0.0128 | 46.30 | 0.78 |
| Inh/Ind | ||||
|
|
| 0.0761 | 43.98 | 1.48 |
|
|
| 0.0753 | 50.79 | 0.92 |
|
|
| 0.0642 | 46.13 | 1.81 |
NOTE. Values are shown for each of the 23 genes under study as measured using codeml in PAML [55]. Genes are ranked from highest to lowest / values within each category. CE Caenorhabditis elegans, DM D. melanogaster. The inheritance gene par-1 was included for Caenorhabditis (see Methods). SE standard error for mENC’.
/ and mENC’ across the phylogeny of four species of Apis
| Gene | Mean mENC' | SE | |
|---|---|---|---|
| Inheritance N=13 | |||
|
| 0.1011 | 54.66 | 0.13 |
|
| 0.0792 | 56.67 | 0.27 |
|
| 0.0774 | 53.24 | 0.07 |
|
| 0.0673 | 51.78 | 0.34 |
|
| 0.0564 | 56.22 | 0.05 |
|
| 0.0560 | 53.13 | 0.15 |
|
| 0.0395 | 51.24 | 0.10 |
|
| 0.0326 | 53.42 | 0.27 |
|
| 0.0238 | 53.39 | 0.53 |
|
| 0.0192 | 55.04 | 0.13 |
|
| 0.0173 | 54.07 | 0.32 |
|
| 0.0067 | 54.84 | 0.25 |
|
| 0.0001 | 46.43 | 0.39 |
| Induction N=13 | |||
|
| 0.1342 | 52.24 | 0.14 |
|
| 0.1163 | 51.80 | 0.24 |
|
| 0.0854 | 48.68 | 0.48 |
|
| 0.0627 | 55.22 | 0.21 |
|
| 0.0462 | 55.48 | 0.15 |
|
| 0.0375 | 49.42 | 0.72 |
|
| 0.0363 | 46.37 | 0.40 |
|
| 0.0335 | 52.73 | 0.39 |
|
| 0.0324 | 53.20 | 0.25 |
|
| 0.0133 | 52.72 | 0.26 |
|
| 0.0116 | 53.91 | 0.09 |
|
| 0.0080 | 52.44 | 0.36 |
|
| 0.0001 | 51.63 | 0.29 |
| Inh/Ind | |||
|
| 0.1393 | 54.72 | 0.39 |
|
| 0.1155 | 53.33 | 0.31 |
|
| 0.0038 | 53.84 | 0.29 |
|
| 0.0001 | 38.89 | 0.96 |
NOTE. Values are from codeml in PAML [55]. Genes are listed using the ortholog name from D. melanogaster. The inheritance gene par-1 was included for Apis (see Methods). SE standard error for mENC
Results from sites analysis of positive selection (M7 versus M8) in Drosophila, Caenorhabditis and Apis
| Gene | Role |
| Sites (BEB Probabilities) | |
|---|---|---|---|---|
|
| ||||
|
| Induction | ** | 29 S** | |
|
| Inheritance | ** | 252 P**, 253H**, 437 T* | |
|
| ||||
|
| Inheritance | ** | 423 N* | |
|
| Induction | ** | 231 S* | |
|
| Inheritance | ** | 468 A**, 625 N*, 647 Q*, 701 G**, 704 T**,705V* | |
|
| ||||
|
| Inheritance | ** | 337 T*, 339 A*, 341 A* | |
|
| Inheritance | ** | 4 G**, 5 L** | |
|
| Induction | * | 80 S*, 81 T*, 86 Q**, 87 L * | |
|
| Induction | ** | - | |
|
| Inheritance | ** | - | |
|
| Inh/Ind | ** | 153 H**,318 T** ,320 A ** | |
|
| Inheritance | ** | - | |
|
| Inh/Ind | ** | 306 Y* | |
|
| Inheritance | ** | - | |
|
| Induction | ** | 329 I* | |
NOTE: Analysis was conducted in PAML [55]. Only those genes at or near statistical significance are shown. ** P<0.05 for 2X∆lnL *P=0.062. BEB probabilities for specific sites are indicated as ** P≥0.95 (**) and *0.95≥P>0.9
Fig. 1.The expression breadth (percentage of 30 developmental stages expressed) for the 34 Drosophila genes under study and its relationship to protein divergence. a breadth of expression across developmental stages at a level >0 RPKM; b breadth of expression at a level >5 RPKM; c / versus expression breadth (RPKM>0); d / versus expression breadth (RPKM>5). The 30 tissues and stages are provided in Table S8. For A-B, genes are listed on the X-axis in the same order as presented in Table 2, from highest to lowest / within each of the four categories of genes. Gene names associated with each dN/dS point are provided in Table 2.
Fig. 2.Correlations in / and in mENC’ between Drosophila and Caenorhabditis and Drosophila and Apis. a The / values for Caenorhabditis versus those from Drosophila; b the / values for Apis versus those from Drosophila; c mENC’ for Caenorhabditis versus Drosophila; d mENC’ for Apis versus Drosophila. Spearman’s ranked R and P values are shown across all genes for the between genus orthologs (pooled across categories). The lineage-specific inheritance (LSI) genes in Drosophila and in Caenorhabditis genes are shown on the X and Y axis respectively in A and C and were not included in the calculation of between genus correlations. The gene names associated with each value of / and mENC’ shown are provided in Tables 2, 3, and 4.