| Literature DB >> 35069705 |
Tobias Heinen1, Chen Xie2, Maryam Keshavarz2,3, Dominik Stappert3, Sven Künzel2, Diethard Tautz2.
Abstract
Map2k7 (synonym Mkk7) is a conserved regulatory kinase gene and a central component of the JNK signaling cascade with key functions during cellular differentiation. It shows complex transcription patterns, and different transcript isoforms are known in the mouse (Mus musculus). We have previously identified a newly evolved testis-specific transcript for the Map2k7 gene in the subspecies M. m. domesticus. Here, we identify the new promoter that drives this transcript and find that it codes for an open reading frame (ORF) of 50 amino acids. The new promoter was gained in the stem lineage of closely related mouse species but was secondarily lost in the subspecies M. m. musculus and M. m. castaneus. A single mutation can be correlated with its transcriptional activity in M. m. domesticus, and cell culture assays demonstrate the capability of this mutation to drive expression. A mouse knockout line in which the promoter region of the new transcript is deleted reveals a functional contribution of the newly evolved promoter to sperm motility and the spermatid transcriptome. Our data show that a new functional transcript (and possibly protein) can evolve within an otherwise highly conserved gene, supporting the notion of regulatory changes contributing to the emergence of evolutionary novelties.Entities:
Keywords: kinase; new promotor; regulatory evolution; sperm maturation; testis
Year: 2022 PMID: 35069705 PMCID: PMC8766832 DOI: 10.3389/fgene.2021.812139
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
FIGURE 1Map2k7 expression in M. m. domesticus versus M. m. musculus. (A) Northern blots with testis RNA from M. m. domesticus individuals of a wild population from France (Fra) and the laboratory inbred strain C57Bl6/J (Bl6), as well as M. m. musculus individuals from a wild-caught population from Vienna (Vie) and a wild-caught population from the Czech Republic (Cze). A fragment spanning exons 5-10 was used as a probe (see Methods for generating this fragment). The strongly expressed 1.6 kb band is only visible in the two individuals from the M. m. domesticus subspecies. The weak 1.6 kb band in M. m. musculus is derived from another transcript variant that differs with respect to an additional intron in the 3′-end of the long transcript (compare annotation in Figure 2). (B) In situ hybridization with the Map2k7 probe on cross-sections of seminiferous tubules of an individual from M. m. domesticus (left) and an individual from M. m. musculus (right) was the same fragment as used for the Northern blots. Note that the M. m. domesticus signal developed already after 3 h of color incubation, while the M. m. musculus signal developed only after 4 days of color incubation. This is compatible with the fact that the signal in M. m. domesticus comes mainly from the strong 1.6 kb transcript seen in the Northern blot, while the signal in M. m. musculus comes from the weaker 3.5 kb transcript. Microscope pictures were taken at ×100 magnification.
FIGURE 2Map2k7 transcript variants and transcriptome read coverage. The figure is based on UCSC genome browser (Kent et al., 2002) tracks for the mouse GRCm38/mm10 reference sequence (Waterston et al., 2002). The top panel shows the different transcript and splice versions from the mouse reference genome, representing M. m. domesticus and therefore includes the new promoter/exon (highlighted by grey shading). The middle panel is based on data from (Harr et al., 2016) and shows transcriptome read mapping tracks for different tissues. The lower panel is based on transcriptome data from (Harr et al., 2016) and (Neme and Tautz, 2016) and shows transcriptome read mapping tracks for different populations, subspecies, and species. The phylogenetic relationships are depicted to the left. DOM: M. m. domestics, MUS: M. m. musculus, CAS: M. m. castaneus.
FIGURE 3Comparison of the main Map2k7 transcripts and their coding potentials. Transcript depictions are taken from the UCSC browser annotations (see also Figure 2), whereby only the two main transcripts are shown, the conserved one (γ1) and the new one starting from the new promoter (α1). The protein domains of the Map2k7 functional kinase are depicted on the top, including the JNK-binding sites (D-domains in blue), the kinase domain (in orange) with its functional phosphorylation sites (yellow), as well as the C-terminal DVD domain (green). The first AUG in the new transcript is in the third exon, and it has a 50aa ORF, which would represent a de novo generated protein. If this AUG would not be used, the next AUG in a different frame could potentially lead to the expression of a truncated version of the Map2k7 protein, containing only the kinase and the DVD domains. The corresponding coding exon sequences (up to exon 4 for the γ1 transcript) are depicted below (flanked by their splice sites, introns are depicted as three dots) with the translations in the two alternative reading frames.
FIGURE 4Functional test of the promoter region driving the new transcript. The top shows the alignment scheme of genomic sequences from populations, sub-species, and species based on the genome data described in (Harr et al., 2016), aligned to the mouse GRCm38/mm10 reference sequence. The fragment shown represents the one used for the promoter studies - only replacements with respect to the reference are marked, the two relevant regions discussed in the text are enlarged with their respective sequences. DOM represents M. m. domesticus populations, MUS represents M. m. musculus populations, CAS represents a M. m. castaneus population. T/A is the only mutation that correlates with the expression of the new promoter. Note that the transcriptional start site, marked as “0”, is located between the annotated site for this transcript, which would be 31 bp further upstream, and the site from which the bulk of the transcripts generated in the RNASeq experiment starts, which is 8 bp downstream (see Supplementary Figure S2 for a corresponding sequence depiction of this region). The bottom shows the scheme of the four constructs tested in cell culture and their expression levels measured as relative luminescence units [RLU] (see Methods). Error bars indicate standard deviations from eight replicates; all differences are significant (p < 0.05, t-test).
Testis and sperm analysis for the Map2k7-α1 promotor KO animals versus wild type (WT) animals. Testis weights were normalized with mouse body weights. Sperm analysis was done with a CEROS Sperm Analyzer (Hamilton Thorne, Beverly MA) setup. Two-sided t-tests were done to calculate the p-values. SD is standard deviation (in italics).
| Genotype | Mouse weight (g) | Normalized testis weight (g) | Motile sperm (%) | Progressive sperm (%) | |
|---|---|---|---|---|---|
| WT (N = 10) | averages | 25.14 | 0.20 | 48 | 26 |
|
|
|
|
|
| |
| KO (N = 18) | averages | 27.59 | 0.17 | 42 | 22 |
|
|
|
|
|
| |
|
| 0.003 | 0.001 | 0.021 | 0.031 |
FIGURE 5Whole transcriptomes analysis of three tissues from wild type versus knockout mice. (A) Overall PCA comparison. Strong differentiation is seen between the tissue samples, implying that those transcript sets are very different. The differences between wild type and knockouts are much smaller for each tissue. (B–D) Significantly differentially expressed genes. A dot represents each gene, genes with padj < 0.05 values are plotted as blue dots. The number of significantly differentially expressed genes is provided as an inset for each tissue.
List of top significant genes in the RNASeq analysis for the three tissue samples. Named genes (i.e., predicted genes were not included) were retrieved from the DeSeq2 output ranked according to baseMean/mRNA length and expression characteristics (see Supplementary Table S1 for full results).
| Tissue | Gene name | Description | log2 fold-change | Padj | Expression | Function | Literature |
|---|---|---|---|---|---|---|---|
| Testis (top 10 most highly expressed genes that are specifically expressed in testis) | |||||||
|
| Nuclear transition protein 2 | −0.37 | 0.0021 | testis | conversion of nucleosomal chromatin in spermatids, involved in sperm motility |
| |
|
| Sperm mitochondrial-associated cysteine-rich protein | −0.25 | 0.0006 | testis | involved in sperm motility and fertilization |
| |
|
| glyceraldehyde-3-phosphate dehydrogenase, spermatogenic | −0.12 | 0.0206 | testis | required for normal sperm motility and male fertility |
| |
|
| Four and a half LIM domains 4 | 0.09 | 0.0186 | testis | may affect sperm maturation and morphology |
| |
|
| tubulin, alpha 3A, spermatogenic | −0.14 | 0.0027 | testis | required for the production of normal spermatozoa |
| |
|
| Cysteine-rich secretory protein 2 | 0.10 | 0.0444 | testis | regulates calcium fluxes during sperm capacitation, essential for fertility |
| |
|
| A-kinase anchor protein 4 | 0.13 | 0.0037 | testis | major structural component of sperm fibrous sheath, plays a role in sperm motility |
| |
|
| tubulin, alpha 3B, spermatogenic | −0.11 | 0.0036 | testis | required for the production of normal spermatozoa |
| |
|
| cytochrome c oxidase subunit 8C | −0.26 | 0.0000 | testis | required for the production of normal spermatozoa |
| |
|
| testis specific H1.7 linker histone | −0.15 | 0.0013 | testis | required for cell restructuring and DNA condensation during spermiogenesis |
| |
| Caput epididymis (top 10 most highly expressed genes that are specifically expressed in testis/epididymis) | |||||||
|
| Epididymal-specific lipocalin-8 | 0.15 | 0.0493 | epididymis | involved in sperm maturation and motility |
| |
|
| lipocalin 9 | 0.14 | 0.0450 | epididymis | may act redundantly to Lcn8 |
| |
|
| Epididymal protein 13 | -0.13 | 0.0378 | epididymis | no fecundity defect in knockout mice |
| |
|
| cystatin 8 (cystatin-related epididymal spermatogenic) | 0.24 | 0.0016 | epididymis | capacitation of spermatozoa |
| |
|
| Ribonuclease-like protein 10 | 0.40 | 0.0000 | epididymis | required for post-testicular sperm maturation |
| |
|
| Disintegrin and metalloproteinase domain-containing protein 7 | 0.15 | 0.0108 | epididymis | required for epididymal integrity, sperm morphology and motility |
| |
|
| Transmembrane epididymal family member 1a | 0.28 | 0.0035 | epididymis | no information on sperm effects | ||
|
| Transmembrane epididymal family member 2 | 0.34 | 0.0001 | epididymis | no information on sperm effects | ||
|
| serine peptidase inhibitor, Kazal type 12 | −0.31 | 0.0003 | epididymis | part of Spink family with redundant functions in sperm maturation |
| |
|
| defensin beta 12 | 0.20 | 0.0033 | epididymis | immunity protein | ||
| Cauda (top 10 most highly expressed genes) | |||||||
|
| Cysteine-rich secretory protein 1 | −0.46 | 0.0013 | testis, epididymis | required to optimize sperm flagellum waveform |
| |
|
| CAMPATH-1 antigen | −0.69 | 0.0005 | broad | immunity protein |
| |
|
| Glutathione peroxidase 3 | −0.73 | 0.0043 | broad | protects cells from oxidative damage, involved in the maturation of sperm cells |
| |
|
| Defensin beta 28 | −0.37 | 0.0404 | testis, epididymis | immunity protein | ||
|
| Serine protease inhibitor Kazal-type 8 | −0.44 | 0.0009 | testis, epididymis | part of Spink family with redundant functions in sperm maturation |
| |
|
| Beta-2-microglobulin | −0.39 | 0.0035 | broad | component of the class I major histocompatibility complex (MHC) | ||
|
| WAP four-disulfide core domain protein 15B | −0.75 | 0.0002 | testis, epididymis | Wfdc family members act in innate immune responses during epididymitis |
| |
|
| defensin beta 2 | −0.38 | 0.0336 | testis, epididymis | immunity protein | ||
|
| Actin, aortic smooth muscle | 0.44 | 0.0409 | broad | cell motility, may affect contractability of seminiferous tubules |
| |
|
| ribosomal protein, large, P1 | −0.54 | 0.0110 | broad | ribosomal protein | ||
| Cauda (top 10 genes with highest log2fold changes) | |||||||
|
| parvalbumin | 8.93 | 0.0158 | broad | no information on sperm effects | ||
|
| myosin, light polypeptide 2, regulatory, cardiac, slow | 8.36 | 0.0338 | broad | motorprotein involved in spermiogenesis |
| |
|
| myosin, heavy polypeptide 7, cardiac muscle, beta | 8.04 | 0.0405 | broad | motorprotein involved in spermiogenesis |
| |
|
| myosin, light polypeptide 1 | 7.26 | 0.0170 | broad | motorprotein involved in spermiogenesis |
| |
|
| ATPase, Ca++ transporting, cardiac muscle, fast twitch 1 | 7.05 | 0.0008 | broad | Ca-gradient regulation, male fertility |
| |
|
| myosin, heavy polypeptide 1, skeletal muscle, adult | 6.73 | 0.0404 | broad | motorprotein involved in spermiogenesis |
| |
|
| actinin alpha 2 | 6.40 | 0.0380 | broad | motorprotein involved in spermiogenesis |
| |
|
| sarcolipin | 6.01 | 0.0307 | broad | no information on sperm effects | ||
|
| troponin T3, skeletal, fast | 5.86 | 0.0050 | broad | motorprotein involved in spermiogenesis |
| |
|
| myosin light chain kinase family, member 4 | 5.85 | 0.0138 | broad | motorprotein involved in spermiogenesis |
| |
Selected from the top 17 genes in the whole list (see Supplementary Table S1B).
Selected from the top 27 genes in the whole list (see Supplementary Table S1D).
Represent the top 10 genes from the list (see Supplementary Table S1F).