| Literature DB >> 29901159 |
Tiancheng Zhang1, Jun Wu1, Caihua Liao2, Zhong Ni3, Jufen Zheng1, Fudong Yu1.
Abstract
mRNA has an important role in spermatogenesis and the maintenance of fertility, and may act as a potential biomarker for the clinical diagnosis of infertility. In the present study, potential biomarkers associated with teratozoospermia were screened through systemic bioinformatics analysis. Initially, genome‑wide expression profiles were downloaded from the Gene Expression Omnibus and primary analysis was conducted using R software, which included preprocessing of raw microarray data, transformation between probe ID and gene symbol and identification of differentially expressed genes. Subsequently, a functional enrichment analysis was conducted using the Database for Annotation, Visualization and Integrated Discovery to investigate the biological processes involved in the development of teratozoospermia. Finally, a protein‑protein interaction network of notable differentially expressed genes was constructed and cross‑analysis performed for multiple datasets, to obtain a potential biomarker for teratozoospermia. It was observed that G protein subunit β 3, G protein subunit α o1 and G protein subunit g transducin 1 were upregulated and enriched using Kyoto Encyclopedia of Genes and Genomes (KEGG) in the network and in cross analysis. Furthermore, ribosomal protein S3 (RPS3), RPS5, RPS6, RPS16 and RPS23 were downregulated and enriched using KEGG in teratozoospermia. In conclusion, the results of the present study identified several mRNAs involved in sperm morphological development, which may aid in the understanding and treatment of infertility.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29901159 PMCID: PMC6072217 DOI: 10.3892/mmr.2018.9112
Source DB: PubMed Journal: Mol Med Rep ISSN: 1791-2997 Impact factor: 2.952
Figure 1.Box plots of data distribution in semen samples. The horizontal axis represents sample names and the vertical axis represents expression values. The first eight samples (left) are normal sperm samples. The reamining 13 samples are teratozoospermia samples. The black line in the box plot is the median of each data group and the data standardization degree may be inferred from its distribution. (A) Data prior to standardization. (B) Data following standardization.
Figure 2.Differential expression analysis. (A) Heat map presenting the expression pattern across different samples. The horizontal axis represents sample names. The first eight samples are normal sperm samples. The further 13 samples are teratozoospermia samples. The left vertical axis presented clusters of DEGs, and the top horizontal axis presents clusters of samples. Red represents upregulated genes and green represents downregulated genes. (B) Volcano plot of DEGs. The y-axis is logFC and the x-axis represents -log10 (adjusted P-value). The red dots represent the DEGs upregulated and the green dots represent the DEGs downregulated while the black dots represent genes that were not differentially expressed. DEGs, differentially expressed genes; FC, fold change.
Top 10 upregulated and downregulated DEGs.
| A, Upregulated DEGs | ||||||
|---|---|---|---|---|---|---|
| Gene | logFC | AveExpr | t | P-value | Adjusted P-value | B |
| HS3ST3A1 | 6.15829 | 6.264543 | 22.41724 | 1.07×10−16 | 1.78×10−13 | 28.22125 |
| XKR4 | 6.108402 | 6.904738 | 38.15492 | 1.13×10−21 | 2.44×10−17 | 38.46529 |
| C14orf37 | 6.089262 | 5.604961 | 29.23487 | 3.63×10−19 | 1.97×10−15 | 33.48849 |
| HYMAI | 5.891272 | 7.247713 | 16.23585 | 9.01×10−14 | 2.38×10−11 | 21.67427 |
| WIF1 | 5.55279 | 5.831665 | 34.46781 | 1.03×10−20 | 1.11×10−16 | 36.61418 |
| SRGAP2C | 5.491472 | 6.150009 | 23.34658 | 4.51×10−17 | 8.89×10−14 | 29.03932 |
| SRGAP2 | 5.182356 | 5.577265 | 26.86696 | 2.24×10−18 | 6.06×10−15 | 31.83706 |
| DQ592442 | 5.070846 | 7.77992 | 19.09086 | 3.15×10−15 | 2.07×10−12 | 24.96463 |
| MOGAT1 | 5.004399 | 5.407998 | 14.35237 | 1.10×10−12 | 1.71×10−10 | 19.18967 |
| LOC101928622 | 4.938621 | 5.265134 | 25.09824 | 9.64×10−18 | 2.09×10−14 | 30.48735 |
| ZPBP | −5.39989 | 9.272953 | −13.465 | 3.91×10−12 | 4.71×10−10 | 17.91695 |
| PPDPF | −5.48137 | 7.809478 | −15.1896 | 3.50×10−13 | 7.22×10−11 | 20.32858 |
| MSMB | −5.54533 | 8.057753 | −11.4953 | 8.49×10−11 | 4.52×10−09 | 14.82184 |
| TSACC | −5.62232 | 9.854213 | −12.3248 | 2.22×10−11 | 1.74×10−09 | 16.17356 |
| PIP | −5.63595 | 8.930515 | −9.11178 | 6.10×10−09 | 1.17×10−07 | 10.49482 |
| TCEAL4 | −5.81103 | 7.351813 | −13.5001 | 3.72×10−12 | 4.57×10−10 | 17.96871 |
| RPS5 | −5.83978 | 7.284755 | −21.5724 | 2.41×10−16 | 3.71×10−13 | 27.44499 |
| RPL7AL2 | −6.12204 | 7.736477 | −16.0315 | 1.17×10−13 | 3.01×10−11 | 21.41787 |
| SEMG1 | −6.1932 | 10.05513 | −15.8135 | 1.54×10−13 | 3.67×10−11 | 21.14096 |
| SEMG2 | −6.69875 | 7.822177 | −10.4268 | 5.32×10−10 | 1.81×10−08 | 12.96753 |
DEGs, differentially expressed genes; FC, fold change; HS3ST3A1, heparan sulfate-glucosamine 3-sulfotransferase 3A1; XKR4, XK related 4; C14orf37, armadillo-like helical domain containing 4; HYMAI, hydatidiform mole associated and imprinted (non-protein coding); WIF1, WNT inhibitory factor 1; SRGAP2C, SLIT-ROBO Rho GTPase activating protein 2C; SRGAP2, SLIT-ROBO Rho GTPase activating protein 2; MOGAT1, monoacylglycerol O-acyltransferase 1; ZPBP, zona pellucida binding protein; PPDPF, pancreatic progenitor cell differentiation and proliferation factor; MSMB, microseminoprotein β; TSACC, TSSK6 activating cochaperone; PIP, prolactin induced protein; TCEAL4, transcription elongation factor A like 4; RPS5, ribosome protein S5; RPL7AL2, ribosomal protein L7a pseudogene 12; SEMG1, semenogelin 1; SEMG2, semenogelin 2; AveExpr, Average expression of this probe set in all samples; t, t-value in the T-test between two sets of Bayes adjusted; B, the logarithmic value of the standard deviation obtained by empirical Bayes.
Gene Ontology terms enriched in the teratozoospermia-related module.
| A, Upregulated genes | |||
|---|---|---|---|
| ID | Term | Count | FDR |
| GO.0007399 | Nervous system development | 37 | 4.12×10−07 |
| GO.0032502 | Developmental process | 60 | 3.97×10−07 |
| GO.0048856 | Anatomical structure development | 56 | 3.52×10−07 |
| GO.0098794 | Postsynapse | 16 | 3.04×10−07 |
| GO.0050793 | Regulation of developmental process | 39 | 1.73×10−07 |
| GO.2000026 | Regulation of multicellular organismal development | 34 | 6.97×10−08 |
| GO.0097060 | Synaptic membrane | 15 | 6.89×10−08 |
| GO.0051094 | Positive regulation of developmental process | 30 | 1.85×10−08 |
| GO.0051239 | Regulation of multicellular organismal process | 44 | 1.85×10−08 |
| GO.0045211 | Postsynaptic membrane | 16 | 4.26×10−10 |
| GO.0045047 | Protein targeting to ER | 62 | 1.05×10−37 |
| GO.0031974 | Membrane-enclosed lumen | 497 | 3.44×10−38 |
| GO.0044428 | Nuclear part | 456 | 3.40×10−38 |
| GO.0006614 | SRP-dependent cotranslational protein targeting to membrane | 62 | 2.24×10−38 |
| GO.0006413 | Translational initiation | 89 | 6.39×10−39 |
| GO.0003723 | RNA binding | 262 | 8.87×10−40 |
| GO.0005737 | Cytoplasm | 941 | 2.62×10−43 |
| GO.0032991 | Macromolecular complex | 552 | 4.40×10−48 |
| GO.0044446 | Intracellular organelle part | 807 | 1.03×10−54 |
| GO.0044422 | Organelle part | 821 | 8.62×10−55 |
FDR, false discovery rate; ER, endoplasmic reticulum; SRP, signal recognition particle.
KEGG pathways enriched in the teratozoospermia-related module.
| A, Up-regulated genes | ||
|---|---|---|
| Term | Count | FDR |
| Neuroactive ligand-receptor interaction | 15 | 7.99×10−08 |
| Retrograde endocannabinoid signaling | 9 | 1.95×10−06 |
| Morphine addiction | 8 | 9.94×10−06 |
| GABAergic synapse | 7 | 8.70×10−05 |
| Nicotine addiction | 5 | 3.09×10−04 |
| Rap1 signaling pathway | 9 | 3.89×10−04 |
| Ras signaling pathway | 9 | 5.16×10−04 |
| PI3K-Akt signaling pathway | 11 | 5.16×10−04 |
| Glutamatergic synapse | 6 | 2.62×10−03 |
| Cholinergic synapse | 5 | 1.98×10−02 |
| Ribosome | 63 | 1.12×10−34 |
| Huntington s disease | 56 | 8.43×10−19 |
| Oxidative phosphorylation | 46 | 3.97×10−18 |
| Parkinson's disease | 45 | 4.14×10−16 |
| Alzheimer's disease | 46 | 7.12×10−14 |
| Proteasome | 20 | 1.94×10−10 |
| Non-alcoholic fatty liver disease | 38 | 2.39×10−10 |
| Metabolic pathways | 147 | 2.17×10−09 |
| Protein processing in endoplasmic reticulum | 35 | 2.29×10−07 |
| RNA transport | 30 | 1.65×10−05 |
KEGG, Kyoto Encyclopedia of Genes and Genomes; FDR, false discovery rate; GABA, γ-aminobutyric acid.
Figure 3.Network analysis of DEGs. (A) PPI network of upregulated DEGs obtained from the STRING database. Upregulated network was constructed with 134 nodes and 199 edges. (B) PPI network of downregulated DEGs obtained from the STRING database. The downregulated network was constructed with 160 nodes and 1,024 edges. DEGs, differentially expressed genes; PPI, protein-protein interaction.