| Literature DB >> 23409192 |
Pranab J Das1, Fiona McCarthy, Monika Vishnoi, Nandina Paria, Cathy Gresham, Gang Li, Priyanka Kachroo, A Kendrick Sudderth, Sheila Teague, Charles C Love, Dickson D Varner, Bhanu P Chowdhary, Terje Raudsepp.
Abstract
Mature mammalian sperm contain a complex population of RNAs some of which might regulate spermatogenesis while others probably play a role in fertilization and early development. Due to this limited knowledge, the biological functions of sperm RNAs remain enigmatic. Here we report the first characterization of the global transcriptome of the sperm of fertile stallions. The findings improved understanding of the biological significance of sperm RNAs which in turn will allow the discovery of sperm-based biomarkers for stallion fertility. The stallion sperm transcriptome was interrogated by analyzing sperm and testes RNA on a 21,000-element equine whole-genome oligoarray and by RNA-seq. Microarray analysis revealed 6,761 transcripts in the sperm, of which 165 were sperm-enriched, and 155 were differentially expressed between the sperm and testes. Next, 70 million raw reads were generated by RNA-seq of which 50% could be aligned with the horse reference genome. A total of 19,257 sequence tags were mapped to all horse chromosomes and the mitochondrial genome. The highest density of mapped transcripts was in gene-rich ECA11, 12 and 13, and the lowest in gene-poor ECA9 and X; 7 gene transcripts originated from ECAY. Structural annotation aligned sperm transcripts with 4,504 known horse and/or human genes, rRNAs and 82 miRNAs, whereas 13,354 sequence tags remained anonymous. The data were aligned with selected equine gene models to identify additional exons and splice variants. Gene Ontology annotations showed that sperm transcripts were associated with molecular processes (chemoattractant-activated signal transduction, ion transport) and cellular components (membranes and vesicles) related to known sperm functions at fertilization, while some messenger and micro RNAs might be critical for early development. The findings suggest that the rich repertoire of coding and non-coding RNAs in stallion sperm is not a random remnant from spermatogenesis in testes but a selectively retained and functionally coherent collection of RNAs.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23409192 PMCID: PMC3569414 DOI: 10.1371/journal.pone.0056535
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Venn diagram of transcripts detected in stallion sperm and testes by microarray analysis (SNR ≥2).
Figure 2Heat maps of GO functional groups for sperm up-regulated (a) and sperm down-regulated (b) transcripts.
Blue boxes denote that the gene has not been associated with the corresponding GO category. Genes with symbols in red font were validated by qRT-PCR.
Figure 3Validation of significantly (p<0.05) sperm up-regulated (a) and sperm down-regulated (b) genes by qRT-PCR (see also Table 1).
Selected most significantly (p<0.005) differentially expressed genes between stallion sperm and testes by microarray analysis and qRT-PCR (see also Fig. 3).
| Gene symbol | DE | Primers for qRT-PCR 5′-3′ | Function: GeneCards: |
|
| up | F: | Embryonic development |
| R: | |||
|
| up | F: | Unannotated |
| R: | |||
|
| up | F: | Sperm motility-sperm flagellum outer layer |
| R: | |||
|
| up | F: | Sperm structure -organizing the cytoskeleton |
| R: | |||
|
| up | F: | Sperm structure-outer dense fiber of sperm tails |
| R: | |||
|
| up | F: | Regulation-regulator of smooth muscle cell differentiation and proliferation, defects in male reproductive organs; cryptorchid phenotype |
| R: | |||
|
| up | F: | Regulation-bulk-protein degradation pathway, essential for autophagosomal formation |
| R: | |||
|
| down | F: | Regulation-cytosolic and membrane bound- detoxification of electrophilic compounds |
| R: | |||
|
| down | F: | Sperm structure -minus-end, microtubule-based motile processes |
| R: | |||
|
| down | F: | Surface protein -sperm surface zona pellucida binding protein |
| R: | |||
|
| down | F: | Regulation-negative regulation of 5-phosphoribose 1-diphosphate synthesis |
| R: |
Summary statistics for stallion sperm RNA-seq.
| Number of: | Sperm 1 | Sperm 2 |
| Raw reads | 64,488,380 | 76,386,416 |
| Base-pairs | 3,224,419,000 | 3,819,320,800 |
| Reads aligned to EcuCab2 | 45,266,539 | 42,335,436 |
| Alignment % to EcuCab2 | 70.2 | 55.4 |
| Non-match reads | 19,221841 | 34,050,980 |
| Calculated total mapped reads | 38,087,876 | 30,261,556 |
| Unique mapped reads | 17,170 | 17,069 |
Aligned reads with calculated AC values.
Reads with AC ≥1 mapped to unique locations (see Table S5 for details).
Figure 4Summary statistics for mapped RNA sequence tags
: (a) Comparison of mapped tags (AC≥1) between the two sperm samples; (b) Proportions of tags with very high (AC≥100), high (10
Distribution and expression of mapped RNA sequence tags in the horse genome.
| Sequence map data | RNA-seq data | |||||
| Horse chr. | Chr size, Mb | No. of genes* | Genes/Mb | No of tags(AC≥1) | Tags/Mb | ACmax |
| ECA1 | 185.8 | 2070 | 11.1 | 1,741 | 9 | 242,049 |
| ECA2 | 120.8 | 1273 | 10.5 | 696 | 6 | 85,760 |
| ECA3 | 119.4 | 1063 | 8.9 | 566 | 5 | 155,206 |
| ECA4 | 108.5 | 980 | 9.0 | 390 | 4 | 30,327 |
| ECA5 | 99.6 | 1221 | 12.3 | 440 | 4 | 15,063 |
| ECA6 | 84.7 | 1107 | 13.1 | 346 | 4 | 31,927 |
| ECA7 | 98.5 | 1455 | 14.8 | 412 | 4 | 49,977 |
| ECA8 | 94 | 880 | 9.4 | 372 | 4 | 9,702 |
| ECA9 | 94 | 610 | 6.5 | 276 | 3 | 7,597 |
| ECA10 | 83.9 | 1204 | 14.4 | 1,451 | 17 | 18,607 |
| ECA11 | 61.3 | 1245 | 20.3 | 1,544 | 25 | 49,866 |
| ECA12 | 33 | 742 | 22.5 | 775 | 23 | 1,244 |
| ECA13 | 42.5 | 745 | 17.5 | 1,023 | 24 | 4,197 |
| ECA14 | 93.9 | 839 | 8.9 | 1,141 | 12 | 10,517 |
| ECA15 | 91.5 | 815 | 8.9 | 1,159 | 13 | 42,457 |
| ECA16 | 87.3 | 846 | 9.7 | 1,160 | 13 | 1,592 |
| ECA17 | 80.7 | 497 | 6.2 | 754 | 9 | 4,436 |
| ECA18 | 82.5 | 578 | 7.0 | 782 | 9 | 5,761 |
| ECA19 | 59.9 | 541 | 9.0 | 574 | 10 | 2,189 |
| ECA20 | 64.1 | 840 | 13.1 | 484 | 8 | 57,595 |
| ECA21 | 57.7 | 491 | 8.5 | 290 | 5 | 1,717 |
| ECA22 | 49.9 | 613 | 12.3 | 278 | 6 | 3,832 |
| ECA23 | 55.7 | 410 | 7.4 | 281 | 5 | 14,218 |
| ECA24 | 46.7 | 564 | 12.1 | 276 | 6 | 37,167 |
| ECA25 | 39.5 | 605 | 15.3 | 284 | 7 | 12,795 |
| ECA26 | 41.8 | 272 | 6.5 | 203 | 5 | 12,070 |
| ECA27 | 39.9 | 277 | 6.9 | 217 | 5 | 5,922 |
| ECA28 | 46.1 | 477 | 10.3 | 263 | 6 | 56,479 |
| ECA29 | 33.6 | 246 | 7.3 | 141 | 4 | 55,336 |
| ECA30 | 30 | 218 | 7.3 | 127 | 4 | 1,233 |
| ECA31 | 24.9 | 176 | 7.1 | 116 | 5 | 19,300 |
| ECAX | 124.1 | 1239 | 10.0 | 390 | 3 | 133,493 |
| ECAUn | n/a | n/a | n/a | 302 | n/a | 272,950 |
| Mt | 0.016 | 37 | 2312.5 | 3 | 188 | 4,390 |
|
|
| |||||
Mb–megabase-pair; AC–average coverage; Mt–mitochondrial genome; map information for horse chromosomes was retrieved from Ensemb (http://www.ensembl.org/index. html); * includes known and novel protein coding, miRNA, rRNA, snRNA, snoRNA and Misc RNA genes.
Figure 5Structural annotation of 19, 257 mapped RNA sequence tags (AC≥1):
(a) Distribution of the tags in structural annotation categories by ERANGE; (b) Comparison of annotated genes by GOanna (human genome) and ERANGE (horse genome).
NCBI BLAST alignments for 12 most abundant (AC>50,000) un-annotated mapped RNA sequence tags.
| Chromosomal Location | AC | NCBI BLAST alignment | NCBI accession number | E-value | Identity % |
| chrUn:41671499-41671648 | 272,950 |
| AB117610 | 2E-48 | 92 |
| chr1:183854089-183854405 | 189,364 |
| AB061847 | 3E-150 | 99 |
| chr3:36417092-36417971 | 155,205 | Mouse 28S rRNA | X00525 | 0 | 91 |
| chrX:51467917-51468014 | 133,492 |
| EU727190 | 1E-13 | 93 |
| chr1:89070737-89071827 | 104,507 |
| NR_046309 | 0 | 99 |
| chrUn:55274673-55275483 | 98,778 |
| M11167 | 0 | 91 |
| chrX:87062618-87062767 | 82,610 |
| NG_002426 | 2E-54 | 94 |
| chrUn:64060479-64061987 | 62,499 |
| JN412502 | 0 | 99 |
| chr20:7063095-7063141 | 57,594 |
| EU727191 | 3E-10 | 98 |
| chr28:36791911-36792005 | 56,479 |
| M20910 | 9E-30 | 94 |
| chr29:1282347-1282467 | 55,336 |
| NR_046235 | 3E-51 | 99 |
| chr1:89070491-89070650 | 53,940 |
| XM_001916364 | 1E-60 | 94 |
Structural and functional annotations for mRNAs and ESTs with the highest AC values by RNA-seq.
| NCBI Accession No | Gene symbol | Gene name | Location Chr:Mb | AC value | Predicted or known function(s) | Ref. |
| mRNA | ||||||
| NM_001081847 |
| Matrix metallopeptidase 1 | 7:12.7 | 11766 | spermatogenesis |
|
| NM_001082495 |
| Matrix metallopeptidase 3 | 7:12.7 | 11766 | spermatogenesis |
|
| NM_001135102 |
| Transition protein2 | 13:33.2 | 1730 | sperm chromatin structure |
|
| NM_001083596 |
| Protamine 1 | 13:33.2 | 1730 | sperm chromatin structure |
|
| NM_001159690 |
| Pyruvate kinase, muscle | 1:121.0 | 297 | high fertility sperm |
|
| NM_001163873 |
| Glucose-regulated protein | 28:27.3 | 238 | sperm maturation |
|
| NM_001081764 |
| Collagen, type II, alpha 1 | 6:65.6 | 222 | testes development and descent, male infertility |
|
| NM_001160296 |
| F-box protein 9 | 20:50.6 | 182 | expressed in male germ-cells, sperm differentiation |
|
| NM_001081842 |
| Caspase 1, apoptosis-related cysteine peptidase | 7:14.5 | 119 | male fertility |
|
| NM_001081932 |
| Cysteine-rich secretory protein 2 | 20:47.7 | 114 | sperm capacitation and sperm-egg fusion |
|
| NM_001081874 |
| Cysteine-rich secretory protein 3 | 20:47.7 | 114 | protects sperm from degradation |
|
| dbEST | ||||||
| CD470129 |
| Nuclear export mediator factor | 1:183.8 | 242048 | sperm-egg interaction |
|
| CX595503 |
| Beta-catenin-interacting protein 1 | 2:41.4 | 85760 | cytoskeletal, cellular morphogenesis, germ cell loss and sterility |
|
| CD466273 |
| Lymphocyte cytosolic protein 1 | 7:12.7 | 11766 | sperm maturation |
|
| CD472316 |
| Deoxynucleotidyltransferase, terminal, interacting protein 2 | 5:71.1 | 11420 | chromatin remodeling |
|
| CD467145 |
| FGD1 family, member 3 | 23:54.9 | 10551 | sperm motility |
|
| CX595998 |
| LYR motif containing 4 | 20:5.6 | 8565 | mitochondrial membrane polarization |
|
| CX596255 |
| Protein disulfide-isomerase A4 precursor | 4:101.1 | 4232 | spermatogenesis, sperm maturation |
|
| CX592294 |
| NADH dehydrogenase (ubiquinone) flavoprotein 2 | 8:34.0 | 1997 | expressed in sperm |
|
Figure 6Comparison of RNA-seq data with current equine gene models:
(a) PKM2 showing 9 in silico prediction sites, of which two are positioned 5′ upstream to exon 1; (b) CRISP3 with 3 in silico prediction sites, all located 5′ upstream to exon 1; (c) PRM1 and TNP2 cluster (the protamine cluster) with 12 in silico prediction sites of which only two align with PRM1 and TNP2 exons. Black boxes with numbers –exons in current gene models; blue boxes –very highly expressed tags (AC≥100); red boxes–highly expressed tags (10
Figure 7ECAY transcripts in stallion sperm.
Agarose gel images showing RT-PCR amplicons of 7 ECAY genes and transcripts in stallion sperm.