| Literature DB >> 27899719 |
Gerly Sillaste1, Lauris Kaplinski2, Riho Meier1,3, Ülle Jaakma1,4, Elo Eriste1, Andres Salumets5,6,7,8.
Abstract
DNA compaction with protamines in sperm is essential for successful fertilization. However, a portion of sperm chromatin remains less tightly packed with histones, which genomic location and function remain unclear. We extracted and sequenced histone-associated DNA from sperm of nine ejaculates from three bulls. We found that the fraction of retained histones varied between samples, but the variance was similar between samples from the same and different individuals. The most conserved regions showed similar abundance across all samples, whereas in other regions, their presence correlated with the size of histone fraction. This may refer to gradual histone-protamine transition, where easily accessible genomic regions, followed by the less accessible regions are first substituted by protamines. Our results confirm those from previous studies that histones remain in repetitive genome elements, such as centromeres, and added new findings of histones in rRNA and SRP RNA gene clusters and indicated histone enrichment in some spermatogenesis-associated genes, but not in genes of early embryonic development. Our functional analysis revealed significant overrepresentation of cGMP-dependent protein kinase G (cGMP-PKG) pathway genes among histone-enriched genes. This pathway is known for its importance in pre-fertilization sperm events. In summary, a novel hypothesis for gradual histone-to-protamine transition in sperm maturation was proposed. We believe that histones may contribute structural information into early embryo by epigenetically modifying centromeric chromatin and other types of repetitive DNA. We also suggest that sperm histones are retained in genes needed for sperm development, maturation and fertilization, as these genes are transcriptionally active shortly prior to histone-to-protamine transition.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27899719 PMCID: PMC5184773 DOI: 10.1530/REP-16-0441
Source DB: PubMed Journal: Reproduction ISSN: 1470-1626 Impact factor: 3.906
The coverage averages, maximums and averages of top 5th percentile of all positions of all samples.a
| Sample | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Average sequencing coverage | 0.373 | 0.097 | 0.129 | 0.406 | 0.043 | 0.121 | 0.389 | 0.076 | 0.183 |
| Maximum coverage | 128 | 100 | 104 | 136 | 71 | 107 | 136 | 98 | 130 |
| Average of top 5th percentile | 46.8 | 45.1 | 46.4 | 51.9 | 24.1 | 30.2 | 58.5 | 37.5 | 45.8 |
Sample names contain abbreviation of the name of the animal and sequence number of the ejaculate. Average sequencing coverage – the total length of mapped reads divided by the size of cow genome. Maximum coverage – the maximum number of mapped reads that overlap one genomic position. The top 5th percentile is calculated by cumulative coverage, i.e. those genomic positions with the highest coverage values that together sum up to 5% of total coverage.
The pairwise correlations of the coverages of genomic positions of all samples.a
| Fag1 | Fag2 | Fag3 | Far1 | Far2 | Far3 | Ole1 | Ole2 | Ole3 | |
|---|---|---|---|---|---|---|---|---|---|
| Fag1 | 0.669 | 0.663 | 0.892 | 0.695 | 0.790 | 0.878 | 0.664 | 0.817 | |
| Fag2 | 0.896 | 0.688 | 0.793 | 0.795 | 0.720 | 0.842 | 0.789 | ||
| Fag3 | 0.695 | 0.786 | 0.798 | 0.726 | 0.844 | 0.797 | |||
| Far1 | 0.757 | 0.810 | 0.900 | 0.698 | 0.828 | ||||
| Far2 | 0.826 | 0.755 | 0.807 | 0.804 | |||||
| Far3 | 0.793 | 0.785 | 0.856 | ||||||
| Ole1 | 0.748 | 0.855 | |||||||
| Ole2 | 0.821 | ||||||||
| Ole3 |
Only those positions where all samples had at least one read are included. The correlations are calculated between the enrichment values of all recorded genomic positions (number of reads overlapping this position) in two samples. Sample names contain abbreviation of the name of the animal and sequence number of the ejaculate.
Figure 1The variability of histone enrichment in specific positions. An example of genomic region (chromosome 11) showing the histone content in two different samples (Far2 and Far3). In certain regions, the enrichment is always low, in others always high (in both cases the correlation of enrichment with overall histone content of sample is low) and in some regions, the enrichment is correlated with the histone content of a given sample. X-axis – genomic position and Y-axis sample enrichment. A, B, C – different histone conservation patterns (A – always low, B – always high and C – correlated with average enrichment).
Figure 2The average histone enrichment relative to transcription start site (TSS). The average histone enrichment relative to transcription start site (TSS). X-axis – the distance in base pairs (negative values represent upstream region) from TSS of known genes. Y-axis – the average enrichment by histones of all nine samples. Histone enrichment is the highest immediately after TSS. Another peak at −500 bp from TSS indicates on gene promoter or enhancer regions. Only positions from 2000 bp upstream to 2000 bp downstream are shown for clarity.
Repetitive sequence classes categorized by histone enrichment score.a
| Class | Enrichment score | Number of copies | Overall length in bp |
|---|---|---|---|
| Satellite | 4.9288 | 9481 | 10,216,800 |
| rRNA | 0.9225 | 1432 | 267,280 |
| SRP RNA | 0.5476 | 68 | 13,150 |
| SINE | 0.1635 | 2,157,060 | 341,045,070 |
| RNA | 0.1404 | 337 | 54,870 |
| LTR | 0.1375 | 538,411 | 134,180,980 |
| tRNA | 0.1299 | 2632 | 194,700 |
| LINE | 0.0992 | 1,888,497 | 772,366,750 |
| scRNA | 0.0954 | 31 | 2760 |
The average enrichment is calculated by dividing the total number of histones overlapping given repeat types with the total length of these repeats in genome.
LINE, Long Interspersed Elements; LTR, Long Terminal Repeats; rRNA, ribosomal RNA; scRNA, small cytoplasmatic RNA; SINE, Short Interspersed Elements; SRP RNA, Signal recognition particle RNA; tRNA, transfer RNA.
Averages, regression and correlation values of deciles by coverage.a
| Decile | Fag1 | Fag2 | Fag3 | Far1 | Far2 | Far3 | Ole1 | Ole2 | Ole3 | Test Avg | Positions | Intercept | Slope | Correlation |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 3.79 | 5.43 | 5.98 | 4.7 | 2.13 | 3.42 | 5.29 | 3.95 | 4.99 | 4.32 | 62095 | 0.535 | 4.299 | 0.626 |
| 2 | 6.85 | 8.05 | 9.08 | 8.46 | 3 | 5.21 | 9.46 | 5.75 | 7.9 | 6.5 | 34542 | 0.459 | 4.999 | 0.693 |
| 3 | 8.92 | 9.76 | 11.1 | 10.98 | 3.59 | 6.4 | 12.32 | 6.97 | 9.93 | 7.96 | 26558 | 0.427 | 5.299 | 0.723 |
| 4 | 11.04 | 11.51 | 12.96 | 13.63 | 4.23 | 7.63 | 15.26 | 8.19 | 11.88 | 9.4 | 21430 | 0.416 | 5.401 | 0.74 |
| 5 | 13.67 | 13.25 | 14.97 | 16.73 | 4.96 | 9.06 | 18.53 | 9.55 | 14.08 | 10.98 | 17490 | 0.402 | 5.526 | 0.766 |
| 6 | 17.17 | 15.69 | 17.56 | 20.51 | 6.05 | 11 | 22.74 | 11.39 | 16.86 | 13.09 | 14165 | 0.407 | 5.484 | 0.782 |
| 7 | 21.84 | 18.69 | 20.81 | 25.73 | 7.88 | 13.5 | 28.51 | 14.17 | 20.89 | 15.99 | 11247 | 0.422 | 5.346 | 0.808 |
| 8 | 28.45 | 23.57 | 25.28 | 32.9 | 11.41 | 17.33 | 36.56 | 18.9 | 27.09 | 20.6 | 8739 | 0.474 | 4.86 | 0.825 |
| 9 | 35.63 | 31.74 | 33.07 | 41.82 | 17.05 | 22.27 | 46.98 | 26.34 | 35.3 | 27.63 | 6877 | 0.56 | 4.066 | 0.769 |
| 10 | 51.17 | 40.98 | 42.49 | 57.96 | 23.49 | 31.63 | 65.04 | 34.96 | 49.4 | 37.16 | 4914 | 0.53 | 4.348 | 0.854 |
Deciles are calculated by the average coverage value in the control group (Fag1, Far1 and Ole1). Test Avg – the average coverage in test group (Fag2, Fag3, Far2, Far3, Ole2 and Ole3). Positions – the number of positions in given decile. Intercept and Slope – regression values, divided by Test Avg. Correlation – the correlation coefficient between the average coverage of full genome and Test Avg. Sample names contain abbreviation of the name of the animal and sequence number of the ejaculate.
Figure 3The correlation of histone enrichment in specific positions with the average histone content of samples. Dots – individual samples. Lines – linear regression lines. Only 5th and 10th deciles by cumulative enrichment are shown. Deciles are calculated by different samples than regression values. The higher slope of the regression line of the 5th decile indicates that in these regions the enrichment is more strongly correlated with the histone content. X-axis – histone content (sequencing coverage) of sample. Y-axis – the average normalized enrichment in given decile.