| Literature DB >> 28683716 |
Min Wang1, Timothy P Hancock2, Iona M MacLeod2, Jennie E Pryce3,2, Benjamin G Cocks3,2, Benjamin J Hayes2.
Abstract
BACKGROUND: Enhancers are non-coding DNA sequences, which when they are bound by specific proteins increase the level of gene transcription. Enhancers activate unique gene expression patterns within cells of different types or under different conditions. Enhancers are key contributors to gene regulation, and causative variants that affect quantitative traits in humans and mice have been located in enhancer regions. However, in the bovine genome, enhancers as well as other regulatory elements are not yet well defined. In this paper, we sought to improve the annotation of bovine enhancer regions by using publicly available mammalian enhancer information. To test if the identified putative bovine enhancer regions are enriched with functional variants that affect milk production traits, we performed genome-wide association studies using imputed whole-genome sequence data followed by meta-analysis and enrichment analysis.Entities:
Mesh:
Year: 2017 PMID: 28683716 PMCID: PMC5499214 DOI: 10.1186/s12711-017-0331-4
Source DB: PubMed Journal: Genet Sel Evol ISSN: 0999-193X Impact factor: 4.297
Mapping of bovine candidate enhancers
| dba | Queryb | Metf | m%g | Hitsh | eSNPl | ||||
|---|---|---|---|---|---|---|---|---|---|
| ISeqc | u (bp)d | σ (bp)e | OSeqi | µ (bp)j | σ (bp)k | ||||
| VISTA | 4481 | 1959 | 1395 | COM | 4285 (96%) | 9945 | 896 | 710 | 82,865 |
| LO | 3808 (85%) | 964 | 1399 | 883 | |||||
| BN | 3627 (81%) | 9945 | 896 | 710 | |||||
| FANTOM5 | 109,882 | 277 | 158 | COM | 95,123 (87%) | 30,371 | 231 | 115 | 50,447 |
| LO | 94,302 (86%) | 6061 | 245 | 503 | |||||
| BN | 10,054 (9%) | 30,389 | 231 | 115 | |||||
| dbSUPER | 1745 | 45,750 | 56,541 | COM | 1605 (92%) | 50,938 | 739 | 763 | 282,285 |
| LO | 1549 (88%) | 32 | 1916 | 2447 | |||||
| BN | 1113 (64%) | 50,938 | 739 | 763 | |||||
| H3K4me3 | 13,797 | 2393 | 879 | NA | 13,797 (100%) | 13,660 | 2394 | 879 | 302,659 |
| H3K27ac | 45,784 | 2304 | 1910 | NA | 45,784 (100%) | 42,963 | 2305 | 1910 | 965,716 |
aDatabase from which input query sequences were obtained
bQuery sequence downloaded from respective host sites (may include overlapping regions)
cNumber of input enhancer query sequences
dMean length of input enhancer query sequences (measured in bp)
eStandard deviation length of input enhancer query sequences (measured in bp)
fMethod that returned values in column m%; COM: a set of non-overlapping regions from the combined results of liftOver and BLASTn; LO: liftOver; BN: BLASTn
gNumber of mapped query sequences (ratio of mapped query sequences); NA not applicable
hHit is a non-overlapping genomic interval in the bovine genome that matches with at least one query sequence from the respective input database
iNumber of non-overlapping candidate bovine enhancer genomic intervals; all hits, i.e. output from the respective software were merged into non-overlapping genomic intervals; some OSeq values were larger than corresponding ISeq values because one query sequence was found at multiple locations in the bovine genome
jMean length of bovine putative enhancer sequences (measured in bp)
kStandard deviation length of bovine putative enhancer sequences (measured in bp)
lNumber of imputed whole-genome sequence variants in OSeq genomic intervals
Fig. 1Degree of overlap between bovine enhancer sets (measured by total base pairs)
Degree of overlap between the sets of bovine enhancers analysed
| VISTAa (%) | FANTOM5b (%) | dbSUPERc (%) | H3K27acd (%) | H3K4me3e (%) | |
|---|---|---|---|---|---|
| VISTA | 100 | 19.63 | 71.29 | 1.89 | 0.26 |
| FANTOM5 | 24.92 | 100 | 52.18 | 9.86 | 3.74 |
| dbSUPER | 16.88 | 9.73 | 100 | 4.30 | 1.43 |
| H3K27ac | 0.17 | 0.70 | 1.63 | 100 | 26.97 |
| H3K4me3 | 0.07 | 0.80 | 1.65 | 81.66 | 100 |
Each value in the table represents the ratio, expressed as a percentage, of the total overlapping base pairs between the two enhancer sets listed in a row and column, relative to the total number of base pairs in the enhancer set listed in the corresponding row
aVISTA is a database for evolutionarily ultra-conserved sequences between vertebrates
bFANTOM5 is a database for active enhancers from various human and mouse tissue
cdDbSUPER is a database for super-enhancers from various human or mouse tissues across multiple studies
dH3K27ac represents the dataset from the Villar et al. [29] study, which used ChIP-Seq profiling to detect the regions of the bovine genome that contained the histone modification signal H3K27ac from four bulls’ liver tissues
eH3K4me3 represents the dataset from the Villar et al. [29] study which used ChIP-Seq profiling to detect the regions of the bovine genome that contained the histone modification signal H3K4me3 from four bull’s liver tissues
Fig. 2Degree of overlap between enhancer sets from liver (measured by total base pairs)
Number of imputed whole-genome sequence variants and of significant variants
| Phenotype | GWAS | Meta-analysis | |||
|---|---|---|---|---|---|
| Gender | Variants tested | Significant variants | Filtered variants tested | Significant variants | |
| FY | Bulls | 28,899,038 | 3720 (0.013%) | 23,462,193 | 6967 (0.030%) |
| Cows | 3474 (0.012%) | ||||
| MY | Bulls | 4408 (0.015%) | 23,455,977 | 10,071 (0.043%) | |
| Cows | 6801 (0.024%) | ||||
| PY | Bulls | 1786 (0.006%) | 23,470,099 | 4804 (0.020%) | |
| Cows | 2981 (0.010%) | ||||
Fig. 3Manhattan plots: all GWAS and meta-analysis cohorts
Degree of overlap between significant variants in the sets of bovine enhancers analysed
| VISTA (%) | FANTOM5 (%) | dbSUPER (%) | H3K4me3 (%) | H3K27Ac (%) | |
|---|---|---|---|---|---|
| VISTA | 100 | 31.25 | 50 | 0 | 0 |
| FANTOM5 | 38.46a | 100 | 38.46 | 0 | 0 |
| dbSUPER | 9.88 | 6.17 | 100 | 0 | 1.23 |
| H3K4me3 | 0 | 0 | 0 | 100 | 77.90 |
| H3K27Ac | 0 | 0 | 0.08 | 46.04 | 100 |
aEach value in the table represents the ratio, expressed as a percentage, of the total overlapping variants between the two enhancer sets listed in a row and column, relative to the total number of significant variants in the enhancer set listed in the corresponding row
Fig. 4Degree of overlap between significant imputed WGS SNPs in bovine enhancer sets
Enrichment of significant enhancer SNPs for milk production traits in the permutation tests
| Phenotype | Database | All SNPs/a subset of SNPs | Fold changea | Rankb |
|---|---|---|---|---|
| FY | VISTA | All | 0.16235 | 0.0001 |
| MY | 0.336793 | 0.0001 | ||
| PY | 0 | 0.0001 | ||
| FY | FANTOM5 | 0.200888 | 0.0001 | |
| MY | 0.414773 | 0.0018 | ||
| PY | 0.096781 | 0.0001 | ||
| FY | dbSUPER | 0.226716 | 0.0001 | |
| MY | 0.354855 | 0.0001 | ||
| PY | 0.329258 | 0.0001 | ||
| FY | Villar (H3K4me3) | 2.892337 | <0.0001 | |
| MY | 2.100798 | <0.0001 | ||
| PY | 3.081699 | <0.0001 | ||
| FY | Villar (H3K27ac) | 1.459357 | <0.0001 | |
| MY | 1.29579 | <0.0001 | ||
| PY | 1.458739 | <0.0001 | ||
| FY | Villar:H3K4me3 | H3K4me3-specific only | 3.358042 | <0.0001 |
| MY | 2.440737 | <0.0001 | ||
| PY | 4.414017 | <0.0001 | ||
| FY | Villar:H3K27ac | H3K27ac-sepcific only | 0.967677 | 0.3063 |
| MY | 1.023431 | 0.647 | ||
| PY | 0.953471 | 0.2794 | ||
| FY | Villar:H3K4me3 and H3K27ac | Overlaps: H3K4me3 and H3K27ac | 2.795722 | <0.0001 |
| MY | 2.025759 | <0.0001 | ||
| PY | 2.801491 | <0.0001 |
aFold change is the ratio between the actual number of significant SNPs in an enhancer set and the mean number of all significant SNPs in the 10,000 random samples
bRanking position of the actual number of significant SNPs in an enhancer set within the distribution of all the numbers of significant SNPs for the 10,000 random samples; if the actual number of significant SNPs was the largest among all the numbers of the 10,000 random significant SNPs, the rank was set to <0.0001; otherwise it was denoted as the ranking position of the actual number of significant SNPs among the number of random significant SNPs
Fig. 5Enrichment of significant SNPs in all enhancer sets. The vertical line indicates the number of significant variants in the original analysis. The histograms represent the number of significant variants in random samplings. If an analysis was significant, the vertical line would be on the right to the histogram and clearly separated from the histogram
Fig. 6Functional class of enhancer SNPs. Each pie chart demonstrates the proportion of the total number of SNPs in the putative enhancer regions in the functional classes. The grey area represents variants without functional class annotation
Fig. 7Functional class of core enhancer SNPs. Each pie chart demonstrates the proportion of the total number of SNPs in the Villar identified regions in the functional classes. The grey area represents variants without functional class annotation