| Literature DB >> 31513639 |
Mariângela B C Maldonado1, Nelson B de Rezende Neto2, Sheila T Nagamatsu3,4, Marcelo F Carazzolle3,5, Jesse L Hoff6, Lynsey K Whitacre6, Robert D Schnabel6,7, Susanta K Behura6, Stephanie D McKay8, Jeremy F Taylor6, Flavia L Lopes1.
Abstract
Methylation patterns established and maintained at CpG sites may be altered by single nucleotide polymorphisms (SNPs) within these sites and may affect the regulation of nearby genes. Our aims were to: 1) identify and generate a database of SNPs potentially subject to epigenetic control by DNA methylation via their involvement in creating, removing or displacing CpG sites (meSNPs), and; 2) investigate the association of these meSNPs with CpG islands (CGIs), and with methylation profiles of DNA extracted from tissues from cattle with divergent feed efficiencies detected using MIRA-Seq. Using the variant annotation for 56,969,697 SNPs identified in Run5 of the 1000 Bull Genomes Project and the UMD3.1.1 bovine reference genome sequence assembly, we identified and classified 12,836,763 meSNPs according to the nature of variation created at CpGs. The majority of the meSNPs were located in intergenic regions (68%) or introns (26.3%). We found an enrichment (p<0.01) of meSNPs located in CGIs relative to the genome as a whole, and also in differentially methylated sequences in tissues from animals divergent for feed efficiency. Seven meSNPs, located in differentially methylated regions, were fixed for methylation site creating (MSC) or destroying (MSD) alleles in the differentially methylated genomic sequences of animals differing in feed efficiency. These meSNPs may be mechanistically responsible for creating or deleting methylation targets responsible for the differential expression of genes underlying differences in feed efficiency. Our methyl SNP database (dbmeSNP) is useful for identifying potentially functional "epigenetic polymorphisms" underlying variation in bovine phenotypes.Entities:
Year: 2019 PMID: 31513639 PMCID: PMC6742455 DOI: 10.1371/journal.pone.0222329
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Examples of variants caused by meSNP.
The modification caused by a meSNP can create, destroy or displace a CpG site, the disruption of these sites can drastically change the methylation state. Base in bold represents a meSNP.
Fig 2SNPs versus meSNPs.
Number of meSNPs identified in comparison to the total number of SNPs annotated in Run5 of the 1000 Bull Genomes Project by chromosome.
Characterization of meSNPs according to the variant pattern created at a CpG site.
| Chromosome | Creation of a | % | Elimination of a | % | Displacement of a | % |
|---|---|---|---|---|---|---|
| 1 | 331,266 | 2.58 | 374,131 | 2.91 | 5,165 | 0.04 |
| 2 | 286,089 | 2.23 | 332,345 | 2.59 | 5,100 | 0.04 |
| 3 | 248,693 | 1.94 | 297,706 | 2.32 | 5,004 | 0.04 |
| 4 | 260,690 | 2.03 | 305,178 | 2.38 | 4,707 | 0.04 |
| 5 | 255,625 | 1.99 | 306,213 | 2.39 | 5,097 | 0.04 |
| 6 | 243,815 | 1.90 | 279,834 | 2.18 | 3,993 | 0.03 |
| 7 | 228,011 | 1.78 | 270,713 | 2.11 | 4,393 | 0.03 |
| 8 | 231,166 | 1.80 | 271,667 | 2.12 | 4,126 | 0.03 |
| 9 | 215,600 | 1.68 | 256,171 | 2.00 | 3,815 | 0.03 |
| 10 | 218,695 | 1.70 | 261,782 | 2.04 | 4,094 | 0.03 |
| 11 | 224,534 | 1.75 | 285,744 | 2.23 | 5,249 | 0.04 |
| 12 | 244,606 | 1.91 | 257,742 | 2.01 | 3,795 | 0.03 |
| 13 | 184,502 | 1.44 | 242,025 | 1.89 | 4,339 | 0.03 |
| 14 | 182,869 | 1.42 | 228,305 | 1.78 | 3,802 | 0.03 |
| 15 | 199,706 | 1.56 | 235,545 | 1.83 | 3,733 | 0.03 |
| 16 | 187,079 | 1.46 | 233,294 | 1.82 | 3,999 | 0.03 |
| 17 | 165,861 | 1.29 | 204,661 | 1.59 | 3,654 | 0.03 |
| 18 | 149,413 | 1.16 | 199,793 | 1.56 | 4,404 | 0.03 |
| 19 | 142,988 | 1.11 | 196,753 | 1.53 | 4,674 | 0.04 |
| 20 | 157,402 | 1.23 | 190,181 | 1.48 | 2,868 | 0.02 |
| 21 | 162,161 | 1.26 | 205,731 | 1.60 | 3,790 | 0.03 |
| 22 | 130,358 | 1.02 | 173,493 | 1.35 | 3,066 | 0.02 |
| 23 | 154,706 | 1.21 | 192,864 | 1.50 | 3,721 | 0.03 |
| 24 | 144,810 | 1.13 | 186,311 | 1.45 | 3,156 | 0.02 |
| 25 | 106,921 | 0.83 | 157,543 | 1.23 | 3,491 | 0.03 |
| 26 | 113,731 | 0.89 | 147,737 | 1.15 | 2,566 | 0.02 |
| 27 | 106,800 | 0.83 | 138,189 | 1.08 | 2,470 | 0.02 |
| 28 | 112,764 | 0.88 | 134,260 | 1.05 | 2,116 | 0.02 |
| 29 | 135,654 | 1.06 | 175,098 | 1.36 | 3,231 | 0.03 |
| X | 211,563 | 1.65 | 240,603 | 1.87 | 3,455 | 0.03 |
Fig 3Example of meSNP database conformation, developed from the annotations contained in Run5 of the 1000 Bull Genomes Project and the Variant Effect Predictor.
In the "Genome" column the second nucleotide represents the position of the meSNP in reference genome UMD3.1.1, described in the "Position" column; this nucleotide is also described in the "Ref" column. The "Alt" column represents the change caused by meSNP in the genomic sequence.
Fig 4Genomic distribution of meSNPs.
Classification of meSNPs according to sequence ontology (SO term) provided by Ensembl.
Fig 5Classification of meSNPs located within CGIs.
Genomic distribution of meSNPs identified in (A) relaxed, and (B) strictly defined CGIs.
Distribution of meSNPs identified within CGIs by chromosome.
| Chromosome | meSNPs in relaxed CGIs | % | meSNPs in strict CGIs | % |
|---|---|---|---|---|
| 1 | 50,386 | 0.39 | 8,127 | 0.06 |
| 2 | 52,508 | 0.41 | 9,358 | 0.07 |
| 3 | 53,945 | 0.42 | 10,997 | 0.09 |
| 4 | 52,517 | 0.41 | 10,679 | 0.08 |
| 5 | 58,389 | 0.45 | 10,191 | 0.08 |
| 6 | 45,406 | 0.35 | 8,085 | 0.06 |
| 7 | 47,065 | 0.37 | 10,957 | 0.09 |
| 8 | 45,373 | 0.35 | 10,589 | 0.08 |
| 9 | 47,077 | 0.37 | 9,478 | 0.07 |
| 10 | 41,654 | 0.32 | 8,846 | 0.07 |
| 11 | 59,271 | 0.46 | 11,541 | 0.09 |
| 12 | 44,811 | 0.35 | 7,172 | 0.06 |
| 13 | 46,969 | 0.37 | 7,654 | 0.06 |
| 14 | 42,020 | 0.33 | 7,563 | 0.06 |
| 15 | 38,450 | 0.30 | 6,501 | 0.05 |
| 16 | 43,487 | 0.34 | 7,346 | 0.06 |
| 17 | 43,153 | 0.34 | 8,406 | 0.07 |
| 18 | 51,040 | 0.40 | 11,114 | 0.09 |
| 19 | 51,814 | 0.40 | 11,535 | 0.09 |
| 20 | 31,778 | 0.25 | 4,967 | 0.04 |
| 21 | 42,513 | 0.33 | 7,902 | 0.06 |
| 22 | 41,210 | 0.32 | 7,303 | 0.06 |
| 23 | 47,656 | 0.37 | 9,448 | 0.07 |
| 24 | 38,240 | 0.30 | 6,706 | 0.05 |
| 25 | 49,157 | 0.38 | 11,502 | 0.09 |
| 26 | 32,560 | 0.25 | 5,966 | 0.05 |
| 27 | 32,810 | 0.26 | 6,292 | 0.05 |
| 28 | 20,847 | 0.16 | 4,550 | 0.04 |
| 29 | 40,902 | 0.32 | 7,506 | 0.06 |
| X | 37,062 | 0.29 | 7,615 | 0.06 |
meSNPs responsible for changes in genomic sequence associated with variation in feed efficiency in cattle.
| SI | 8 | rs209651975 | C | T | Destroying CpG | High feed efficiency | 0 | 0.5 | Intergenic | — |
| SI | 13 | rs385282371 | C | G | Creating CpG | High feed efficiency | 0.5 | 0 | Intergenic | — |
| SI | 15 | rs41776554 | T | C | Creating CpG | High feed efficiency | 0.875 | 0.125 | Coding | KIAA1549L |
| SI | 19 | rs209283171 | G | A | Destroying CpG | Low feed efficiency | 0.667 | 0.167 | Proximal promoter | — |
| SI | 21 | rs110952331 | C | T | Destroying CpG | Low feed efficiency | 1 | 0.25 | Intergenic | — |
| SI | 26 | rs110491592 | C | G | Destroying CpG | Low feed efficiency | 0.5 | 0 | Intergenic | — |
| SI | 28 | rs209796881 | C | G | Creating CpG | Low feed efficiency | 0 | 0.5 | 3’UTR | BICC1 |
| SI | 5 | rs137028580 | A | G | Creating CpG | Low feed efficiency | 0.5 | 0 | Coding | WNT5B |
| LD | 15 | rs42963062 | C | T | Destroying CpG | High feed efficiency | 0.75 | 0.25 | Intergenic | — |
| SI | 21 | rs211094916 | G | A | Destroying CpG | Low feed efficiency | 0 | 0.5 | Intergenic | — |
| SI | 24 | rs380531399 | A | G | Creating CpG | Low feed efficiency | 0.625 | 0 | Intronic | CD226 |
| SI | 29 | rs42161551 | A | G | Creating CpG | High feed efficiency | 0.125 | 0.625 | Intronic | ME3 |
Chr Chromosome
Ref Reference
Alt Alternative
HFEG High Feed Efficiency Group
LFEG Low Feed Efficiency Group
SI Small intestine
LD Longissimus dorsi