| Literature DB >> 30862965 |
Gabriel M Guillocheau1, Abdelmajid El Hou1, Cédric Meersseman1,2, Diane Esquerré3, Emmanuelle Rebours1, Rabia Letaief1, Morgane Simao1, Nicolas Hypolite1, Emmanuelle Bourneuf1,4, Nicolas Bruneau1, Anne Vaiman1, Christy J Vander Jagt5, Amanda J Chamberlain5, Dominique Rocha6.
Abstract
Allelic imbalance is a common phenomenon in mammals that plays an important role in gene regulation. An Allele Specific Expression (ASE) approach can be used to detect variants with a cis-regulatory effect on gene expression. In cattle, this type of study has only been done once in Holstein. In our study we performed a genome-wide analysis of ASE in 19 Limousine muscle samples. We identified 5,658 ASE SNPs (Single Nucleotide Polymorphisms showing allele specific expression) in 13% of genes with detectable expression in the Longissimus thoraci muscle. Interestingly we found allelic imbalance in AOX1, PALLD and CAST genes. We also found 2,107 ASE SNPs located within genomic regions associated with meat or carcass traits. In order to identify causative cis-regulatory variants explaining ASE we searched for SNPs altering binding sites of transcription factors or microRNAs. We identified one SNP in the 3'UTR region of PRNP that could be a causal regulatory variant modifying binding sites of several miRNAs. We showed that ASE is frequent within our muscle samples. Our data could be used to elucidate the molecular mechanisms underlying gene expression imbalance.Entities:
Mesh:
Substances:
Year: 2019 PMID: 30862965 PMCID: PMC6414783 DOI: 10.1038/s41598-019-40781-6
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Summary of SNPs detected in RNA and DNA with their annotation frequencies.
| Variant consequences | DNA | RNA | ||
|---|---|---|---|---|
| Number of SNPs | % | Number of SNPs | % | |
| intergenic variant | 8,099,157 | 67.81 | 54,410 | 20.20 |
| intron variant | 2,922,660 | 24.47 | 106,700 | 39.61 |
| upstream gene variant | 413,619 | 3.46 | 14,734 | 5.47 |
| downstream gene variant | 405,237 | 3.39 | 53,630 | 19.91 |
| synonymous variant | 39,119 | 0.33 | 14,315 | 5.31 |
| missense variant | 29,931 | 0.25 | 9,786 | 3.63 |
| 3 prime UTR variant | 19,332 | 0.16 | 11,555 | 4.29 |
| splice region variant | 6,471 | 0.05 | 475 | 0.18 |
| non coding exon variant | 3,930 | 0.03 | 0 | 0.00 |
| 5 prime UTR variant | 3,544 | 0.03 | 1,374 | 0.51 |
| unindentified | 269 | 0.00 | 132 | 0.05 |
| splice donor variant | 153 | 0.00 | 73 | 0.03 |
| splice acceptor variant | 148 | 0.00 | 44 | 0.02 |
| initiator codon variant | 62 | 0.00 | 0 | 0.00 |
| coding sequence variant | 46 | 0.00 | 59 | 0.02 |
| mature miRNA variant | 37 | 0.00 | 0 | 0.00 |
| stop retained variant | 32 | 0.00 | 15 | 0.01 |
| non coding transcript variant | 19 | 0.00 | 11 | 0.00 |
| frameshift variant | 0 | 0.00 | 1,221 | 0.45 |
| protein altering variant | 0 | 0.00 | 1 | 0.00 |
| non coding transcript exon variant | 0 | 0.00 | 855 | 0.32 |
Distribution of detected SNPs from RNA-Seq and WGS data per individual.
| Individual | DNA only | RNA only | Overlap | BH | Bh | Concordant | Hh | hH | Discordant |
|---|---|---|---|---|---|---|---|---|---|
| LIM01 | 11,420,182 | 19,039 | 44,861 | 27,410 | 11,354 | 86.4% | 4,979 | 1,118 | 13.6% |
| LIM02 | 11,549,679 | 17,681 | 46,624 | 29,535 | 12,671 | 90.5% | 3,974 | 444 | 9.5% |
| LIM03 | 11,753,420 | 15,867 | 49,721 | 31,024 | 16,413 | 95.4% | 1,633 | 651 | 4.6% |
| LIM04 | 11,770,633 | 13,801 | 38,579 | 23,198 | 12,968 | 93.7% | 1,149 | 1,264 | 6.3% |
| LIM05 | 11,668,108 | 11,596 | 36,346 | 22,687 | 11,637 | 94.4% | 1,513 | 509 | 5.6% |
| LIM06 | 11,645,235 | 16,568 | 44,888 | 27,925 | 12,860 | 90.9% | 3,295 | 808 | 9.1% |
| LIM07 | 11,287,139 | 6,218 | 15,075 | 9,088 | 3,439 | 83.1% | 1,947 | 601 | 16.9% |
| LIM08 | 11,734,961 | 18,876 | 55,713 | 35,061 | 17,430 | 94.2% | 2,306 | 916 | 5.8% |
| LIM09 | 11,563,319 | 13,215 | 33,473 | 21,119 | 9,012 | 90.0% | 2,897 | 445 | 10.0% |
| LIM13 | 8,718,858 | 27,165 | 28,651 | 18,020 | 3,671 | 75.7% | 6,707 | 253 | 24.3% |
| LIM14 | 11,665,886 | 12,410 | 34,686 | 22,388 | 9,932 | 93.2% | 1,796 | 570 | 6.8% |
| LIM15 | 11,516,569 | 15,344 | 40,398 | 25,775 | 10,135 | 88.9% | 3,931 | 557 | 11.1% |
| LIM16 | 11,766,765 | 12,041 | 35,918 | 22,612 | 11,854 | 96.0% | 890 | 562 | 4.0% |
| LIM17 | 9,511,239 | 21,194 | 28,415 | 17,675 | 3,677 | 75.1% | 6,863 | 200 | 24.9% |
| LIM18 | 11,755,926 | 8,686 | 24,893 | 15,029 | 8,585 | 94.9% | 902 | 377 | 5.1% |
| LIM19 | 11,517,295 | 15,901 | 40,528 | 25,083 | 11,315 | 89.8% | 3,573 | 557 | 10.2% |
| LIM20 | 11,330,071 | 12,058 | 19,755 | 12,190 | 4,423 | 84.1% | 2,753 | 389 | 15.9% |
| LIM21 | 11,110,581 | 14,100 | 30,031 | 19,059 | 6,466 | 85.0% | 4,147 | 359 | 15.0% |
| LIM22 | 11,534,319 | 23,041 | 77,560 | 45,999 | 24,815 | 91.3% | 5,907 | 839 | 8.7% |
| Average | 11,306,326 | 15,516 | 38,217 | 23,730 | 10,666 | 89.1% | 3,219 | 601 | 10.9% |
BH: Both Homozygous, Bh: Both Heterozygous, Concordant: Rate of BH and Bh, Hh: Homozygous in DNA and Heterozygous in RNA, hH: Heterozygous in DNA and Homozygous in RNA, Discordant: Rate of Hh and hH.
Figure 1Chromosomal distribution with the number of ASE SNPs (grey bars), the size of the genomes (blue bars), the number of genes: total (blue line) and only expressed in muscle (grey line).
Figure 2Enriched GO terms for genes affected by ASE. Functional enrichments for gene ontology (GO) terms associated with the 2,119 genes affected by ASE SNPs (5,658). Only the top ranked 20 terms are shown. The horizontal bar represents the number of ASE-genes involved, with the corresponding q-values. The GO terms categories included Biological Process (black), Cell Component (blue) and Molecular Function (green). The enrichment analysis was performed with the GOrilla tool.
ASE SNPS tested by Pyrosequencing.
| BTA | Position | SNP ID | REF | ALT | ASE count | Gene | Annotation | Validated |
|---|---|---|---|---|---|---|---|---|
| 3 | 32,003,949 | rs382378456 | C | A | 407/336 |
| 3′UTR variant | Yes |
| 7 | 5,520,428 | rs208775256 | G | C | 26/12 |
| missense variant | No |
| 7 | 98,579,574 | rs41255587 | G | A | 146/208 |
| 3′UTR variant | Yes |
| 7 | 98,580,401 | rs209641420 | A | C | 303/221 |
| 3′UTR variant | Yes |
| 8 | 572,167 | rs110694123 | G | A | 48/73 |
| synonymous variant | Yes |
| 8 | 944,049 | rs109919583 | C | T | 47/121 |
| 3′UTR variant | Yes |
REF: reference allele, ALT: alternative allele, ASE count: number of reference allele reads/number of alternative allele reads.
Figure 3Pyrosequencing results of one ASE-SNP in PALLD gene. (a) In gDNA, 47% for allele C and 53% for allele T. (b) In cDNA, 33% for allele C and 67% for allele T.
Figure 4Boxplots of SNP showing genetic variations of APMAP (a) or PRNP (b) expressions. (N) number of animals per genotype.