| Literature DB >> 28202015 |
Aroa Suárez-Vega1, Beatriz Gutiérrez-Gil1, Christophe Klopp2, Gwenola Tosser-Klopp3, Juan José Arranz4.
Abstract
BACKGROUND: The identification of genetic variation underlying desired phenotypes is one of the main challenges of current livestock genetic research. High-throughput transcriptome sequencing (RNA-Seq) offers new opportunities for the detection of transcriptome variants (SNPs and short indels) in different tissues and species. In this study, we used RNA-Seq on Milk Sheep Somatic Cells (MSCs) with the goal of characterizing the genetic variation within the coding regions of the milk transcriptome in Churra and Assaf sheep, two common dairy sheep breeds farmed in Spain.Entities:
Keywords: Dairy Sheep; Milk Somatic Cells; RNA-Seq; Transcriptome Variants
Mesh:
Year: 2017 PMID: 28202015 PMCID: PMC5312585 DOI: 10.1186/s12864-017-3581-1
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Summary of sequencing results according to the annotation performed in this study of the MSC transcriptome based on the sheep genome reference Oar_v3.1
| Total Reads (paired end) | 119325116 | ||
|---|---|---|---|
| Total Tags | 120473958 | ||
| Total Assigned | 110083502 | ||
| Group | Total_bases | Tag_count | Tags/kb |
| CDSa_Exons | 32776750 | 65846229.88 | 2008.93 |
| 5′UTRb_Exons | 3479917 | 960588.13 | 276.04 |
| 3′UTRc_Exons | 8651433 | 4457991.13 | 515.29 |
| Introns | 803999021 | 13554137.88 | 16.86 |
| TSSd_up_1kb | 21995006 | 933617.50 | 42.45 |
| TSS_up_5kb | 101300701 | 2521024.00 | 24.89 |
| TSS_up_10kb | 187280303 | 3117103.63 | 16.64 |
| TESe_down_1kb | 21770670 | 10545653.88 | 484.40 |
| TES_down_5kb | 96011366 | 21156069.50 | 220.35 |
| TES_down_10kb | 173072739 | 22147451.25 | 127.97 |
a CDS Coding DNA sequence; b 5′UTR leader untranslated sequence; c 3′UTR trailer untranslated sequence; d TSS Transcription Start Site; e TES Transcription End Site
Summary statistics of the identified variants
| Fields | Counts SnpEff | Counts VEP | |
|---|---|---|---|
| Variants processed | 216637 | 212742 | |
| SNPs | 197948 | 195503 | |
| Insertions | 8603 | 7233 | |
| Deletions | 10086 | 9032 | |
| Effects by impact | |||
| HIGH | 2128 | 1891 | |
| MODERATE | 22440 | 22385 | |
| LOW | 43986 | 43667 | |
| MODIFIER | 312170 | 232768 | |
| Effects by type | |||
| 3_prime_UTR | 12940 | 12950 | |
| 5_prime_UTR | 1819 | 1824 | |
| downstream_gene | 113225 | 113207 | |
| frameshift | 1162 | 1096 | |
| inframe_deletion | 168 | 314 | |
| inframe_insertion | 127 | 229 | |
| intergenic_region | 96639 | 16991 | |
| intron | 59198 | 58408 | |
| missense | 21841 | 21824 | |
| non_coding_exon | 2002 | 1993 | |
| non_coding_transcript | 10 | 9492 | |
| splice_acceptor | 525 | 332 | |
| splice_donor | 594 | 371 | |
| splice_region | 2353 | 2187 | |
| start_lost | 16 | 28 | |
| stop_gained | 119 | 112 | |
| stop_lost | 28 | 30 | |
| stop_retained | 26 | 31 | |
| synonymous | 43003 | 43004 | |
| upstream_gene | 27952 | 27948 | |
Fig. 1Genome-wide variant densities. Manhattan plot showing the variant density (number of SNPs per Mb) on the Y-axis and the positions of the genome across the 26 ovine autosomes and the X chromosome on the X-axis
Fig. 2Functional characterization established by SnpEff and VEP software for the novel variants identified in this study within the QTL previously reported for milk yield, milk protein percentage and milk fat percentage. a Distribution of the novel variants by impact; b Distribution of moderate impact novel variants within QTL regions by functional effect; c Distribution of high impact novel variants within QTL regions by functional effect
Functionally relevant variants in genes codifying for major milk proteins
| Variant a | Gene | Allele Freq | Effect | AA | |
|---|---|---|---|---|---|
| Assaf | Churra | ||||
|
|
| 0.25 | 0.5 | Missense-Deleterious |
|
|
|
| 0.375 | 0 | Missense-Deleterious |
|
|
|
| 0.375 | 0.5 | Missense-Tolerated |
|
|
|
| 0.125 | 0.5 | Missense-Deleterious |
|
|
|
| 0.38 | 0.75 | Missense-Tolerated |
|
|
|
| 0.625 | 0.25 | Missense-Tolerated |
|
|
|
| 0 | 0.125 | Missense-Tolerated |
|
|
|
| 0.625 | 0.625 | Splice donor | |
|
|
| 0 | 0.125 | Missense-Deleterious |
|
|
|
| 0 | 0.25 | Missense-Tolerated |
|
|
|
| 0.125 | 0.75 | Missense-Tolerated |
|
a For described variants rs identifier is indicated and novel variants are described with the unique ID “INSDC Genome accession:CHROM:POS:REF:ALT”.
Milk fat candidate genes considered in this study
| Gene symbol | Description |
|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Functionally relevant variants detected in the milk fat candidate genes considered in this study
| Varianta | Gene | Allele Freq | Effect | AA | |
|---|---|---|---|---|---|
| Assaf | Churra | ||||
|
|
| 0.5 | 0.5 | High-Splice donor | |
|
|
| 0.125 | 0 | Missense-Deleterious (0) |
|
|
|
| 0.5 | 0.5 | High-Splice aceptor | |
|
|
| 0.25 | 0 | Missense-Deleterious (0.02) |
|
|
|
| 0.25 | 0 | Missense-Deleterious (0) |
|
|
|
| 0.5 | 0.5 | High-Splice aceptor | |
|
|
| 1 | 1 | Moderate-Inframe deletion |
|
|
|
| 0 | 0.125 | Missense-deleterious-low_confidence (0.04) |
|
|
|
| 0.5 | 0.5 | High-Splice donor | |
a For described variants rs identifier is indicated and novel variants are described with the unique ID “INSDC Genome accession:CHROM:POS:REF:ALT”