| Literature DB >> 26475143 |
Joanna Szyda1, Magdalena Frąszczak2, Magda Mielczarek2, Riccardo Giannico3, Giulietta Minozzi3,4, Ezequiel L Nicolazzi3, Stanislaw Kamiński5, Katarzyna Wojdak-Maksymiec6.
Abstract
Despite the growing number of sequenced bovine genomes, the knowledge of the population-wide variation of sequences remains limited. In many studies, statistical methodology was not applied in order to relate findings in the sequenced samples to a population-wide level. Our goal was to assess the population-wide variation in DNA sequence based on whole-genome sequences of 32 Holstein-Friesian cows. The number of SNPs significantly varied across individuals. The number of identified SNPs increased with coverage, following a logarithmic curve. A total of 15,272,427 SNPs were identified, 99.16 % of them being bi-allelic. Missense SNPs were classified into three categories based on their genomic location: housekeeping genes, genes undergoing strong selection, and genes neutral to selection. The number of missense SNPs was significantly higher within genes neutral to selection than in the other two categories. The number of variants located within 3'UTR and 5'UTR regions was also significantly different across gene families. Moreover, the number of insertions and deletions differed significantly among cows varying between 261,712 and 330,103 insertions and from 271,398 to 343,649 deletions. Results not only demonstrate inter-individual variation in the number of SNPs and indels but also show that the number of missense SNPs differs across genes representing different functional backgrounds.Entities:
Mesh:
Year: 2015 PMID: 26475143 PMCID: PMC4653241 DOI: 10.1007/s00335-015-9606-7
Source DB: PubMed Journal: Mamm Genome ISSN: 0938-8990 Impact factor: 2.957
Fig. 1The genome averaged sequencing coverage for each individual
Fig. 2Coverage in percent observed for a particular individual on BTA1 (black), BTA10 (red), BTA20 (green), and BTX (blue) (Color figure online)
Genes selected for comparison
| Gene | BTA | SNPs | |||
|---|---|---|---|---|---|
| NCBI ID | Acronym | Name | All | Missense | |
| Housekeeping | |||||
| 280979 | ACTB | Actin, beta | 25 | 110 | 32 |
| 280729 | B2M | Beta-2-microglobulin | 10 | 1039 | 0 |
| 281181 | G3PDH | Glyceraldehyde-3-phosphate dehydrogenase | 5 | 403 | 1 |
| 515614 | HMBS | Hydroxymethylbilane synthase | 15 | 49 | 0 |
| 767874 | HSP90AB1 | Heat shock 90 kDa protein 1, beta | 23 | 94 | 32 |
| 444874 | UBC | Ubiquitin C | 17 | 431 | 5 |
| Strong selection | |||||
| 767906 | ARL4A | ADP-ribosylation factor-like 4A | 4 | 57 | 1 |
| 407216 | BMP4 | Bone morphogenetic protein 4 | 10 | 122 | 0 |
| 282609 | DGAT1 | Diacylglycerol | 14 | 113 | 1 |
| 535043 | ITGA6 | integrin, alpha 6 | 2 | 3647 | 21 |
| 444881 | MYD88 | Myeloid differentiation primary response 88 | 22 | 176 | 0 |
| Neutral to selection | |||||
| 534958 | AGTPBP1 | ATP/GTP binding protein 1 | 8 | 12,176 | 67 |
| 520250 | ANKRD32 | Ankyrin repeat domain 32 | 7 | 1672 | 29 |
| 533894 | LRP1 | Low-density lipoprotein receptor-related protein 1 | 5 | 1327 | 11 |
| 540504 | SYNE2 | Spectrin repeat-containing nuclear | 10 | 21,384 | 245 |
| 515119 | URI1 | URI1, prefoldin-like chaperone | 18 | 3591 | 20 |
Fig. 3The total number of detected SNPs and related coverage for data representing our study (black) enhanced by data from Stothard et al. (2011) in red, Kõks et al. (2013) in blue, and Kõks et al. (2014) in green. The best fitting function is superimposed on the data points (Color figure online)
Fig. 4The percentage of SNPs with 1 and 3 alleles as well as the total number of SNPs identified across 32 individuals
Fig. 5The number of private SNPs identified on BTA10 using FreeBayes, Samtools, and GATK