| Literature DB >> 25821781 |
Samuele Bovo1, Francesca Bertolini1, Giuseppina Schiavo1, Gianluca Mazzoni1, Stefania Dall'Olio1, Luca Fontanesi1.
Abstract
The aim of this study was to identify single nucleotide polymorphisms (SNPs) that could be associated with back fat thickness (BFT) in pigs. To achieve this goal, we evaluated the potential and limits of an experimental design that combined several methodologies. DNA samples from two groups of Italian Large White pigs with divergent estimating breeding value (EBV) for BFT were separately pooled and sequenced, after preparation of reduced representation libraries (RRLs), on the Ion Torrent technology. Taking advantage from SNAPE for SNPs calling in sequenced DNA pools, 39,165 SNPs were identified; 1/4 of them were novel variants not reported in dbSNP. Combining sequencing data with Illumina PorcineSNP60 BeadChip genotyping results on the same animals, 661 genomic positions overlapped with a good approximation of minor allele frequency estimation. A total of 54 SNPs showing enriched alleles in one or in the other RRLs might be potential markers associated with BFT. Some of these SNPs were close to genes involved in obesity related phenotypes.Entities:
Year: 2015 PMID: 25821781 PMCID: PMC4364060 DOI: 10.1155/2015/950737
Source DB: PubMed Journal: Int J Genomics ISSN: 2314-436X Impact factor: 2.326
Summary of sequencing data obtained from the two reduced representation libraries (RRLs) of the positive (Pos_HaeIII) and negative (Neg_HaeIII) back fat thickness estimated breeding value DNA pools.
| Information1 | Pos_ | Neg_ | Pos + Neg |
|---|---|---|---|
| Sequenced reads | 3,581,496 | 3,887,066 | 7,468,562 |
| Reads after preprocessing | 3,390,796 | 3,731,776 | 7,122,572 |
| Removed duplicates | 698,191 | 845,961 | 1,544,152 |
| Mapped reads (Qm > 20; Rdup) | 1,449,838 | 1,476,125 | 2,925,963 |
| Sequenced bases (Qm > 20; Rdup) | 137,429,598 | 145,859,611 | 256,880,473 |
| Mean and max depth of coverage (Qm > 20; Rdup) | 1.18; 209 | 1.16; 217 | 1.29; 426 |
| Sequenced bases (Qm > 20; RD ≥ 3; Rdup) | 3,394,898 | 3,057,171 | 3,942,266 |
| Sequenced bases retained by SNAPE (Qm > 20; RD ≥ 3; Rdup) | 3,369,555 | 3,034,731 | 237,969 (in common) |
| SNPs (Qm > 20; RD ≥ 3; Rdup) | 10,694 | 10,339 | 39,165 |
1Qm = mapping quality; RD = read depth; Rdup = removed duplicates.
Summary of the SNP annotation results obtained using the variant effect predictor (VEP) tool.
| Gene position or SNP effect | Number of SNPs |
|---|---|
| 3 prime UTR variant | 203 |
| 3 prime UTR variant, NMD transcript variant | 1 |
| 5 prime UTR variant | 58 |
| Downstream gene variant | 2710 |
| Intergenic variant | 24414 |
| Intron variant | 12591 |
| Intron variant, NMD transcript variant | 126 |
| Intron variant, noncoding transcript variant | 306 |
| Missense variant | 159 |
| Missense variant, splice region variant | 8 |
| Noncoding transcript exon variant, noncoding transcript variant | 29 |
| Splice acceptor variant | 2 |
| Splice donor variant | 1 |
| Splice region variant, 3 prime UTR variant | 1 |
| Splice region variant, intron variant | 25 |
| Splice region variant, synonymous variant | 12 |
| Stop gained | 2 |
| Stop lost | 1 |
| Stop retained variant | 1 |
| Synonymous variant | 217 |
| Synonymous variant, NMD transcript variant | 3 |
| Upstream gene variant | 2675 |
| Total* | 43545 |
*The sum includes 39,165 variations, 4,380 of which have multiple annotations, for a total of 43,545 SNP annotations.
Summary of regression analysis between allele frequency estimated by Ion Torrent sequencing and the allele frequency obtained by genotyping with the Illumina PorcineSNP60 BeadChip.
| RD | Polymorphic sites | Polymorphic and monomorphic sites | ||
|---|---|---|---|---|
|
| Positions |
| Positions | |
| ≥3 | 0.1199 | 258 | 0.6882 | 317 (258 + 59) |
| ≥4 | 0.1601 | 99 | 0.6399 | 119 (99 + 20) |
| ≥5 | 0.1611 | 36 | 0.5868 | 41 (36 + 5) |
| ≥6 | 0.3866 | 11 | 0.7006 | 13 (11 + 2) |
RD = read depth; R 2 = regression coefficient; positions: number of genomic sites analyzed.
Figure 1Scatter plot of allele frequency estimated by Ion Torrent sequencing data for SNPs called by at least 6 reads (allele frequency NGS) and obtained by genotyping data (MAF genotyping) for the same SNPs.
Overlapping results between the SNPs associated with back fat thickness as identified with the Ion Torrent sequencing data (P Fisher < 0.05) and those obtained in the genome wide association study (GWAS) reported by Fontanesi et al. [10] (P < 0.05, window = ±0.5 Mbp for each marker).
| Chr. | Marker | PosM | PGWAS | PosSNP |
|
|---|---|---|---|---|---|
| 1 | ALGA0000009 | 52,297 | 2.75 | 68,514 | 2.86 |
| 1 | ALGA0000014 | 79,763 | 1.74 | 68,514 | 2.86 |
| 6 | M1GA0008302 | 787,265 | 1.65 | 873,061 | 1.28 |
| 6 | M1GA0008318 | 945,991 | 4.41 | 873,061 | 1.28 |
| 6 | M1GA0008329 | 996,248 | 9.35 | 873,061 | 1.28 |
| 9 | DRGA0009307 | 17,138,159 | 8.66 | 16,885,924 | 2.81 |
| 12 | DIAS0000309 | 48,865,200 | 9.96 | 48,937,212 | 2.63 |
*Only the top P Fisher for each marker is listed. All other data are presented in Table S5.
Chr. = chromosome; marker = marker in the Illumina PorcineSNP60 BeadChip; PosM = nucleotide position of the marker on the Sscrofa10.2 reference genome; P GWAS = P value of association in the GWAS; PosSNP = nucleotide position on the Sscrofa10.2 reference genome of the SNP having P Fisher < 0.05; P Fisher = P value of Fisher's test.