| Literature DB >> 19055830 |
Ralph T Wiedmann1, Timothy P L Smith, Dan J Nonneman.
Abstract
BACKGROUND: Relatively little information is available for sequence variation in the pig. We previously used a combination of short read (25 base pair) high-throughput sequencing and reduced genomic representation to discover > 60,000 single nucleotide polymorphisms (SNP) in cattle, but the current lack of complete genome sequence limits this approach in swine. Longer-read pyrosequencing-based technologies have the potential to overcome this limitation by providing sufficient flanking sequence information for assay design. Swine SNP were discovered in the present study using a reduced representation of 450 base pair (bp) porcine genomic fragments (approximately 4% of the swine genome) prepared from a pool of 26 animals relevant to current pork production, and a GS-FLX instrument producing 240 bp reads.Entities:
Mesh:
Substances:
Year: 2008 PMID: 19055830 PMCID: PMC2612698 DOI: 10.1186/1471-2156-9-81
Source DB: PubMed Journal: BMC Genet ISSN: 1471-2156 Impact factor: 2.797
Percent repetitive element content of reduced representation libraries.
| Library1 | Restriction Enzyme | Site | SINEs2 | LINEs | LTR | DNA | Total | Centromeric Repeats3 |
| GSS | Sheared | random | 11.30 | 16.14 | 2.80 | 1.51 | 31.75 | 0.00 |
| pRRL1 | AG|CT | 10.21 | 15.63 | 2.46 | 3.16 | 31.46 | 0.00 | |
| pRRL2 | GG|CC | 13.19 | 11.33 | 3.11 | 1.79 | 29.42 | 0.00 | |
| pRRL3 | CG|CG | 1.98 | 0.40 | 0.03 | 0.42 | 2.83 | 23.83 | |
| pRRL4 | TTT|AAA | 32.04 | 4.85 | 1.67 | 1.09 | 39.65 | 0.00 | |
| pRRL5 | CAG|CTG | 20.99 | 4.91 | 1.71 | 1.25 | 28.86 | 6.18 |
1 GSS refers to genome survey sequence and values were obtained from Wernersson et al., 2005 [14].
2 Values are given as percent of total. SINEs refers to short interspersed elements porcine SINE elements and MIRs; LINEs refers to LINE1, LINE 2, L3/CR1 and RTE elements; LTR refers to long terminal repeats MaLRs, and ERVL; DNA refers to MER1 and MER2-type elements. Total refers to percent of repetitive content identified by RepeatMasker.
3Values in the centromeric repeat column represent percent of sequences that contain a centromeric repeat and not percent of sequence content.
Figure 1Distribution of the contig lengths showing that most of the contigs consist of reads from one end of the restriction fragments. About 25% of the contigs span the entire restriction fragment.
Figure 2The plot shows the distribution of 115,572 SNPs by position in the contigs. The number of SNPs mirrors the profile of contig lengths.
Figure 3The distribution of contigs by number of SNPs.
Figure 4Comparison of the distribution of total contigs to contigs containing SNPs as the depth of coverage changes. The average depth of coverage where SNP were detected was 12.6.
Figure 5Contig spacing along the sequenced pig chromosomes 1, 4, 5, 7, 11, 13 and 14.
Average minor allele frequencies by breed for 168 sampled SNPs.
| Breed | Number of Animals1 | Number in Pool2 | Genotyped (%)3 | Monomorphic (%)4 | Average MAF | Heterozygosity |
| Berkshire | 7 | 3 | 98.21 | 20.83 | 0.2142 | 0.2866 |
| Duroc | 40 | 4 | 98.13 | 15.48 | 0.2003 | 0.2537 |
| Eur. Wild Boar | 5 | 0 | 99.28 | 44.64 | 0.1533 | 0.2520 |
| Hampshire | 29 | 4 | 98.81 | 12.50 | 0.2131 | 0.2692 |
| Landrace | 43 | 4 | 98.01 | 3.57 | 0.2457 | 0.3017 |
| Meishan | 5 | 0 | 98.10 | 44.64 | 0.1447 | 0.1881 |
| Pietrain | 6 | 2 | 98.81 | 25.00 | 0.2041 | 0.2892 |
| Yorkshire | 41 | 4 | 98.88 | 5.36 | 0.2429 | 0.2887 |
1Number of animals of each breed genotyped.
2Number of animals of each breed in the discovery library.
3Percent of animals that produced a genotype call.
4Percent of markers that were monomorphic for that breed.
Figure 6Correlation between allele frequency estimated by 454 sequencing and genotyping.