| Literature DB >> 21283515 |
Christopher M Seabury1, Eric K Bhattarai, Jeremy F Taylor, Ganesh G Viswanathan, Susan M Cooper, Donald S Davis, Scot E Dowd, Mitch L Lockwood, Paul M Seabury.
Abstract
The white-tailed deer (Odocoileus virginianus) represents one of the most successful and widely distributed large mammal species within North America, yet very little nucleotide sequence information is available. We utilized massively parallel pyrosequencing of a reduced representation library (RRL) and a random shotgun library (RSL) to generate a complete mitochondrial genome sequence and identify a large number of putative single nucleotide polymorphisms (SNPs) distributed throughout the white-tailed deer nuclear and mitochondrial genomes. A SNP validation study designed to test specific classes of putative SNPs provides evidence for as many as 10,476 genome-wide SNPs in the current dataset. Based on cytogenetic evidence for homology between cow (Bos taurus) and white-tailed deer chromosomes, we demonstrate that a divergent genome may be used for estimating the relative distribution and density of de novo sequence contigs as well as putative SNPs for species without draft genome assemblies. Our approach demonstrates that bioinformatic tools developed for model or agriculturally important species may be leveraged to support next-generation research programs for species of biological, ecological and evolutionary importance. We also provide a functional annotation analysis for the de novo sequence contigs assembled from white-tailed deer pyrosequencing reads, a mitochondrial phylogeny involving 13,722 nucleotide positions for 10 unique species of Cervidae, and a median joining haplotype network as a putative representation of mitochondrial evolution in O. virginianus. The results of this study are expected to provide a detailed template enabling genome-wide sequence-based studies of threatened, endangered or conservationally important non-model organisms.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21283515 PMCID: PMC3023705 DOI: 10.1371/journal.pone.0015811
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Comparative Contig Overlay with Putative SNPs.
Histogram displaying the bovine chromosome locations (Btau4.0) of 51,087 uniquely aligned (E≤1e-50) white-tailed deer (WTD) sequence contigs and 13,962 putative SNPs (≥3X coverage) derived from sequencing a WTD reduced representation library (RRL) and random shotgun library (RSL; Also See Table S6, S7, and S8). Note, one WTD contig that aligned to BTA13 (Consensusfrom53142contig) was removed by NCBI filtering.
Summary data for a white-tailed deer RRL plus RSL SNP validation study.
| SNP Coverage Class | Minor Allele Count | Total Putative SNPs | % SNPs | Total SNPs Tested | Assays Passing Q.C. | Total SNPs Validated (%) | Total SNPs Predicted |
| 3X | 1 | 6791 | 51.9% | 15 | 14 | 14 (100) | 6791 |
| 4X | 1 | 1951 | 14.9% | 15 | 13 | 13 (100) | 1951 |
| 4X | 2 | 497 | 3.8% | 12 | 12 | 12 (100) | 497 |
| 5X | 1 | 778 | 5.9% | 12 | 12 | 10 (83.3) | 648 |
| 5X | 2 | 255 | 1.9% | 12 | 9 | 7 (77.8) | 198 |
| 6X | 1 | 467 | 3.6% | 12 | 9 | 7 (77.8) | 363 |
| Totals/Avg | 10,739 | 82.0% | 78 | 69 | 63 (89.8) | 10,448 |
Percent of all nuclear SNPs based on total putative SNPs with 3X–150X coverage predicted within 51,087 single-hit contigs (E≤1e-50 to a unique bovine genomic position).
Allele-specific SNP assays exhibiting acceptable and repeatable genotype clustering.
Prediction based on the proportion of validated nuclear SNPs sampled from each class.
Mitochondrial validation assays also confirmed 28 additional SNPs (28/38; 74%).
Figure 2Mitochondrial Haplotype Network.
Median joining (MJ) haplotype network for white-tailed deer mitochondrial SNP variation (n = 28 validated SNPs) discovered by sequencing reduced representation and random shotgun libraries. Numbers indicate SNP positions in numerical order (Table S8). Node sizes are proportional to haplotype frequency, and all branch lengths are drawn to scale. Median vectors are indicated by small red circles. Nine discrete haplotypes were detected among samples from one free-ranging population (yellow) and representatives from three captive populations (black).
Figure 3Mitochondrial Phylogeny.
Majority rule consensus phylogenetic tree for 10 species of Cervidae inferred from concatenated 12S rRNA, 16S rRNA, ND1, ND2, COX1, COX2, ATP8, ATP6, COX3, ND3, ND4L, ND4, ND5, ND6, and CYTB gene sequences. Entries in parenthesis represent Genbank Accession Numbers. The Odocoileus virginianus consensus sequence was derived from this study. Start codons, stop codons, and overlapping regions of genes were excluded. Numbers beside branches indicate posterior probabilities estimated from a Bayesian analysis employing 2×106 generations [35].