| Literature DB >> 21729323 |
Ted Cosart1, Albano Beja-Pereira, Shanyuan Chen, Sarah B Ng, Jay Shendure, Gordon Luikart.
Abstract
BACKGROUND: Gene-targeted and genome-wide markers are crucial to advance evolutionary biology, agriculture, and biodiversity conservation by improving our understanding of genetic processes underlying adaptation and speciation. Unfortunately, for eukaryotic species with large genomes it remains costly to obtain genome sequences and to develop genome resources such as genome-wide SNPs. A method is needed to allow gene-targeted, next-generation sequencing that is flexible enough to include any gene or number of genes, unlike transcriptome sequencing. Such a method would allow sequencing of many individuals, avoiding ascertainment bias in subsequent population genetic analyses.We demonstrate the usefulness of a recent technology, exon capture, for genome-wide, gene-targeted marker discovery in species with no genome resources. We use coding gene sequences from the domestic cow genome sequence (Bos taurus) to capture (enrich for), and subsequently sequence, thousands of exons of B. taurus, B. indicus, and Bison bison (wild bison). Our capture array has probes for 16,131 exons in 2,570 genes, including 203 candidate genes with known function and of interest for their association with disease and other fitness traits.Entities:
Mesh:
Year: 2011 PMID: 21729323 PMCID: PMC3146453 DOI: 10.1186/1471-2164-12-347
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Proportion of targeted exonic base pairs with a consensus genotype. All have a Phred-like quality score of at least 30. Total base pair counts, in millions, are plotted at selected minimum depths of coverage. Our estimates of exonic fixed differences and SNPs are based on consensus genotypes with coverage of at least 8 ×.
Figure 2Chromosomal positions. a, all consensus base differences from the reference taurine genome, and b, heterozygous SNPs only. Both maps show consensus bases with at least 8 × coverage and Phred-like quality of at least 30. Numbers in the legends give totals for variants for each species.
Variant Summary
| Bison | Zebu | Taurine | |
|---|---|---|---|
| 2,426 | 3,890 | 2,525 | |
| 8,635 | 1,634 | 1,329 | |
| 11,061 (0.45%) | 5,524 (0.23%) | 3,854 (0.16%) | |
| 2,447,500 | 2,395,651 | 2,306,566 | |
| 483 (4.37%) | 594 (10.75%) | 545 (14.14%) | |
| 10 | 4 | 1 | |
For each individual, total consensus bases different from the reference, for heterozygous SNPs, fixed differences, and concordance with 1.8 million entries in NCBI's dbSNP database. Total differences from the reference are also given as a percentage of total genotyped bases. dbSNP position matches are also given as a percentage of total differences. dbSNP allele mismatches give the number of alleles that differed from a dbSNP allele while matching its position; for example, in the bison, of the 483 positions matching SNPs in dbSNP, 10 showed alleles different than those listed at dbSNP.
Sanger Sequence Calls vs. Maq
| bison | chr6 | 88531917 | 88532399 | 88532280 | A | G | G | |
| bison | chr6 | 88531917 | 88532399 | 88532296 | T | C | C | |
| zebu | chr6 | 88531917 | 88532399 | 88532296 | T | Y | Y | |
| zebu | chr6 | 88531917 | 88532399 | 88532332 | C | M | M | |
| zebu | chr6 | 88531917 | 88532399 | 88532339 | A | R | R | |
| zebu | chr6 | 88531917 | 88532399 | 88532393 | G | R | R | |
| bison | chr4 | 95689756 | 95690201 | 95690049 | T | C | C | |
| zebu | chr4 | 95689756 | 95690201 | 95690049 | T | C | C | |
| taurine | chr4 | 95689756 | 95690201 | 95690049 | T | C | C | |
| bison | chr10 | 3941758 | 3942115 | 3941786 | T | C | C | |
| bison | chr10 | 3941758 | 3942115 | 3941805 | A | G | G | |
| bison | chr10 | 3941758 | 3942115 | 3941921 | A | G | G | |
| bison | chr10 | 3941758 | 3942115 | 3941934 | G | A | A | |
| bison | chr10 | 3941758 | 3942115 | 3941946 | A | G | G | |
| zebu | chr10 | 3941758 | 3942115 | 3941921 | A | G | G | |
| zebu | chr10 | 3941758 | 3942115 | 3941946 | A | R | R | |
| zebu | chr10 | 3941758 | 3942115 | 3941963 | C | Y | Y | |
| taurine | chr10 | 3941758 | 3942115 | 3941921 | A | R | R | |
| bison | chr17 | 4284137 | 4284804 | 4284160 | T | A | A | |
| bison | chr17 | 4284137 | 4284804 | 4284210 | A | G | G | |
| bison | chr17 | 4284137 | 4284804 | 4284358 | C | Y | Y | |
| bison | chr17 | 4284137 | 4284804 | 4284655 | T | C | C | |
| bison | chr17 | 4284137 | 4284804 | 4284747 | C | T | T | |
| zebu | chr17 | 4284137 | 4284804 | 4284160 | T | W | W | |
| zebu | chr17 | 4284137 | 4284804 | 4284210 | A | G | G | |
| zebu | chr17 | 4284137 | 4284804 | 4284652 | G | K | K | |
| zebu | chr17 | 4284137 | 4284804 | 4284655 | T | Y | Y | |
| taurine | chr17 | 4284137 | 4284804 | 4284210 | A | R | R | |
| taurine | chr17 | 4284137 | 4284804 | 4284652 | G | K | K | |
| bison | chr8 | 112427182 | 112427427 | 112427204 | C | T | T | |
| bison | chr8 | 112427182 | 112427427 | 112427213 | C | T | T | |
| bison | chr8 | 112427182 | 112427427 | 112427326 | A | C | C | |
| bison | chr8 | 112431812 | 112432152 | 112431927 | G | A | A | |
| bison | chr8 | 112434757 | 112435132 | 112435011 | A | C | C | |
| bison | chr8 | 112434757 | 112435132 | 112435120 | C | A | A |
Variant calls in several Sanger sequenced fragments in and near exons in the genes indicated, compared with Maq consensus bases. All variant calls in the Sanger sequenced regions for the bison and zebu are in agreement with the Maq consensus bases. For the taurine five positions (rows in bold) are called as variant by the Maq consensus but not by the Sanger sequenced fragments.
Primers used for PCR amplification and Sanger sequencing
| Gene | Forward (5' to 3') | Reverse (5' to 3') |
|---|---|---|
| AGAAATAATACCATTCTGCAT | GTTGTCTTCTTTGATGTCTCCTTAGAG | |
| GATTCCGCCGCACCTCTC | CCTGTGCAAGGCTGCACAGCC | |
| TCCTCTTCTGACTCGGATCTTT | CCAAGTTCTGTAAATGCTGTCTGC | |
| TGGGTCTGGGCTGTCATCAT | AAGAGATGTTTCCCCAAGTGTTTT | |
| GACCTGCAGAGGTGTGTGAA | TGAAAAATGGAAAGTGTGCAA | |
| CGGGGAGAGACGACACTACA | TGTTTGCAAATGAACCTAACCA | |
| TCTTTGCTCGTCCCAGTAGC | AAGTGAATGAAAAGGAGACCTCA | |
| GGAGACCTAGATGACTGGGTTG | GGGGCATTTGATGTAGAACTTT |