| Literature DB >> 21272354 |
Rebekah E Oliver1, Gerard R Lazo, Joseph D Lutz, Marc J Rubenfield, Nicholas A Tinker, Joseph M Anderson, Nicole H Wisniewski Morehead, Dinesh Adhikary, Eric N Jellen, P Jeffrey Maughan, Gina L Brown Guedira, Shiaoman Chao, Aaron D Beattie, Martin L Carson, Howard W Rines, Donald E Obert, J Michael Bonman, Eric W Jackson.
Abstract
BACKGROUND: Genetic markers are pivotal to modern genomics research; however, discovery and genotyping of molecular markers in oat has been hindered by the size and complexity of the genome, and by a scarcity of sequence data. The purpose of this study was to generate oat expressed sequence tag (EST) information, develop a bioinformatics pipeline for SNP discovery, and establish a method for rapid, cost-effective, and straightforward genotyping of SNP markers in complex polyploid genomes such as oat.Entities:
Mesh:
Year: 2011 PMID: 21272354 PMCID: PMC3041746 DOI: 10.1186/1471-2164-12-77
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
RNA used for cDNA library construction
| Genotype | RNA (μg) | Total RNA (μg) | Ratio | |||
|---|---|---|---|---|---|---|
| Ogle1040 | 68.00 | 50.77 | 26.62 | 173.14 | 318.53 | 1:0.75:0.39:2.55 |
| TAM O-301 | 73.28 | 19.33 | 93.28 | 43.31 | 229.20 | 1:0.26:1.27:0.59 |
| Gem | 52.10 | 66.70 | 62.28 | 164.17 | 345.25 | 1:1.28:1.20:3.15 |
| HiFi | 70.80 | 20.67 | 155.02 | 105.87 | 352.36 | 1:0.29:2.19:1.50 |
| Mean | 66.05 | 39.37 | 84.30 | 121.62 | 311.34 | 1:0.60:1.28:1.84 |
| Total | 264.18 | 157.47 | 337.20 | 486.49 | 1245.34 | |
454 Sequencing data from cDNA libraries of four oat genotypes
| Genotype | Total reads | Assembled reads | # contigs | Total contig length (Mb) | Mean contig length (bp) | length SD (bp) | Minimum contig size (bp) | Maximum contig size (bp) | Average coverage (x) |
|---|---|---|---|---|---|---|---|---|---|
| Ogle1040 | 283,478 | 231,057 | 32,245 | 18.09 | 561 | 205 | 41 | 2217 | 4.1 |
| TAM O-301 | 381,397 | 307,751 | 42,147 | 24.70 | 586 | 217 | 40 | 4303 | 4.0 |
| Gem | 180,447 | 137,743 | 23,681 | 14.20 | 598 | 238 | 43 | 4229 | 3.1 |
| HiFi | 235,589 | 188,015 | 29,036 | 16.40 | 564 | 220 | 42 | 4416 | 3.6 |
| Total | 1,080,911 | 864,566 | 127,109 |
Figure 1Gene Ontology (GO) comparisons. Comparison of biological (A), cellular(B), and molecular (C) GO terms associated with germplasm accessions Ogle1040, TAM O-301, Gem, and HiFi. Term overlaps are shown with a four-way Venn diagram constructed using VENNY. The shared areas for all germplasm were 83% (3221), 88% (749), and 80% (1668) for the different GO terms assigned to the biological, cellular, and molecular GO term classifications, respectively. The numbers of GO terms matched to assembled sequences were 12260 (Ogle), 16040 (TAM), 9605 (Gem), and 11156 (HiFi).
Figure 2High-resolution melt analysis. Examples of SNP validation (A) and SNP diversity analysis (B) using primer oat EST SNP c22314_1. In example (B), the melting curves of primary alleles are shown red and green, those of heterozygous genotypes in blue, and those of genotypes found to contain a deletion within the amplicons in purple.
SNP allele, map position, and segregation ratio of markers polymorphic in the Ogle1040 × TAM O-301 (OT) mapping population
| SNP | Allele | OT LGa | SRb | |
|---|---|---|---|---|
| Ogle1040 | TAM O-301 | |||
| C51_1 | T | C | OT_4 | 0.90 |
| C51_2 | C | T | OT_4 | 0.81 |
| C104_1 | G | A | OT_30 | 0.49 |
| C250_1 | A | G | OT_20_29 | 2.00 |
| c318_1 | G | C | OT_34 | 1.61 |
| c540_1 | A | G | Frag_14 | 1.08 |
| c841_2 | T | G | OT_32_33 | 0.71 |
| c841_3 | T | G | OT_32_33 | 0.68 |
| c876_1a | T | G | OT_16 | 0.89 |
| c876_1b | A | T | OT_10 | 1.32 |
| c1196_1 | A | T | OT_6 | 1.30 |
| c1361_1 | C | A | OT_32_33 | 0.91 |
| c1579_1 | G | C | OT_20_29 | 1.08 |
| c2043_1 | C | G | OT_31 | 0.91 |
| c2106_2 | - | - | OT_32_33 | 0.96 |
| c2391_1 | C | T | OT_32_33 | 0.68 |
| c2391_2 | C | T | OT_30 | 0.82 |
| c2391_4 | C | G | OT_34 | 1.01 |
| c2391_5 | G | T | OT_34 | 1.09 |
| c2539_1 | - | - | Unlinked | 1.60 |
| c2680_1 | T | C | OT_24 | 0.94 |
| c3212_1 | A | G | OT_24 | 1.48 |
| c3768_1 | G | A | OT_24 | 1.03 |
| c4096_1 | - | - | OT_13 | 0.86 |
| c5153_1 | G | A | OT_27 | 0.79 |
| c5252_1 | G | A | OT_11 | 0.94 |
| c5469_1 | A | G | OT_1 | 1.05 |
| c7461_1 | G | A | OT_2 | 1.08 |
| c10486_1 | T | G | Unlinked | 0.92 |
| c11164_1 | A | G | OT_11 | 1.16 |
| c11164_2 | C | T | OT_6 | 1.21 |
| c12344_1 | C | G | Unlinked | 0.79 |
| c12516_1 | T | A | OT_27 | 0.82 |
| c12516_2 | G | A | OT_34 | 1.08 |
| c14852_2 | C | A | OT_2 | 0.78 |
| c15098_1 | C | T | OT_34 | 1.00 |
| c16908_1 | C | A | OT_10 | 0.73 |
| c22314_1 | G | T | OT_11 | 0.94 |
| c23257_1 | A | C | OT_13 | 0.96 |
| lrc14030_1 | T | A | OT_27 | 0.79 |
| lrc16053_1 | A | G | OT_8 | 1.13 |
| lrc16053_2 | C | T | OT_32_33 | 0.67 |
| lrc16053_3 | T | C | OT_32_33 | 0.89 |
| lrc27472_2 | A | C | OT_32_33 | 0.68 |
| lrc34490_1 | C | A | OT_15 | 0.92 |
| lrc38531_1 | C | A | OT_34 | 0.94 |
| lrc38531_2 | - | - | OT_8 | 0.75 |
| lrc40347_1 | G | C | Unlinked | 0.81 |
a Linkage group assignment is based on appending new SSR loci to the existing OT linkage map [Ref. [35]] using Map Manager QTX.
b SR = Segregation ratio, calculated as the quotient of O/T allele frequencies within the population.
Figure 3Cluster analysis. Diversity analysis of oat genotypes was based on the UPGMA method and EST-derived oat SNP markers. PI 291990 and PI 573585 are A-genome diploids (A. strigosa); Cc 7278 is a C-genome diploid (A. eriantha); #169 and Ba 13-13 are CD-genome tetraploids (A. magna); and BYU 210 is a AC-genome tetraploid (A. insularis). Remaining genotypes are hexaploid.
Additional Avena genotypes used for SNP diversity analysis
| Taxon | Identifier | Country of origin | Ploidy | Genomes |
|---|---|---|---|---|
| PI 573585 | Spain | Diploid | AsAs | |
| PI 291990 | Israel | Diploid | AsAs | |
| Cc 7278/19 | Morocco | Diploid | CpCp | |
| #169 (M26) | Morocco | Tetraploid | CCDD | |
| Ba 13-13b | Morocco/Israel | Tetraploid | CCDD | |
| BYU 210 | Italy | Tetraploid | AACC | |
| PI 657407 | Morocco | Hexaploid | AACCDD | |
| PI 411817 | Iran | Hexaploid | AACCDD |
a Syn. A. maroccana
b Domesticated A. magna [Ref. [39]]
Figure 4Sequence validation. Sequencing of real-time PCR amplicons using oat EST-SNP primers revealed both simple (A) and complex (B) polymorphism between Ogle1040 (O) and TAM O-301(T). Individual reads are identified by primer name (c2760_1 and c3768_1) and genotype (O and T). Sequences were aligned using Geneious v4.8.3.
Figure 5SNP discovery pipeline. Schematic representing a sequence for SNP discovery in a complex genome. In silico removal of ambiguous polymorphism was used to optimize candidate SNP sequences prior to primer design and validation.