| Literature DB >> 21152036 |
Yan Fu1, Nathan M Springer, Kai Ying, Cheng-Ting Yeh, A Leonardo Iniguez, Todd Richmond, Wei Wu, Brad Barbazuk, Dan Nettleton, Jeff Jeddeloh, Patrick S Schnable.
Abstract
To date, microarray-based genotyping of large, complex plant genomes has been complicated by the need to perform genome complexity reduction to obtain sufficiently strong hybridization signals. Genome complexity reduction techniques are, however, tedious and can introduce unwanted variables into genotyping assays. Here, we report a microarray-based genotyping technology for complex genomes (such as the 2.3 GB maize genome) that does not require genome complexity reduction prior to hybridization. Approximately 200,000 long oligonucleotide probes were identified as being polymorphic between the inbred parents of a mapping population and used to genotype two recombinant inbred lines. While multiple hybridization replicates provided ∼97% accuracy, even a single replicate provided ∼95% accuracy. Genotyping accuracy was further increased to >99% by utilizing information from adjacent probes. This microarray-based method provides a simple, high-density genotyping approach for large, complex genomes.Entities:
Mesh:
Substances:
Year: 2010 PMID: 21152036 PMCID: PMC2996289 DOI: 10.1371/journal.pone.0014178
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Comparison of genotyping using different subsets of probes and multiple analysis methods.
| B>M (all | B>M (2-FC | M>B (all | M>B (2-FC | |||||
| Number probes | 204934 | 164728 | 20933 | 8394 | ||||
| RIL - M0022 | % Calls | % Consistent | % Calls | % Consistent | % Calls | % Consistent | % Calls | % Consistent |
| Linear model calls (Method I) | ||||||||
| 95.2% | 94.9% | 95.3% | 96.7% | 85.9% | 82.8% | 78.4% | 83.4% | |
| Simple model calls (Method II) | ||||||||
| Replicate 1 | 93.1% | 91.9% | 93.4% | 94.5% | 74.2% | 76.6% | 73.6% | 80.6% |
| Replicate 2 | 93.8% | 93.6% | 94.8% | 96.0% | 74.6% | 79.5% | 76.0% | 81.8% |
| Consistency among replicates | ||||||||
| Both same call | 84.0% | 95.3% | 87.3% | 96.7% | 44.2% | 81.9% | 49.9% | 85.4% |
| Opposing calls5 | 4.2% | 2.0% | 11.5% | 7.4% | ||||
| RIL -M0023 | % Calls | % Validated | % Calls | % Validated | % Calls | % Validated | % Calls | % Validated |
| Linear model calls (Method I) | ||||||||
| 92.5% | 96.7% | 93.3% | 97.1% | 80.9% | 86.4% | 74.2% | 87.7% | |
| Simple model calls (Method II) | ||||||||
| Replicate 1 | 93.7% | 93.9% | 94.8% | 96.3% | 71.2% | 80.5% | 72.4% | 84.7% |
| Replicate 2 | 96.6% | 94.4% | 97.4% | 96.5% | 73.2% | 81.5% | 72.7% | 83.8% |
| Replicate 3 | 95.2% | 94.2% | 96.5% | 96.5% | 77.3% | 82.9% | 78.9% | 85.4% |
| Consistency among replicates | ||||||||
| Same call in all replicates | 90.9% | 95.4% | 94.1% | 96.9% | 58.0% | 83.9% | 60.9% | 86.0% |
| Opposing calls | 1.8% | 0.6% | 3.3% | 2.1% | ||||
The B>M or M>B (all) refers to all probes with a FDR<0.05 in a comparison of B73 and Mo17.
The B>M or M>B (2-FC) probes refers to the subset of polymorphic probes that have a FDR<0.0001 and a minimum of 2-fold change between B73 and Mo17.
The % calls is the percent of SFPs that could be assigned a genotype using an analysis method.
The % consistent refers to the percentage of polymorphic probes that were assigned the same genotype in previously generated genotyping data.
The opposing calls are those for which the same probe was assigned different genotypes in different replicates.
Figure 1Comparisons of analytical approaches for CGH-based mapping.
Data for B>M probes located on chromosome 1 that exhibit at least a 2-fold change (n = 26,953) were plotted following the use of different data analysis approaches. The upper set of plots display data for the RIL M0022 while the lower panels show data for the RIL M0023. The plots on the left and right display probes from the entire chromosome 1 and a close-up view of a 20 Mb region of chromosome 1 (positions 200 Mb–220 MB), respectively. For each set of plots the first panel provides visualization genotyping calls and log2 (RIL/B73) ratios following normalization and analysis using a linear model of multiple replications (Method I). The second and third panels show the genotyping calls and log2(RIL/B73) ratios based on the analysis of a single replicate of data that was normalized using standard NimbleScan approaches (Method II). The final plot shows the genotyping calls for each BAC (n = 1,369) using data from replicate 1 (Method III).
Genotyping calls by BAC (Method III).
| # BACs assigned genotype | % Consistent with PCR genotyping | |
| M0022 | ||
| Linear model | 8333 | 99.19% |
| Simple model | ||
| Replicate 1 | 8206 | 99.18% |
| Replicate 2 | 8264 | 99.22% |
| Consensus | 8036 | 99.23% |
| M0023 | ||
| Linear model | 8262 | 99.44% |
| Simple model | ||
| Replicate 1 | 8289 | 99.47% |
| Replicate 2 | 8351 | 99.51% |
| Replicate 3 | 8325 | 99.47% |
| Consensus | 8273 | 99.47% |
The consensus line specifies the number of BACs that were assigned the same genotype in all replicates.
Figure 2Visualization of whole genome genotypes of two RILs.
The genotype of each BAC with at least 5 filtered B>M probes (n = 7,978 that were called in both lines) was determined and color coded (B73 – blue; Mo17 – red). BACs were then plotted according to their physical positions along a chromosome (x-axis) and by chromosome (y-axis). This visualization was created using a single replicate of data.
Figure 3High-resolution of recombination break-points.
Several plots show detailed views of the CGH mapping data near recombination events in the RIL M0022. The log2(M0022/B73) value is plotted along the y-axis for each of the B>M probes (q<0.0001 and fold-change>2) in five genomic regions. The arrowheads indicate the position of the recombination event and the label indicates the chromosome and the base pair resolution of the recombination event.