| Literature DB >> 27554097 |
Fabian Grandke1,2, Priyanka Singh3,4, Henri C M Heuven3,5, Jorn R de Haan3, Dirk Metzler6.
Abstract
BACKGROUND: Association studies are an essential part of modern plant breeding, but are limited for polyploid crops. The increased number of possible genotype classes complicates the differentiation between them. Available methods are limited with respect to the ploidy level or data producing technologies. While genotype classification is an established noise reduction step in diploids, it gains complexity with increasing ploidy levels. Eventually, the errors produced by misclassifications exceed the benefits of genotype classes. Alternatively, continuous genotype values can be used for association analysis in higher polyploids. We associated continuous genotypes to three different traits and compared the results to the output of the genotype caller SuperMASSA. Linear, Bayesian and partial least squares regression were applied, to determine if the use of continuous genotypes is limited to a specific method. A disease, a flowering and a growth trait with h (2) of 0.51, 0.78 and 0.91 were associated with a hexaploid chrysanthemum genotypes. The data set consisted of 55,825 probes and 228 samples.Entities:
Keywords: Association study; Bayz; Continuous genotypes; Linear regression; Partial least squares; Polyploids
Mesh:
Year: 2016 PMID: 27554097 PMCID: PMC4995758 DOI: 10.1186/s12864-016-2926-5
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Example probe. Example of genotype values for a hexaploid probe and 228 samples. The x-axis shows the difference between the signals of the two alleles. The y-axis shows the average signal strength per sample. The left and right sides show simulated and real data, respectively. a The simulation demonstrates how the seven genotype classes cluster into groups. b The real data shows the full segregation over the whole spectrum, but no clustering into seven genotype classes
Fig. 2Result comparison. Venn diagrams of significant probes for the disease (a), flowering (b) and growth (c) trait. The significance thresholds for LR(q-value), bayz (BF) and PLS(VIM score) were ≤0.01, ≥10 and ≥2, respectively
Fig. 3Comparison of genotype calls and continuous genotypes. Comparison of continuous values and genotype calls for three selected probes. a-c, d-f and g-i are the disease, flowering and growth trait, respectively. a, d and g were generated by SuperMASSA, based on the signal intensity values of the two alleles. The shapes and colors represent genotype clusters. The expected cluster centers are indicated by solid lines. b, e and h show the correlation of the raw genotype values with the corresponding EBVs. The solid lines represent the LR. c, f and i show the correlation of the genotype classes with the EBVs. The solid lines represent the LR
Fig. 4Overview of contigs and probes for the flowering trait. Probe and contig distribution of the significant markers of the flowering trait. The five bars represent the number of markers that lay in the same contig. The colors distinguish between markers where both or only one of the probes were significant and are more reliable. Multiple markers from the same contig indicate its association to the trait
Overlapping significant markers between all three methods for the flowering trait
| Marker |
|
| BF | VIM |
|---|---|---|---|---|
| AX-89300609 | 0.31 | 1.72×10−15 | 11.03 | 3.23 |
| AX-89215144 | 0.26 | 8.90×10−13 | 15.49 | 2.90 |
| AX-89213862 | 0.18 | 1.74×10−8 | 24.82 | 4.12 |
| AX-89256548 | 0.09 | 5.16×10−6 | 12.27 | 3.45 |
Overview of traits (DEBV)
| Trait |
| Mean | SD | Range | Samples |
|---|---|---|---|---|---|
| Disease | 0.51 | 0.05 | 1.2 | 6.19 | 228 |
| Flowering | 0.78 | 0.00 | 3.3 | 19.01 | 228 |
| Growth | 0.92 | 0.21 | 12.2 | 63.57 | 228 |