| Literature DB >> 28680114 |
Mai F Minamikawa1, Keisuke Nonaka2, Eli Kaminuma3, Hiromi Kajiya-Kanegae1, Akio Onogi1, Shingo Goto2, Terutaka Yoshioka2, Atsushi Imai4, Hiroko Hamada2, Takeshi Hayashi5, Satomi Matsumoto6, Yuichi Katayose6, Atsushi Toyoda7,8, Asao Fujiyama8, Yasukazu Nakamura3, Tokurou Shimizu2, Hiroyoshi Iwata9.
Abstract
Novel genomics-based approaches such as genome-wide association studies (GWAS) and genomic selection (GS) are expected to be useful in fruit tree breeding, which requires much time from the cross to the release of a cultivar because of the long generation time. In this study, a citrus parental population (111 varieties) and a breeding population (676 individuals from 35 full-sib families) were genotyped for 1,841 single nucleotide polymorphisms (SNPs) and phenotyped for 17 fruit quality traits. GWAS power and prediction accuracy were increased by combining the parental and breeding populations. A multi-kernel model considering both additive and dominance effects improved prediction accuracy for acidity and juiciness, implying that the effects of both types are important for these traits. Genomic best linear unbiased prediction (GBLUP) with linear ridge kernel regression (RR) was more robust and accurate than GBLUP with non-linear Gaussian kernel regression (GAUSS) in the tails of the phenotypic distribution. The results of this study suggest that both GWAS and GS are effective for genetic improvement of citrus fruit traits. Furthermore, the data collected from breeding populations are beneficial for increasing the detection power of GWAS and the prediction accuracy of GS.Entities:
Mesh:
Year: 2017 PMID: 28680114 PMCID: PMC5498537 DOI: 10.1038/s41598-017-05100-x
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Linkage disequilibrium (LD) values (r 2) between pairs of SNPs plotted against physical distances between markers. Curves show local polynomial smoothed plots with kernel weight for the parental population (n = 111) and combined parental and breeding populations (n = 787).
Figure 2Genetic and phenotypic population structure of the parental population. (A) Ward’s hierarchical clustering based on Euclidean distance between genotypes. (B) ADMIXTURE-based estimation of the admixture proportions of individuals. Each color represents the inferred genetic contributions from K ancestral populations. (C) Heat map of log-converted fruit weight values.
Fruit quality traits evaluated in this study.
| Trait | Abbreviation | Continuous or categorical value | Number of levels | Description |
|---|---|---|---|---|
| Fruit weight | Weight | Continuous | — | Fruit weight (g) (instrumental) |
| Appearance | Appear | Categorical | 3 | Good, intermediate, bad (sensory) |
| Fruit shape | Shape | Categorical | 16 | Very strongly oblate, strongly oblate, oblate, globose with truncated apex, globose, ellipsoid, ellipsoid with prominently nippled apex, pyriform, and each category with presence or absence of neck (including collar) at base (visual) |
| Fruit hardness | FruH | Categorical | 5 | Very soft, soft, intermediate, hard, very hard (sensory) |
| Color of pericarp | ColorP | Categorical | 9 | Green, cream, yellow, yellowish orange, light orange, orange, deep orange, light reddish orange, reddish orange (visual) |
| Smoothness of pericarp | SmoothP | Categorical | 5 | Smooth, moderately smooth, intermediate, moderately rough, rough (visual) |
| Easiness of peeling | Peeling | Categorical | 5 | Easy, moderately easy, intermediate, moderately difficult, difficult (sensory) |
| Aroma intensity | Aroma | Categorical | 4 | Strong, intermediate, weak, none (sensory) |
| Color of flesh | ColorF | Categorical | 6 | Cream, yellow, yellowish orange, light orange, orange, deep orange (visual) |
| Flesh hardness | FleH | Categorical | 5 | Very soft, soft, intermediate, moderately hard, hard (sensory) |
| Juiciness | Juicy | Categorical | 3 | Juicy, intermediate, dry (sensory) |
| Firmness of locule membrane | FirmLM | Categorical | 5 | Very soft, soft, intermediate, moderately hard, hard (sensory) |
| Number of seeds | Seed | Categorical | 4 | None (0), a few (1–2), intermediate (3–5), many (>6) (visual) |
| Bitterness | Bitter | Categorical | 2 | Present or absent (sensory) |
| Taste | Taste | Categorical | 5 | Very good, good, intermediate, bad, very bad (sensory) |
| Sugar content | Brix | Continuous | — | Total soluble solid content of juice (%) (instrumental) |
| Acidity | Acid | Continuous | — | Acidity of juice (%) (instrumental) |
Figure 3Manhattan plots for 17 fruit quality traits in the combined parental and breeding populations. Mixed linear model used four principal components of population structure as covariates (Supplementary Fig. S6). Dashed lines indicate a false discovery rate of 0.05.
Figure 4Comparison of prediction models using the parental population. Prediction accuracy was measured as the Pearson’s correlation coefficient (r) between predicted genotypic values and phenotypic values. (A) Twelve methods were tested. RR: ridge kernel regression, GAUSS: Gaussian kernel regression. (B) Regression models were built based on the results of GWAS. The three SNPs that showed high –log10(p) values in GWAS (red points in Supplementary Fig. S5) were selected. MLR: Multiple Leaner Regression. (C) Prediction models that considered only additive or both additive and dominance effects were used.
Figure 5Prediction for fruit weight distribution tails. (A) All varieties except those in the large-fruit group (Fig. 2C), which is indicated by pink points, were used as the training set of the prediction model; the large-fruit group was used as the test set. (B) All varieties except those in the small-fruit group (Fig. 2C), which is indicated by blue points, were used as the training set; the small-fruit group was used as the test set. Grey points indicate the results of leave-one-out cross-validation. Predicted and observed values indicate log-transformed Weight. Pearson’s correlation coefficient (r) between predicted and observed values for the pink or blue points are shown on the plots. RR: linear ridge kernel regression, GAUSS: non-linear Gaussian kernel regression.
Figure 6Prediction accuracy of the breeding population. Prediction accuracy was measured as the Pearson’s correlation coefficient (r) between predicted genotypic values and phenotypic values. The prediction accuracy was calculated for combined all families. Only the mean prediction accuracy of all the methods is shown (Supplementary Fig. S10). Training populations are shown in the figure key. When the breeding population or combined parental and breeding populations were used as training sets, one family was excluded and its phenotype was predicted.