| Literature DB >> 30097588 |
Mai F Minamikawa1, Norio Takada2, Shingo Terakami2, Toshihiro Saito2, Akio Onogi1, Hiromi Kajiya-Kanegae1, Takeshi Hayashi3, Toshiya Yamamoto2, Hiroyoshi Iwata4.
Abstract
Breeding of fruit trees is hindered by their large size and long juvenile period. Genome-wide association study (GWAS) and genomic selection (GS) are promising methods for circumventing this hindrance, but preparing new large datasets for these methods may not always be practical. Here, we evaluated the potential of breeding populations evaluated routinely in breeding programs for GWAS and GS. We used a pear parental population of 86 varieties and breeding populations of 765 trees from 16 full-sib families, which were phenotyped for 18 traits and genotyped for 1,506 single nucleotide polymorphisms (SNPs). The power of GWAS and accuracy of genomic prediction were improved when we combined data from the breeding populations and the parental population. The accuracy of genomic prediction was improved further when full-sib data of the target family were available. The results suggest that phenotype data collected in breeding programs can be beneficial for GWAS and GS when they are combined with genome-wide marker data. The potential of GWAS and GS will be further extended if we can build a system for routine collection of the phenotype and marker genotype data for breeding populations.Entities:
Mesh:
Year: 2018 PMID: 30097588 PMCID: PMC6086889 DOI: 10.1038/s41598-018-30154-w
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1LD decay estimated from parental and combined populations. Curves show local polynomial smoothed plots with kernel weight for the parental population (n = 86) and combined parental and breeding populations (n = 851). Horizontal dashed lines represent the baseline r2 values based on the 95th percentile of the distribution of r2 values between pairs of unlinked markers.
Figure 2Structures of parental and combined populations. (A) Hierarchical clustering of the parental population. (B) Types of varieties (I, indigenous; O, old; M, modern; BL, breeding line) and release years (Supplementary Table S1). Asterisks indicate modern elite cultivars bred by the NARO Institute of Fruit Tree Science. (C) PCA of the parental population. Black and white diamonds indicate clusters I and II estimated by hierarchical clustering, respectively. (D) PCA of the combined population. Black circles indicate the parental population. Coloured circles indicate members of each family of the breeding population (see Supplementary Table S2).
Figure 3Manhattan plots for nine fruit quality traits in the single-locus GWAS using combined population. Dashed lines indicate a false discovery rate of 0.05. Linkage group 18 is a fictive linkage group for placing SNPs not mapped on 17 linkage groups.
Figure 4Prediction accuracy of single-trait models for the breeding populations. Prediction accuracy was measured as Pearson’s correlation coefficient (r) between predicted genotypic values and phenotypic values. The prediction accuracy was calculated for all families combined. (A) Five types of validation were compared. Only the mean prediction accuracy of all methods (B) is shown. (B) Twelve methods were tested. RR: ridge kernel regression, GAUSS: Gaussian kernel regression. Validation of type (iii) is shown (A); other validation types are shown in Supplementary Fig. S6. (C) Regression models were based on the results of the single-locus GWAS using parental or combined population. One or three SNPs that showed high −log10(p) values in GWAS were selected for MLR. (i), (iii), and (v) indicate validation types (A). MLR: multiple linear regression. (D) Prediction models that considered only additive or both additive and dominance effects were tested.
Phenotypic traits evaluated in this study.
| Trait | Abbreviation | Continuous or categorical value | Number of levels | Description | Rate of missing value in parental population | Rate of missing value in combined population |
|---|---|---|---|---|---|---|
| Harvest time | HarT | Continuous | — | Number of days to harvest from July 1st | 0 | 0 |
| Fruit weight | FruW | Continuous | — | Mature Fruit weight (g) | 0 | 0 |
| Flesh firmness | FruH | Continuous | — | Magness-Taylor pressure test (lb) | 0.01 | 0.004 |
| Sugar content | SugC | Continuous | — | Total soluble solid content of juice (%) | 0.01 | 0.004 |
| Acid content | Aci | Continuous | — | pH of juice | 0.01 | 0.004 |
| Fruit skin color | FruC | Categorical | 5 | Smooth (russet formation on 0–20% of the surface area of mature fruit), smooth (20–75%), smooth (75–95%) middle (95–99%), russet (100%) (visual) | 0.02 | 0.005 |
| Preharvest fruit drop | FruD | Continuous | — | Ratio of preharvest fruit drop (visual) | 0 | 0 |
| Heart rot | HeaR | Continuous | — | Ratio of heart rot (visual) | 0.01 | 0.004 |
| Watercore | WatC | Continuous | — | Ratio of watercore (visual) | 0.01 | 0.004 |
| Severe watercore | SWatC | Continuous | — | Ratio of severe watercore (visual) | 0.03 | — |
| Fruit shape in longitudinal section | FruS | Categorical | 5 | Round, oblate, broad elliptical, oval, obovate (visual) | 0.05 | — |
| Rust | Rust | Categorical | 4 | None, a few, intermediate, many (visual) | 0.07 | — |
| Appearance | Appear | Categorical | 5 | Very bad, bad, intermediate, good, very good (sensory) | 0.09 | — |
| Groove | Groove | Categorical | 3 | None, a few, many (visual) | 0.08 | — |
| Resistance to black spot 1 | BSR1 | Categorical | 3 | Weak, intermediate, strong (visual) | 0.02 | — |
| Resistance to black spot 2 | BSR2 | Categorical | 2 | Susceptibility, resistance (visual) | 0.02 | — |
| Vigor of tree | TreV | Categorical | 3 | Weak, intermediate, strong (visual) | 0.02 | — |
| Number of spurs | SpuN | Categorical | 3 | few, intermediate, many (visual) | 0.02 | — |
Figure 5Comparison of single- and multi-trait models for the breeding populations. Prediction accuracy was measured as Pearson’s correlation coefficient (r) between predicted genotypic values and phenotypic values. (i), (iii), and (v) indicate validation types (Fig. 4A). PHENIX: Bayesian multivariate mixed model fitted via variational Bayes, MGF: multiple-response Gaussian family.