| Literature DB >> 33404534 |
Yunbi Xu1,2,3, Xiaogang Liu1, Junjie Fu1, Hongwu Wang1, Jiankang Wang1, Changling Huang1, Boddupalli M Prasanna4, Michael S Olsen4, Guoying Wang1, Aimin Zhang5.
Abstract
Although long-term genetic gain has been achieved through increasing use of modern breeding methods and technologies, the rate of genetic gain needs to be accelerated to meet humanity's demand for agricultural products. In this regard, genomic selection (GS) has been considered most promising for genetic improvement of the complex traits controlled by many genes each with minor effects. Livestock scientists pioneered GS application largely due to livestock's significantly higher individual values and the greater reduction in generation interval that can be achieved in GS. Large-scale application of GS in plants can be achieved by refining field management to improve heritability estimation and prediction accuracy and developing optimum GS models with the consideration of genotype-by-environment interaction and non-additive effects, along with significant cost reduction. Moreover, it would be more effective to integrate GS with other breeding tools and platforms for accelerating the breeding process and thereby further enhancing genetic gain. In addition, establishing an open-source breeding network and developing transdisciplinary approaches would be essential in enhancing breeding efficiency for small- and medium-sized enterprises and agricultural research systems in developing countries. New strategies centered on GS for enhancing genetic gain need to be developed.Entities:
Keywords: genetic gain; genomic prediction; genomic selection; livestock breeding; molecular marker; open-source breeding
Year: 2019 PMID: 33404534 PMCID: PMC7747995 DOI: 10.1016/j.xplc.2019.100005
Source DB: PubMed Journal: Plant Commun ISSN: 2590-3462
Figure 1Flowchart of Background and Knowledge Relevant to Genomic Selection.
GWAS, genome-wide association study; QTL, quantitative trait loci; BSA, bulked sample analysis (Zou et al., 2016).
Figure 2Genomic Selection Procedure in Breeding Programs for Self-Pollinated (Left) and Cross-Pollinated (Right) Crops.
A training or reference population is used to estimate marker effects and then the genomic estimated breeding values of each individual in breeding populations, and the selected candidate lines can be regarded as founders for the next cycle of breeding. GS, genomic selection; DH, doubled haploid.
Factors Affecting Genetic Gain and Potential Contributors Associated with Genomic Selection in Plants.
| Components | Subcomponents | Contributors |
|---|---|---|
| Population | Population type | Bi- and multiparental populations, natural populations, mating-design populations, multiple-hybrid populations, population structure and relationship |
| Population size | ||
| Novel germplasm introduction | Exotic germplasm, transgene, genome editing, mutation, gene introgression | |
| Selection proportion | Breeding project, population size, heritability, TP/BP ratios | |
| Selection method | Phenotypic selection, GS, MARS, integrated selection | |
| Selection index | Indices based on breeding objectives, trait priorities | |
| Germplasm used to estimate GEBVs | Population type, size, relationship, and structure | |
| Genotype | Molecular marker | Marker type, marker density, LD between QTL and marker, functional versus neutral markers, genome distribution |
| Targeted genes | Transgene, mutation, genome editing | |
| Heritability | Field management | Experimental design, field management, trial site selection, uniform agronomic practice, environment management and control, precision phenotyping, envirotyping |
| Estimation | Population types, mating design, traits: major genes controlled or minor genes controlled | |
| GS model | Statistical model | Genetic effects, genotype-by-environment interaction, rrBLUP, Bayesian models, machine learning, pedigree information, non-additive effect, fixed effect, multivariate model, among others |
| Breeding scheme | Breeding program | Breeding objective, selection criteria, selection scheme, target-environmental selection, breeding cost |
| Integrated breeding platform | GS, MAS, MARS, genome editing, DH, seed DNA-based genotyping | |
| Off-season screening | Speed breeding, greenhouse, winter nursery |
DH, doubled haploid; MAS, marker-assisted selection; MARS, marker-assisted recurrent selection; rrBLUP, ridge-regression best linear unbiased prediction; GS, genomic selection; LD, linkage disequilibrium; QTL, quantitative trait locus; TP, training population; BP, breeding population.
Comparison of Genomic Selection between Livestock and Plants.
| Livestock | Plants | Shared | |
|---|---|---|---|
| Value-chain | Higher individual value with higher investment return | Lower individual value with low investment return | Reduced cost; improved efficiency and thus genetic gain |
| Cost | More tolerant to high cost | Less tolerant to high cost | Reduced cost and breeders' affordability for large-scale GS |
| Benefit | More benefit from early selection and reduced generation interval | Off-season selection; less benefit from early selection and reduced generation interval | Early selection; reduced generation interval; accumulating favorable alleles for complex traits |
| Genotyping | Relatively higher cost acceptable due to higher individual value; easier DNA/RNA extraction; available pedigree records and progeny testing data | Relatively lower cost required due to lower individual value; complicated DNA/RNA extraction; limited pedigree and progeny testing data | Flexible, low-cost, high-throughput markers and platforms; significant, functional markers and genes; good marker coverage; high-density markers |
| Phenotyping | Movable individuals; individual-based; usually smaller numbers | Fixed individuals; group- or population-based; larger numbers | High-throughput, precision, and low-cost protocols and platforms |
| Envirotyping | Relatively uniform sites and environments; easier to measure, control, and standardize | Diverse locations and environments; harder to measure, control, and standardize | Controlled and managed environments; modeled and optimized growth and development factors |
| Informatics and decision support | Less demanding as data are relatively few due to limited population types, numbers, and sizes | Highly demanding as data are sizeable due to populations of diverse types, larger numbers, and bigger sizes | Data collection, storage, and mining; modeling; making decision; big data-driven breeding |
| Population type | Largely heterozygous | Open- versus self-pollinated; natural versus designed; hetero- versus homozygous; temporary versus permanent; inbreeding versus distant | Desired for more training population types with known population structure |
| Population number | Small | Larger and multiple populations from specific parents or natural collection | Desired for more training and breeding populations |
| Population size | Small in pedigree and limited by siblings | Various sizes from small to large, species-dependent | Desired for large population sizes |
| Sharing and updating | Not sharable via seeds; not updatable via permanent or regenerated populations | Easier to share populations via seeds or tissue; updatable for permanent or regenerated populations | Sharable G-P-E information; updatable pedigrees and specific individuals |
| Genetic variation | Not possible to discover genetic variation via homozygous processing; not manageable for fine mapping and gene cloning via linkage mapping; difficult to create new alleles via mutation | Easier to discover genetic variation via homozygous processing; manageable for fine mapping and gene cloning via linkage mapping; easier to create new alleles via mutation due to controlled inbreeding | Developing markers for full-genome coverage; unlocking hidden genetic variation from closely related species; identifying markers and genes via GWAS; creating new alleles via gene transfer and genome editing |
| Heritability | Relatively higher due to weaker environmental effects and smaller experimental errors; environments are easier to be controlled or managed | Relatively lower due to stronger environmental effects and larger experimental errors; environments are more difficult to be controlled or managed | Improvable via managed trials with controlled environmental effects and errors |
| Selection intensity | Lower potential for increasing via larger population size or lower selection rate | Greater potential for increasing via larger population size and lower selection rate | Improvable using larger population size and lower selection rate |
| Breeding cycle time | Not manageable for rapid homozygous process; less sensitive to photoperiods; not or less adaptable to speed breeding via off-season trials or tissue culture; extremely sensitive to early selection | Rapid homozygous process via DH; probably sensitive to photoperiods; more adaptable to speed breeding via off-season trials and tissue culture; less sensitive to early selection | Shortened or accelerated cycle by early selection, shortened generation interval and accelerated generation, via clones and modified metabolism/pathways and adjusted growth |
| Statistical models | Less significant GEI; defined or known population structure; one model probably fit for the same population type | Very significant GEI; diverse levels of population structure; different models needed for diverse population types | Various statistical models: BLUP, GBLUP, rrBLUP, BayesA, wBSR, RKHS, BayesB, BayesCπ; biological effects: non-additive factors, epistasis, GEI, growth and development, networks, pathways |
| Germplasm evaluation | Not possible to maintain germplasm for a long term; non-renewable; pedigree-based evaluation | Easier to maintain for a long term under managed conditions; renewable; continuous and repeat evaluation with data accumulated | Evaluated for trait donors and gene discovery; identifying associated markers and genes via GWAS for GS; creating populations for model training and breeding |
| Prebreeding | Less important and less manageable | Important and practical | Desired for creating new germplasm more manageable to breeders |
| Stress tolerance | Abiotic: managed by controlled environments; biotic: managed by gene modification, surgery, internal medicine therapy | Abiotic: managed via environmental control, improved tolerance, chemical control or regulation; biotic: managed by integrated control and improved tolerance | Adjusted and enhanced adaptation and tolerance to abiotic and biotic stresses |
| Open-source breeding | More applicable for breeding parents or parental populations | Suitable for all cases in plant breeding | Sharing G-P-E information and even genetics and breeding materials; sharing GS-related platforms across livestock and plants |
BLUP, best linear unbiased prediction; GBLUP, genomic BLUP; GEI, genotype-by-environment interaction; G-P-E, genotype–phenotype–environment; GS, genomic selection; GWAS, genome-wide association study; RKHS, reproducing kernel Hilbert space; rrBLUP, ridge-regression BLUP; wBSR, weighted Bayesian shrinkage regression.
Figure 3An Integrated Breeding Platform for Genomic Selection.
The platform involves various breeding technologies, including doubled haploid (DH) technology, speed breeding, decision support tools, seed DNA-based genotyping, genome editing, and transgenosis.
Figure 4Open-Source Breeding and Genomic Selection Networks Provide Services to Developing Countries and Small- and Medium-Sized Breeding Companies.
All breeding-related information, including estimated marker effects and genomic estimated breeding values (GEBVs), even breeding materials, can be shared during the breeding process and after each breeding cycle (t and t + 1), functioning in the same way as in a multinational seed incorporation where each breeding team works as a small- and medium-sized company (1, 2, …, n).