| Literature DB >> 32785739 |
Jared Crain1, Steve Larson2, Kevin Dorn1,3, Traci Hagedorn4,5, Lee DeHaan6, Jesse Poland7,8.
Abstract
KEY MESSAGE: Paternity assignment and genome-wide association analyses for fertility were applied to a Thinopyrum intermedium breeding program. A lack of progeny between combinations of parents was associated with loci near self-incompatibility genes. In outcrossing species such as intermediate wheatgrass (IWG, Thinopyrum intermedium), polycrossing is often used to generate novel recombinants through each cycle of selection, but it cannot track pollen-parent pedigrees and it is unknown how self-incompatibility (SI) genes may limit the number of unique crosses obtained. This study investigated the potential of using next-generation sequencing to assign paternity and identify putative SI loci in IWG. Using a reference population of 380 individuals made from controlled crosses of 64 parents, paternity was assigned with 92% agreement using Cervus software. Using this approach, 80% of 4158 progeny (n = 3342) from a polycross of 89 parents were assigned paternity. Of the 89 pollen parents, 82 (92%) were represented with 1633 unique full-sib families representing 42% of all potential crosses. The number of progeny per successful pollen parent ranged from 1 to 123, with number of inflorescences per pollen parent significantly correlated to the number of progeny (r = 0.54, p < 0.001). Shannon's diversity index, assessing the total number and representation of families, was 7.33 compared to a theoretical maximum of 8.98. To test our hypothesis on the impact of SI genes, a genome-wide association study of the number of progeny observed from the 89 parents identified genetic effects related to non-random mating, including marker loci located near putative SI genes. Paternity testing of polycross progeny can impact future breeding gains by being incorporated in breeding programs to optimize polycross methodology, maintain genetic diversity, and reveal genetic architecture of mating patterns.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32785739 PMCID: PMC7547974 DOI: 10.1007/s00122-020-03666-1
Source DB: PubMed Journal: Theor Appl Genet ISSN: 0040-5752 Impact factor: 5.699
Gametophytic compatibility outcomes corresponding to three possible parental genotypes for a biallelic, self-incompatibility (SI) single-loci
| Male parent genotype | Female parent genotype | Progeny loci genotype code | Expected SI phenotype |
|---|---|---|---|
| SI1SI1 = − 1 | SI1SI1 = − 1 | − 1 | I |
| SI1SI1 = − 1 | SI1SI2 = 0 | − 1 | I |
| SI1SI1 = − 1 | SI2SI2 = 1 | + 1 | C |
| SI1SI2 = 0 | SI1SI1 = − 1 | + 1 | C |
| SI1SI2 = 0 | SI1SI2 = 0 | − 1 | I |
| SI1SI2 = 0 | SI2SI2 = 1 | + 1 | C |
| SI2SI2 = 1 | SI1SI1 = − 1 | + 1 | C |
| SI2SI2 = 1 | SI1SI2 = 0 | − 1 | I |
| SI2SI2 = 1 | SI2SI2 = 1 | − 1 | I |
Progeny genotypes with − 1 are assumed incompatible (I) as female mother contains the exact alleles as the pollen parent, and 1 is compatible (C) where the female mother contains contrasting alleles to the pollen parent. Coding system was used for each of 9358 biallelic single-nucleotide polymorphic markers to develop full in silico genomic profiles of progeny, with the expected phenotype corresponding to I or C with SI occurring
Fig. 1Histogram showing the distribution of the number of progeny of each pollen parent in a polycross breeding program, where paternity was determined through SNP markers. Red vertical line is median value (32), and the dashed black line is the average (41) (color figure online)
Fig. 2Matrix showing the number of progeny per cross with increasing color gradient, white diagonal is unobserved self-progeny; unobserved combinations are in tan, while crosses with five or more progeny are in dark blue. Female plants are on the x-axis with approximately equal distribution per line, observed vertically resulting from the population sampling. Male plants are in the y-axis and read horizontally. Some males such as 116SGH000028 crossed with many females, while other such as 16SGH000086 did not sire progeny (color figure online)
Putative S and Z SI gene alignment to the intermediate wheatgrass (IWG) genome and genome-wide association analysis (GWAS) markers with false discovery rate (FDR) values less than 0.05
| IWG_locus | Chr | Position | Homologous Group | − Log(p) | GWAS Model | BLAST type | BLAST_Database (IWG V2) | BLAST hit (IWG V2 gene) | Query sequence | Query citation |
|---|---|---|---|---|---|---|---|---|---|---|
| S01_12234882 | 1 | 12234882 | 1 | 6.11 | A | |||||
| S01_42697298 | 1 | 42697298 | 1 | 5.55 | B | |||||
| S01_339101017 | 1 | 339101017 | 1 | 3.87a | A | |||||
| S02_124049901 | 2 | 124049901 | 1 | 4.32a | A | |||||
| S02_333426395 | 2 | 333426395 | 1 | 4.46a | D | |||||
| S02_357481632 | 2 | 357481632 | 1 | 4.40a | A | |||||
| S02_408872524 | 2 | 408872524 | 1 | 4.10a | D | |||||
| S03_77463845 | 3 | 77463845 | 1 | 3.94a | A | |||||
| S03_161965824 | 3 | 161965824 | 1 | 3.62a | A | |||||
| S04_134219385 | 4 | 134219385 | 2 | 4.40 | B | |||||
| S05_184724149 | 5 | 184724149 | 2 | 5.29 | A | |||||
| S06_50454994 | 6 | 50454994 | 2 | 4.20a | A | |||||
| S06_234033536 | 6 | 234033536 | 2 | 4.63a | D | |||||
| S06_409578239 | 6 | 409578239 | 2 | 4.10a | B | |||||
| S06_486347536 | 6 | 486347536 | 2 | 4.06a | B | |||||
| S06_495791921 | 6 | 495791921 | 2 | 6.90 | B | |||||
| S06_495791961 | 6 | 495791961 | 2 | 6.90 | B | |||||
| S06_540749817 | 6 | 540749817 | 2 | 3.29a | B | |||||
| S10_488147266 | 10 | 488147266 | 4 | 4.77 | B | |||||
| S11_209271814 | 11 | 209271814 | 4 | 6.88 | B | |||||
| S12_13283789 | 12 | 13283789 | 4 | 5.58 | C | |||||
| S18_38158835 | 18 | 38158835 | 6 | 5.43 | B | |||||
| S20_276537900 | 20 | 276537900 | 7 | 5.64 | C | |||||
| S20_461811575 | 20 | 461811575 | 7 | 5.11 | B |
GWAS models: A. GWAS using in silico progeny genotype encoding for a single-loci, self-incompatibility gametophytic system for observed progeny combinations, B. principal component analysis (PCA) of progeny matrix principal component (PC) 1, C. PCA of progeny matrix PC 2, D. PCA of progeny matrix PC 3
Bold items are putative BLASTN and BLASTP hits
aSignificant for chromosome FDR = 0.05
Advantages and disadvantages of direct cross-breeding and polycross by random intermating for the breeding programs
| Direct Crosses | Random intermating | |
|---|---|---|
| Advantages | Large family size Known pedigree Make targeted/desired cross | Large number of families Resource efficient Infer pedigree from markers |
| Disadvantages | Resource-intensive Limited by available resources | May not observe desired combination Must use molecular markers Small family size |
Fig. 3Manhattan plot of genome-wide association analysis (GWAS) of four different models with 9358 markers. The dashed red horizontal line represents the false discovery rate (FDR) of 0.05 controlling for multiple comparisons across the entire genome. Solid red lines represent the FDR of 0.05 for multiple comparisons for each chromosome and were only considered for chromosomes harboring a putative SI gene (chromosomes 1, 2, 3, and 6). Panels represent: A GWAS using in silico progeny genotype encoding for a single-loci, self-incompatibility gametophytic system for observed progeny combinations, n = 7921. B Principal component analysis (PCA) of progeny matrix principal component (PC) 1. C PCA of PC 2. D PCA of PC 3, n = 89 (color figure online)