| Literature DB >> 34510383 |
Yu Zhang1, Yuexing Wang2, Wanying Zhou3, Shimao Zheng3, Runzhou Ye4.
Abstract
Quantitative trait locus (QTL) mapping often yields associations with dissimilar loci/genes as a consequence of diverse factors. One trait for which very limited agreement between mapping studies has been observed is resistance to white mold in soybean. To explore whether different approaches applied to a single data set could lead to more consistent results, haplotype-trait association and epistasis interaction effects were explored as a complement to a more conventional marker-trait analysis. At least 10 genomic regions were significantly associated with Sclerotinia sclerotiorum resistance in soybean, which have not been previously reported. At a significance level of α = 0.05, haplotype-trait association showed that the most prominent signal originated from a haplotype with 4-SNP (single nucleotide polymorphism) on chromosome 17, and single SNP-trait analysis located a nucleotide polymorphism at position rs34387780 on chromosome 3. All of the peak-SNPs (p-value < 0.05) of each chromosome also appeared in their respective haplotypes. Samples with extreme phenotypes were singled-out for association studies, 25-30% from each end of the phenotypic spectrum appeared in the present investigation to be the most appropriate sample size. Some key genes were identified by epistasis interaction analysis. By combining information on the nearest positional genes indicated that most loci have not been previously reported. Gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses suggest potential candidate genes underlying callose deposition in the cell wall and mitogen-activated protein kinase (MAPK) signaling pathway-plant, as well as plant-pathogen interaction pathway, were activated. Integration of multi-method genome-wide association study (GWAS) revealed novel genomic regions and promising candidate genes in novel regions, which include Glyma.01g048500, Glyma.03g129100, Glyma.17g072200, and the Dishevelled (Dvl) family of proteins on chromosomes 1, 3, 17, and 20, respectively.Entities:
Keywords: Epistasis; Haplotype; Post-GWAS; SNP; Sclerotinia sclerotiorum
Mesh:
Year: 2021 PMID: 34510383 PMCID: PMC8755693 DOI: 10.1007/s13353-021-00654-z
Source DB: PubMed Journal: J Appl Genet ISSN: 1234-1983 Impact factor: 3.240
Fig. 1Distribution of SNP/haplotype: (a) Number of SNP on each chromosome. (b) Number of haplotype on each chromosome
Fig. 2Population structure diagram of the 126 soybean lines. Note: Red: group I; Green: group II. Vertical lines on the X-axis refer to each variety. The proportion of each color represents probability rate with which a given genotype belongs to each group
Haplotype-trait analysis of 126 lines (p-value < 0.05)
| Chromosome | Position of haplotype | Haplotype (1 = A, 2 = C, 3 = G, 4 = T) | ||
|---|---|---|---|---|
| 17 | 5,575,883|5,647,814|5,648,648|5,734,897 | 3114 | 0.1756 | 0.021 |
| 1 | 5,589,867|5,700,523|5,724,122|5,724,140 | 1223 | 0.1648 | 0.049 |
Single SNP-trait association of 126 lines using additive effects and additive effects + dominant effects (P-value < 0.05)
| GLM (general linear model) | |||||||
|---|---|---|---|---|---|---|---|
| Additive effects only | Additive effects + dominant effects | ||||||
| Chromosome | Position | Chromosome | Position | ||||
| 3 | 34,387,780 | 0.00816 | 0.01788 | 20 | 42,100,739 | 0.0132 | 0.0329 |
| 3 | 34,387,823 | 0.0086 | 0.01788 | 20 | 42,091,969 | 0.01418 | 0.0356 |
| 3 | 34,387,841 | 0.00816 | 0.01788 | 20 | 42,118,002 | 0.02327 | 0.059 |
| 20 | 42,091,969 | 0.01211 | 0.02673 | 18 | 55,469,724 | 0.03058 | |
| 1 | 5,589,867 | 0.0124 | 0.02752 | 1 | 5,589,867 | 0.03392 | |
| 20 | 42,118,002 | 0.01641 | 0.03613 | 3 | 34,387,780 | 0.04016 | |
| 3 | 34,395,745 | 0.01695 | 0.0376 | 3 | 34,387,823 | 0.04016 | |
| 3 | 34,387,945 | 0.0243 | 3 | 34,387,841 | 0.04016 | ||
| 20 | 42,100,739 | 0.03358 | 3 | ||||
| 20 | 42,122,908 | 0.03878 | 3 | ||||
Fig. 3Manhattan plot of the association between single SNP-Trait. Note: P-values for the association of each single nucleotide polymorphism with phenotype are shown on the y-axis. The SNPs are plotted on the x-axis according to their chromosomal location. Blue line: α = 0.05; red line: α = 0.01 (additive effects only)
Fig. 4Prioritizing candidate genes. Note: Triangle represents the position of haplotype, red font indicates QTL is within a gene, and green font indicates that the QTL is outside a gene
Fig. 5The top three epistasis interaction subnetworks
Top genes in the epistasis network (P < 1 × 10−8)
| ALL × ALL | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| SNP1 | SNP2 | |||||||||
| CHR1 | Position | Distance to SNP (bp) | Candidate gene | Annotation | CHR2 | Position | Distance to SNP (bp) | Candidate gene | Annotation | |
| 6 | 563,504 | Within | Glyma.06g007400 | Vacuolar-type H( +)-ATPase C3 | 18 | 57,104,652 | 2087 | Glyma.18g292800 | S-adenosyl-L-methionine-dependent methyltransferases superfamily protein | 2.475E − 9 |
| 7 | 41,611,205 | 3442 | Glyma.07g235100 | Molybdopterin biosynthesis MoaE family protein | 8 | 8,785,343 | 12,370 | Glyma.08g114700 | Calmodulin-binding family protein | 2.652E − 9 |
| 1 | 54,799,844 | 2853 | Glyma.01g219000 | Transcription initiation factor TFIIE, beta subunit | 15 | 26,893,418 | 60,406 | Glyma.15g204300 | Surfeit locus protein 5 subunit 22 of mediator complex | 2.827E − 9 |
| 6 | 563,504 | 29,305 | Arginine decarboxylase 2 | 18 | 57,114,367 | 11,802 | S-adenosyl-L-methionine-dependent methyltransferases superfamily protein | 5.596E − 9 | ||
| 1 | 54,799,844 | 2853 | Transcription initiation factor TFIIE, beta subunit | 15 | 37,971,247 | 49,861 | Ca(2)-dependent phospholipid-binding protein (Copine) family | 6.06E − 9 | ||
| 1 | 54,719,664 | 1102 | Cytochrome P450, family 87, subfamily A, polypeptide 2 | 15 | 26,893,418 | 60,406 | Surfeit locus protein 5 subunit 22 of mediator complex | 6.536E − 9 | ||
Prioritizing candidate genes(P < 1 × 10−7)
| Chr | Position of haplotype | Candidate gene | Annotation |
|---|---|---|---|
| 3 | 34,371,310/34371316/34371324/34385919/34387780/34387823/34387841/34387945/34387962/34395745/34395833/34407540/34446090/34453159 | 1.Glyma.03g128300 2.Glyma.03g128900 3.Glyma.03g129100 4.Glyma.03g129200 5.Glyma.03g129300 6.Glyma.03g129600 7.Glyma.03g129700 8.Glyma.03g130100 | 1. Glutamate synthase 2. Lycopene cyclase/lycopene β-cyclase 3. Pyrroline-5- carboxylate (P5C) reductase 4. Cytochrome P450, family 86, subfamily A, polypeptide 1 5. S-adenosyl-L-methionine-dependent methyltransferases superfamily protein 6. Glutaredoxin family protein 7. Cystathionine beta-lyase (CBL) 8. Calcium-dependent phosphotriesterase superfamily protein |
17 (01) 17 (02) | 5,575,883/5647814/5648648/5734897 31,585,092–31,585,542 | 1. Glyma.17g071700 2. Glyma.17g071900 3. Glyma.17g072200 4. Glyma.17g072400 5. Glyma.17g072700 6. Glyma.17g073300 7. Glyma.17g199400 | 1. Leucine-rich receptor-like protein kinase family protein 2. Calcium-binding EF-hand family protein 3. Cellulose synthase family protein 4. Heat shock protein 70B 5. Nucleic acid binding 6. Signal recognition particle receptor alpha subunit family protein 7. Glutathione S-transferase THETA 2 |
| 20 | 42,068,110/42080972/42091969/42100739/42100805/42118002/42122908 | 1. Glyma.20g182300 2. Glyma.20g182400 | 1. DVL family protein 2. Ras-related small GTP-binding family protein |
1 (01) 1 (02) | 5,589,867/5700523/5724122/5724140 6,337,006–6,625,502 | 1. Glyma.01g048500 2. Glyma.01g048800/Glyma.01g048900 3. Glyma.01g049000 4. Glyma.01g052300 5. Glyma.01g052500 6. Glyma.01g052900 7. Glyma.01g053100 | 1. Galactosyltransferase1 2. Glucose-methanol-choline (GMC) oxidoreductase family protein 3. Glutathione S-transferase THETA 3 4. Levansucrase 5. GDP-fucose protein O-fucosyltransferase 6. Early-responsive to dehydration stress protein 7. Argonaute family protein |
| 4 | 6,353,086/6353873 | 1. Glyma.04g075500 2. Glyma.04g075900 3. Glyma.04g076200 | 1. Plant calmodulin-binding protein-related 2. Peptidyl-tRNA hydrolase II (PTH2) family protein 3. WRKY DNA-binding protein 11 |
13 (01) 13 (02) | 21,831,889 37,847,000/37886116|37,951,427|37,979,893|38,024,854|38,027,686 | 1. Glyma.13g102000 2. Glyma.13g102200 3. Glyma.13g104100 4. Glyma.13g104300 5. Glyma.13g276800 6. Glyma.13g277100 7. Glyma.13g278800 | 1. WRKY DNA-binding protein 11 2. Protein kinase superfamily protein 3. Polyamine oxidase 5 4. Protein kinase superfamily protein 5. Protein kinase superfamily protein 6. Cytochrome P450, family 72, subfamily A, polypeptide 15 7. Leucine-rich repeat (LRR) family protein |
| 10 | 2,870,409–2,873,948 | Glyma.10g032900 | WRKY DNA-binding protein 21 |
Fig. 6The results of GO functional enrichment analysis
Fig. 7The results of KEGG pathways enrichment analysis