| Literature DB >> 34157965 |
Muez Berhe1,2, Komivi Dossa3,4,5, Jun You1, Pape Adama Mboup6, Idrissa Navel Diallo7,6, Diaga Diouf7, Xiurong Zhang1, Linhai Wang8.
Abstract
BACKGROUND: Sesame is a rare example of non-model and minor crop for which numerous genetic loci and candidate genes underlying features of interest have been disclosed at relatively high resolution. These progresses have been achieved thanks to the applications of the genome-wide association study (GWAS) approach. GWAS has benefited from the availability of high-quality genomes, re-sequencing data from thousands of genotypes, extensive transcriptome sequencing, development of haplotype map and web-based functional databases in sesame.Entities:
Keywords: GWAS; Genomics assisted breeding; Sesame; Statistical models
Mesh:
Year: 2021 PMID: 34157965 PMCID: PMC8218510 DOI: 10.1186/s12870-021-03046-x
Source DB: PubMed Journal: BMC Plant Biol ISSN: 1471-2229 Impact factor: 4.215
Comparison of power and efficiency of single and multi-locus models in GWAS for the detection of marker-trait associations
| Species | Sample size | Traits under study | Number of traits measured | SNPs number | Statistical Models | Number of models | Maximum QTN detected by each model | Co-detected QTN | Outperformed model | Recommended approach | References |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 188 | Flowering time | 6 | 216,130 | Single-locus | 2 | 25 | NA | mrMLM | Multi-locus | [ | |
| Multi-locus | 1 | 120 | |||||||||
| 199 | Flowering time | 4 | 216,130 | Single-locus | 4 | 21 | NA | FASTmrEMMA | Multi-locus | [ | |
| Multi-locus | 2 | 68 | |||||||||
| 144 | Embryonic callus regenerative capacity | 5 | 43427 | Single-locus | 1 | 1 | 63 | ISIS EMBLASSO | Multi-locus | [ | |
| Multi-locus | 4 | 160 | |||||||||
| 230 | Starch pasting properties | 7 | 145,232 | Single-locus | 1 | 7 | 7 | FASTmrEMMA, | Integrated | [ | |
| Multi-locus | 3 | 29 | |||||||||
| 160 | Fiber quality | 6 | 72,792 | Single-locus | 1 | NA | 70 | NA | Integrated | [ | |
| Multi-locus | 6 | ||||||||||
| 169 | Fiber quality | 5 | 53,848 | Single-locus | 3 | 342 | 15 | multi-locus | Integrated | [ | |
| Multi-locus | 3 | ||||||||||
| 368 | Plant height and number of models | 6 | 62,423 | Single-locus | 1 | 24 | NA | mrMLM | Multi-locus | [ | |
| Multi-locus | 1 | 64 | |||||||||
| 182 | Free amino acid level | 20 | 14,646 | Single-locus | 1 | 4 | 66 | pkWmEB | Integrated | [ | |
| Multi-locus | 6 | 117 | |||||||||
| 478 | Salt-tolerance | 5 | 165,529 | NA | 6 | NA | 56 | ISIS EM-BLASSO | Integrated | [ | |
| Multi-locus | 371 |
Summary of pan genome assembly in various plant species
| Plants | Number of assembled genome | References genome | Pan-genome | |||
|---|---|---|---|---|---|---|
| Number of total genes | % core gene | % of dispensable gene | References | |||
| Brassica | 21 | Darmor- | 105,672 | 56 | 42 | [ |
| Sesame | 5 | Zhongzhi13 | 26,472 | 58.21 | 41.79 | [ |
| Maize | 3 | B73 | 59,080 | 48.6 | 51.4 | [ |
| 96 | B73 | 4,400,000 | 74% | 26% | [ | |
| Rice | 66 | Nipponbare | 42,580 | 61.94 | 30.06 | [ |
| Arabidopsis | 18 | TAIR10 | 37,789 | 69.8 | 30.2 | [ |
| Soybean | 7 | GmaxW82 | NA | 48.6 | 51.4 | [ |
NA data not available
Summary of molecular marker based genetic diversity and population structure analysis in sesame
| Number of accessions | Source of collection | Marker type | Marker size | Detected alleles | Number of allele per locus | PIC | Genetic diversity | Sub populations identified | References |
|---|---|---|---|---|---|---|---|---|---|
| 96 | Asia and Africa (22 countries) | SSR | 33 | 137 | 4.15 | 0.45 | 0.508 | 5 | [ |
| 153 | Worldwide (22 countries) | SSR | 16 | 121 | 7.6 | 0.42 | 0.46 | 3 | [ |
| 404 | Chinese core collection | SRAP and SSR | 14 | 126 | 9 | 0.39 | 0.24 | 2 | [ |
| 453 | Chinese core collection | SRAD and SSR | 14 | 126 | 9 | 0.3467 | 0.2218 | 9 | [ |
| 49 | India | SSR | 20 | NA | 3 | 0.718 | NA | 2 | [ |
| 277 | 15 countries | SSR | 14 | 158 | 11.3 | 0.568 | NA | 4 | [ |
| 96 | China | SSR and InDels | 44 | 113 | 2.6 | 0.31 | 0.37 | 2 | [ |
| 545 | 390 from China, 155 outside China | SSR | 42 | 106 | NA | 0.41 | 0.645 | 3 | [ |
| 216 | Chinese core collection | SSR, SRAD and AFLP | 79 | 338 | 2 | 0.25 | 0.2090 | 2 | [ |
| 216 | Chinese core collection | SSR, SRAD and AFLP | 79 | 608 | 2 | 0.16 | 0.13 | 2 | [ |
| 130 | China | SSR and InDels | 88 | 325 | 3.69 | 0.36 | 0.432 | 2 | [ |
PIC Polymorphism Information Content
Summary of SNP marker based genetic diversity and population structure analysis in sesame
| Number of accessions | Sources of accessions | Number of effective SNPs | SNPs detection approach | Average marker density/SNP | Average nucleotide diversity of the panel | Genetic distance | Average gene diversity | Number of subgroups identified | References |
|---|---|---|---|---|---|---|---|---|---|
| 95 | Mediterranean sesame core collection (21 geographical regions) | 5,292 | ddRAD | 46SNP/kb | NA | 0.023 to 0.524 | 0.28 | [ | |
| 366 | HSRC-HAAS | 89,924 | SLAF-seq | 1SNP/2.6kb | 1.1×10-3 | 0.01 to 0.42 | 0.17 | 3 | [ |
| 705 | China gene bank | 254,781 | Whole-genome sequencing | 1SNP/50bp | 2.4×10-4 | 0.02 | 2 | [ | |
| 29 | China gene bank | 127,347 | Whole-genome sequencing | NA | 1.5 × 10-4 | NA | NA | NA | [ |
Fig. 1Flow chart showing key steps in GWAS implementation in sesame (prepared based on works at OCRI-CAAS)
Fig. 2Single-nucleotide polymorphism distributions on the 16 linkage groups (LGs) of the sesame genome assembly v1. The horizontal axis shows the LG length; the 0∼27841 legend insert shows the SNP density
Summary of RNA-seq data available for various investigated tissues in sesame
| Tissue sample | Condition/topic | Sample size | SRA accession numbers |
|---|---|---|---|
| Root | Salt | 30 | PRJNA524278 |
| Root | Osmotic stress | 12 | PRJNA552167 |
| Seed and capsule | Seed and carpel development | 22 | SRR6010084-SRR6010093-SRX396185-SRX396196 |
| Root | Drought | 30 | SRP095661 |
| Root, leaf, stem and shoot apical | Growth habit | KU240042 | |
| Flower buds | Fertile and sterile flower buds | 2 | SRP095661 |
| Root | Waterlogging | 6 | SRR2886790 |
| Leaf | Fusarium wilt disease | 8 | |
| Leaf, root, stem and flower | Multiple tissues | SRA122023 | |
| Seed | Seed developmental stage | 12 | SRP034617 |
| Seedling | Fusarium wilt disease | SRA047567.1 | |
| Seedling | Growth and development | 24 | SRA047563.1 |
| Seed | Oil content | 6 | JK045130-JK086377 |
| Root, leaf, flower, developing seed, and shoot tip | Multiple tissues | 5 | SRP006700 |
Fig. 3SiGeDiD: an online catalogue of functional genomic discoveries in sesame (http://sigedid.ucad.sn/)
Summary for GWAS results reported so far in sesame
| Sample size | Targeted traits | Group of traits | SNP size | SNP density | Significant associations identified | Number of candidate genes | Potential genes | References |
|---|---|---|---|---|---|---|---|---|
| 705 | 56 | Oil content, nutrient composition, yield components, morphological characteristics, growth cycle, coloration and disease resistance | 1,805,413 | 1SNP/50bp | 446 | 46 | - - - - - - - - - | [ |
| 705 | 39 | Yield index, seed traits, capsule number, capsule size, and capsule pericarp | 1,805,413 | 1SNP /50bp | 646 | 48 | - - | [ |
| 490 | 4 | Drought and salt tolerances | 1,005,413 | 2,7SNP / kb | 252 | 40 | - - - - - - - | [ |
| 400 | 5 | Drought (stem length, survival rate, wilting level, capsule number and seed yield) | 1,000,939 | 5SNP /kb | 569 | 102 | - - - | [ |
| 96 | 1 | Vitamin E | 5,962 | 2.3SNP /100 kb | 8 | 12 | - LG08_6621957 loci ~ γ-tocopherol - SLG03_13104062 loci ~β-tocotrienol | [ |
| 87 | 1 | Phytophthora disease-resistant | 8,883 | NA | 44 | 68 | - - - - | [ |
| 366 | 1 | Seed coat color | 42,781 | 1SNP/2.6kb | 224 | 92 | - - - - | [ |
| 327 | 7 | Root traits | 1,000,000 | 5SNP /kb | 409 | 32 | - | [ |
Fig. 4GWAS applications in sesame. a Circos plot summarizing genetic findings of important agronomic traits in sesame. (A) Pseudomolecules (LG), (B) gene density, (C) QTL position, (D) -log(p) of the peak SNPs, (E) pleiotropic QTLs; b Schematic diagram showing potential candidate genes discovered so far related to important agronomic traits in sesame. The image of the sesame plant has been specifically designed in this study
Fig. 5Application of new statistical multi-locus models in sesame. a and b Negative log10 P-values for association of root length (Y-axis) are plotted against SNP positions (X-axis) using the multi-locus models, mrMLM and FASTmrEMMA, respectively; c Negative log10 P-values for association of root length (Y-axis) are plotted against SNP positions (X-axis) using the single-locus model, EMMAX; d and e Negative log10 P-values for association of seed coat color (Y-axis) are plotted against SNP positions (X-axis) using the multi-locus models, mrMLM and FASTmrEMMA, respectively; f Negative log10 P-values for association of seed coat color (Y-axis) are plotted against SNP positions (X-axis) using the single-locus model, EMMAX. For both traits, a horizontal dash–dot line indicates the significant P-value threshold (10-6) and the significant SNPs are highlighted by red color, vertical line indicates overlapped most significant peaks at least in two models; g Venn diagrams showing the shared and uniquely detected significant SNPs by each model for root length GWAS respectively; h, Venn diagrams depicting the shared and uniquely detected significant SNPs by each model for seed coat color GWAS. The phenotypic and genotypic data for this analysis were obtained from 350 sesame accessions and 1,000,000 common SNPs for root length and data from 705 sesame accessions and 1,805,413 common SNPs for seed coat color GWAS study
Summary of significant SNPs associated with root length and seed coat color within the linkage groups (LG) identified by each model during GWAS in sesame
| Trait | LG | GWAS models | ||
|---|---|---|---|---|
| mrMLM FASTmrEMMA | mrMLM | Emmax | ||
| LG1 | 167 | 163 | 162 | |
| LG4 | 7 | 6 | 0 | |
| LG5 | 7 | 3 | 0 | |
| LG7 | 1 | 1 | 1 | |
| LG10 | 5 | 5 | 0 | |
| LG15 | 3 | 3 | 0 | |
| LG1 | 0 | 4 | 0 | |
| LG4 | 0 | 0 | 0 | |
| LG6 | 67 | 89 | 142 | |
| LG7 | 0 | 1 | 1 | |
| LG12 | 0 | 349 | 0 | |
| LG16 | 0 | 48 | 0 | |