| Literature DB >> 35422045 |
Jessen V Bredeson1, Jessica B Lyons1,2, Ibukun O Oniyinde3, Nneka R Okereke4, Olufisayo Kolade3, Ikenna Nnabue4, Christian O Nwadili4, Eva Hřibová5, Matthew Parker6, Jeremiah Nwogha4, Shengqiang Shu7, Joseph Carlson7, Robert Kariba8,9, Samuel Muthemba8,9, Katarzyna Knop6, Geoffrey J Barton6, Anna V Sherwood6,10, Antonio Lopez-Montes3,11, Robert Asiedu3, Ramni Jamnadass8,9, Alice Muchugi8,9, David Goodstein7, Chiedozie N Egesi3,4,12, Jonathan Featherston13, Asrat Asfaw3, Gordon G Simpson6,14, Jaroslav Doležel5, Prasad S Hendre8,9, Allen Van Deynze15, Pullikanti Lava Kumar3, Jude E Obidiegwu16, Ranjana Bhattacharjee17, Daniel S Rokhsar18,19,20,21,22.
Abstract
The nutrient-rich tubers of the greater yam, Dioscorea alata L., provide food and income security for millions of people around the world. Despite its global importance, however, greater yam remains an orphan crop. Here, we address this resource gap by presenting a highly contiguous chromosome-scale genome assembly of D. alata combined with a dense genetic map derived from African breeding populations. The genome sequence reveals an ancient allotetraploidization in the Dioscorea lineage, followed by extensive genome-wide reorganization. Using the genomic tools, we find quantitative trait loci for resistance to anthracnose, a damaging fungal pathogen of yam, and several tuber quality traits. Genomic analysis of breeding lines reveals both extensive inbreeding as well as regions of extensive heterozygosity that may represent interspecific introgression during domestication. These tools and insights will enable yam breeders to unlock the potential of this staple crop and take full advantage of its adaptability to varied environments.Entities:
Mesh:
Year: 2022 PMID: 35422045 PMCID: PMC9010478 DOI: 10.1038/s41467-022-29114-w
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 17.694
Assembly and annotation statistics.
| Assembly statistic | Value |
|---|---|
| Scaffold sequence total/count | 480.0 Mb/25 |
| Scaffold N50 length/count | 24.0 Mb/9 |
| Scaffold N90 length/count | 19.5 Mb/18 |
| Contig sequence total/count | 479.5 Mb/532 |
| Contig N50 length/count | 4.5 Mb/31 |
| Contig N90 length/count | 565.0 kb/126 |
aThe longest transcript for each protein-coding gene.
bAll other splice isoforms.
Mapping populations used in this study.
| Pop. IDa | Inst. | Seed parent | Pollen parent | Putative parental relationb | Trait(s) studied |
|---|---|---|---|---|---|
| TDa1401 | IITA | TDa05/00015 | TDa99/00048 | Half avuncular | Anthracnose susceptibility (field, DLA) |
| TDa1402 | IITA | TDa05/00015 | TDa02/00012 | Fourth-degree relative | Anthracnose susceptibility (fieldc, DLA), tuber fresh weight, tuber dry weight, tuber flesh color, tuber oxidation, dry matter content |
| TDa1403 | IITA | TDa00/00005 | TDa02/00012 | Third-degree relative | Anthracnose susceptibility (field, DLA), tuber fresh weight, tuber dry weight, tuber flesh color, tuber oxidation, dry matter content |
| TDa1419 | IITA | TDa99/00240 | TDa02/00012 | Unrelated | Anthracnose susceptibility (field, DLAc), tuber fresh weight, tuber dry weight, dry matter contentc, tuber oxidationc, tuber flesh color |
| TDa1427 | IITA | TDa95/00328 | TDa02/00012 | Unrelated | Anthracnose susceptibility (field, DLA), tuber fresh weight, tuber dry weight, tuber flesh color, tuber oxidation, dry matter content |
| TDa1401B | NRCRI | TDa05/00015 | TDa99/00048 | Half avuncular | Anthracnose susceptibility (DLA), presence of corm, ability of corm to separate, corm type, tuber shape, tuber sizec, tuber surface texture, roots on tuber, placement of roots on tuber |
| TDa1506 TDa1621 | NRCRI NRCRI | TDa05/00015 | TDa02/00012 | Fourth-degree relative | (In TDa1506) Anthracnose susceptibility (DLA), presence of corm, ability of corm to separate, corm type, tuber shape, tuber size, tuber surface texture, roots on tuber, placement of roots on tuber |
| TDa1512 TDa1603 | NRCRI NRCRI | TDa00/00005 | TDa01/00039 | Parent– offspring | (In TDa1512) Anthracnose susceptibility (DLA), presence of corm, ability of corm to separate, corm type, tuber shape, tuber size, tuber surface texture, roots on tuber, placement of roots on tuber |
| TDa1610 | NRCRI | TDa99/00240 | TDa02/00012 | Unrelated | – |
Pop. ID mapping population identifier, Inst. institution that grew the plants and performed the phenotyping, Parental Relation parental relatedness as assessed in this study, DLA detached leaf assay.
aThe first two digits in a population ID denote the year of crossing. All crosses were performed at IITA and, where applicable, progeny were sent to NRCRI as botanical seeds. For mapping populations that share parents across institutes, subsets of the progeny were sent to NRCRI. For NRCRI crosses with the same parents but different population IDs (TDa1506/1621 and TDa1512/1603), the second population ID was assigned to those individuals from a cross performed with the same parents in a subsequent year. We treated these pairs as single populations for the purposes of linkage mapping, but individually for QTL analyses.
bPutative parental relations derived from Fig. 4.
cTraits for which significant QTL were identified (see Table 3).
Fig. 4Relationships between eight deeply sequenced D. alata breeding lines.
a Matrix of identity-by-descent (IBD) relatedness between all pairs of individuals (Supplementary Data 3). Blue represents the degree of diploid genome identity (IBD2); white, one haplotype (IBD1); and black, no shared haplotypes (IBD0). b Pedigree of relationships. Sequenced individuals are represented in black boxes with white text, while individuals not sequenced are in gray. Relationships known via IITA records (Supplementary Table 1) are drawn with solid lines. Relationships that could be confirmed using direct sequence comparison are highlighted with solid black lines, and those that could not be are colored grey. Inferred cryptic relationships are indicated with broken lines (first- and second-degree relations are represented as thick dashed and thin dotted lines, respectively). Unexpectedly, TDa95-310 is a parent of TDa00/00005 and a likely second-degree relative of TDa02/00012. c Regions of heterozygosity, autozygosity, and possible introgression. Within a background of intraspecific genetic variation (light cyan), large homozygous blocks (runs of homozygosity [ROH], dark cyan) appear common in the resequenced individuals, suggesting autozygosity from historical inbreeding. In addition, large blocks of exceptionally high heterozygosity (yellow) can also be observed, indicating possible introgressions (interspecific variation introduced via hybridization) in one or more of the unsampled pedigree founders. The recombination rate along each chromosome is shown in the track above. d Haplotype sharing between TDa95/00328 and all other resequenced individuals, and TDa95-310 and all others. Regions of the genome where an individual shares two haplotypes (i.e., they are IBD2) with TDa95/00328 (or TDa95-310) are highlighted in blue, one shares haplotype (IBD1) in gray, or shares no haplotypes (IBD0) in black. Source data are provided as a Source Data file.
Significant QTL identified in this study.
| Pop. ID | Trait | QTL peak position | Variant | Significance Windowa | |||
|---|---|---|---|---|---|---|---|
| TDa1402 | Anthracnose susceptibility (Field 2017) | Chr5: 22,308,637 | 53 | 1.69 × 10−4 | A/A,A/G,G/G | 0.4820 | 21,931,073 22,825,712 |
| TDa1402 | Anthracnose susceptibility (Field 2018) | Chr19: 8,369,514 | 49 | 1.25 × 10−2 | T/T,T/C | 0.2986 | 3,732,307 17,565,140 |
| TDa1419 | Anthracnose DLA 3-yr mean | Chr6: 61,001 | 243 | 1.28 × 10−2 | C/C,C/T | 0.0734 | 38,157 1,418,849 |
| TDa1419 | Dry matter | Chr18: 25,069,928 | 150 | 2.27 × 10−2 | C/C,C/T | 0.1020 | 24,779,355 25,415,124 |
| TDa1419 | Oxidation after 30 minb | Chr18: 26,496,992 | 151 | 5.86 × 10−3 | T/T,T/A,A/A | 0.1367 | 26,199,630 26,749,589 |
| TDa1419 | Oxidation after 180 minb | Chr18: 26,496,992 | 151 | 1.38 × 10−2 | T/T,T/A,A/A | 0.1188 | 26,199,630 26,749,589 |
| TDa1427 | Oxidation after 30 min | Chr18: 24,495,033 | 97 | 4.52 × 10−6 | A/A,A/G | 0.3127 | 24,034,264 24,938,398 |
| TDa1401B | Tuber size | Chr12: 310,852 | 53 | 4.19 × 10−2 | T/T,T/C,C/C | 0.2894 | 76,400 489,583 |
| TDa1512 | Tuber shape | Chr7: 3,115,608 | 43 | 3.17 × 10−2 | A/A,A/G | 0.3406 | 1,798,899 5,707,988 |
Pop. ID mapping population identifier, n the number of genotyped and phenotyped progeny used in QTL analysis, p-value empirical significance (α = 0.05) of the genotype-phenotype association at the peak locus, calculated by Wald statistic-based logistic regression and corrected for family-wise multiple testing by the max(T) method, Variant alleles segregating at QTL peak position, h2 narrow-sense heritability.
aCalculated as haplotypic linkage disequilibrium ≥0.9 relative to the peak QTL marker.
bSame QTL for both oxidation time points in TDa1419.
Fig. 1D. alata genome structure and recombination.
a HiC contact matrix of TDa95/00328 chromosomes 4 and 5. Within chromosomes, the band of high contact density along the diagonal reflects the well-ordered underlying assembly. The checkerboard pattern observed between 75 and 85 Mb indicates chromatin domain A/B compartmentalization[156] within chromosome 4. The winged pattern observed within chromosomes, particularly chromosome 5, showing elevated contact densities between chromosome ends is typical of Rabl-structured chromosomes in the nucleus[29]. Chromosomes are outlined with cyan boxes. Each pixel represents the intersection between a pair of 50 kb loci along the chromosomes. The density of contacts between two loci is proportional to pixel color, with darker pixels representing more contacts and lighter representing fewer. b A composite genetic linkage map (black points), integrating five mapping populations (colored points, legend), is shown for chromosome 5. The maps exhibit highly concordant marker orders (Kendall’s tau correlations between 0.9091 and 0.9626) and validate the large-scale correctness of the chromosome-scale assembly. The sigmoidal shape of the maps along the physical chromosome reflects suppressed recombination within the pericentromere. Individual component maps were scaled and shifted vertically to display their marker-order concordance. c The D. alata chromosome landscape is shown. Transposable elements (TEs; tan lines, left Y-axis) are enriched within the pericentromeres; while low-complexity repeat (LCR; brown, right Y-axis), protein-coding gene (dark blue line, left Y-axis), and meiotic recombination (cyan lines, right Y-axis) densities are elevated nearer the chromosome ends. Densities were computed using 500 kb bins. Composite map marker positions are shown as black ticks under the X-axis, with A/B chromatin compartment structure drawn below (A compartment domains in gold; B domains in dark cyan). cM centiMorgan, Mb megabase. Source data are provided as a Source Data file.
Fig. 2Dioscoreaceae chromosome evolution.
a Ribbon diagram demonstrating conserved chromosomal synteny and large-scale segmental collinearity (semi-transparent gray ribbons) between Dioscorea alata (black horizontal bars), D. rotundata (gold), and D. zingiberensis (cyan) one-to-one orthologous gene pairs. Only D. rotundata sequences with five or more collinear genes are shown. To improve visual clarity, some chromosomes, marked with asterisks, were reverse complemented with respect to their assembled sequences. Chromosome sizes are proportional to the number of annotated genes. b Dot plot showing evidence of two whole-genome duplications exposed by TDa95/00328 intragenomic comparison. Each point represents a mutual best-hit (MBH) gene pair and each white box (outlined in grey) represents the intersection of two chromosomes. Homoeology from the recent Dioscoreaceae delta duplication is shown in black and the ancient, core monocot tau duplication can be seen as clusters in blue (see also, Supplementary Fig. 9). c The synonymous substitution rate (K) histograms for orthologous (solid lines) or homoeologous (dotted lines) gene pairs between D. alata and select species comparators are shown: D. rotundata (n = 14,889), T. zeylanicus (n = 9013), D. alata delta (n = 1578), A. comosus (n = 6405), D. alata tau (n = 404), and S. polyrhiza (n = 4973). The D. alata–D. rotundata ortholog density was rescaled by 0.25 to emphasize other comparisons. d Shared segmental homoeology (semi-transparent gray) between D. alata chromosomes (black horizontal bars) resulting from the delta duplication is depicted with a ribbon diagram, as in panel a, but with putative centromere positions now included as gold circles (Supplementary Data 2). Source data are provided as a Source Data file.
Fig. 3Quantitative trait locus for anthracnose resistance.
a Exemplars of the yam anthracnose disease (YAD) field assessment severity rating scale (scored 1–5) used at IITA in Ibadan, Nigeria. b Genome-wide QTL association scan for YAD resistance in the TDa1402 genetic population (n = 53 biologically independent samples) for the year 2017. A statistically significant association (corrected p = 1.69 × 10−4) was found on chromosome 5, at 23.3 Mb. Per-locus Wald statistic-based logistic regression significance values (gray line) were corrected for multiple testing (black line) via max(T) adjustment with 1 × 106 permutations. The minimum significance threshold (α = 0.05) is represented with a cyan horizontal line. c Effect plot for the peak locus on chromosome 5 at 23.3 Mb, the genotypes (X-axis) of which explain 48.2% of the observed phenotypic variance (i.e., narrow-sense heritability, h2), suggests that an increased dose of the ‘A’ allele is associated with lower severity of YAD. Centerline and whisker plots, and their corresponding statistics (X-axis), represent the mean ± 95% confidence intervals. d Plot showing the strength of linkage disequilibrium (LD) between the peak marker (cyan diamond) and other loci (black points) in chromosome 5. LD was calculated as Pearson’s correlation (r) between alleles. Source data are provided as a Source Data file.