| Literature DB >> 34170619 |
Lilan Zhang1,2,3, Xiaokai Ma3, Xingtan Zhang3, Yi Xu1,2,3, Aminu Kurawa Ibrahim1,2,3, Jiayu Yao1,2,3, Huaxing Huang3, Shuai Chen3, Zhenyang Liao3, Qing Zhang3, Sylvain Niyitanga1,2, Jiaxin Yu3, Yi Liu3, Xiuming Xu3, Jingjing Wang3, Aifen Tao1,2, Jiantang Xu1,2, Siyuan Chen1,2, Xin Yang1,2, Qingyao He1,2, Lihui Lin1,2, Pingping Fang1,2, Liemei Zhang1, Ray Ming4, Jianmin Qi1,2, Liwu Zhang1,2,3.
Abstract
Cultivated jute, which comprises the two species Corchorus capsularis and C. olitorius, is the second most important natural fibre source after cotton. Here we describe chromosome-level assemblies of the genomes of both cultivated species. The C. capsularis and C. olitorius assemblies are each comprised of seven pseudo-chromosomes, with the C. capsularis assembly consisting of 336 Mb with 25,874 genes and the C. olitorius assembly containing 361 Mb with 28 479 genes. Although the two Corchorus genomes exhibit collinearity, the genome of C. olitorius contains 25 Mb of additional sequences than that of C. capsularis with 13 putative inversions, which might give a hint to the difference of phenotypic variants between the two cultivated jute species. Analysis of gene expression in isolated fibre tissues reveals candidate genes involved in fibre development. Our analysis of the population structures of 242 cultivars from C. capsularis and 57 cultivars from C. olitorius by whole-genome resequencing resulted in post-domestication bottlenecks occurred ~2000 years ago in these species. We identified hundreds of putative significant marker-trait associations (MTAs) controlling fibre fineness, cellulose content and lignin content of fibre by integrating data from genome-wide association studies (GWAS) with data from analyses of selective sweeps due to natural and artificial selection in these two jute species. Among them, we further validated that CcCOBRA1 and CcC4H1 regulate fibre quality in transgenic plants via improving the biosynthesis of the secondary cell wall. Our results yielded important new resources for functional genomics research and genetic improvement in jute and allied fibre crops.Entities:
Keywords: artificial selection; bast fibre; domestication; genome; jute (Corchorus)
Mesh:
Substances:
Year: 2021 PMID: 34170619 PMCID: PMC8541789 DOI: 10.1111/pbi.13652
Source DB: PubMed Journal: Plant Biotechnol J ISSN: 1467-7644 Impact factor: 9.803
Figure 1Genome assembly and phylogenetic analysis of Corchorus capsularis and C. olitorius. (a) Genome assembly layout. Comparison of C. capsularis var. ‘Huangma 179’ (HM179) and C. olitorius var. ‘Kuanyechangguo’ (KYCG) reference genomes contig level assembly with the previously published genomes contig level assembly. The top track shows the present genomes contig level assembly, while the bottom track shows the previously published genomes contig level assembly. (b) Basic genome information, including pseudo‐chromosomes (A), transposable elements density (B), gene density (C) and syntenic block (D), in C. capsularis and C. olitorius. (c) Genome‐wide distribution of the structural variations identified between C. capsularis and C. olitorius.
Global statistics comparison of Corchorus capsularis and C. olitorius assembly between the present genomes and previously published genomes
| Items | Previously published genomes | Present genomes | ||||
|---|---|---|---|---|---|---|
| Contig level assembly | Contig level assembly | Chromosomal level assembly | ||||
|
|
|
|
|
|
| |
| No. of sequences | 16,522 | 24,918 | 340 | 678 | 71 | 284 |
| Max length (Mb) | 0.34 | 0.36 | 17 | 15 | 57 | 60 |
| Total size (Mb) | 317 | 335 | 340 | 394 | 336 | 361 |
| N50 (Mb) | 0.046 | 0.045 | 3,2 | 1,5 | 46 | 50 |
| Average length (bp) | 19,197 | 13,440 | 1,001,680 | 580,535 | 4,797,158 | 1,386,532 |
Figure 2Overview of fibre development regulation in jute (Corchorus capsularis and C. olitorius). (a) Schematic map of fibre formation. Fibre formation related genes are listed in Table S11. (b) Comparison of copy numbers and expression of genes involved in fibre initiation and elongation among Cc, Co, Ath, Gra and Hca. On the left panel, x‐axis indicates number of genes and y‐axis presents name of target genes. Expression profile of genes involved in fibre initiation and elongation in stem barks between control (CK) and 3 days after GA3 treatment (GA3.3d) at the vigorous growth stage is shown on the right panel. Cc, C. capsularis; Co, C. olitorius; Ath, A. thaliana; Gra, G. raimondii; Hca, H. cannabinus. (c) Comparison of copy numbers (left panel) and expression (right panel) of genes involved in lignocellulosic biosynthesis enzymes among Cc, Co, Ath, Gra and Hca. The x‐ and y‐axes were same as aforementioned in b.
Figure 3Geographical distribution and population structure of Corchorus capsularis and C. olitorius populations. (a) The predicted distribution map of C. capsularis and C. olitorius inferred using their longitude and latitude positions. (b) Linkage disequilibrium (LD) decay revealed different patterns of LD decay for these two species, with values reaching half of the r values at 25 and 81 kb for C. capsularis and C. olitorius, respectively. (c) PCA shows clear separation among the C. capsularis and C. olitorius populations and C. aestuans outgroup. (d) Population admixture showing K = 2 using 299 accessions of both C. capsularis and C. olitorius. (e) Historical effective population size (Ne) for domesticated jute population beginning from 101 KYA (kilo‐years ago) to present. The plot showing jute population has undergone domesticated bottlenecks at ∼2000 years ago. The estimate is median (red line) from 200 bootstrap replicates with 87.5% and 97.5% confidence intervals (four grey lines). (f) Phylogentic relatonships among 242 accessions within C. capsularis. (g) Phylogentic relatonships among 57 accessions within C. olitorius.
Figure 4Genome‐wide genetic diversity and divergent selection between Corchorus capsularis and C. olitorius populations. (a) Population divergence (Fst) between C. capsularis and C. olitorius based on C. capsularis genome is shown in inner cycle on circos plot. SNP density, nucleotide diversity (π), Tajima’s D of C. capsularis populations are shown from outer to center on circos plot. (b) Population divergence (Fst) between C. capsularis and C. olitorius based on C. olitorius genome is shown in inner cycle on circos plot. SNP density, nucleotide diversity (π), Tajima’s D of C. olitorius populations are shown from outer to center on circos plot. (c) COG functioal classification of fixed sites under selection in C. capsularis population comparing to the C. olitorius population. (d) KEGG enrichments and of fixed sites under selection in C. capsularis population comparing to the C. olitorius population. (e) Manhattan plot of fixed sites under selection in C. capsularis population comparing to the C. olitorius population. (f) COG functioal classification of fixed sites under selection in C. olitorius population comparing to the C. capsularis population. (g) KEGG enrichments and fixed sites under selection in C. olitorius population comparing to the C. capsularis population. (h) Manhattan plot of fixed sites under selection in the C. olitorius population comparing to the C. capsularis population.
Figure 5Genome‐wide identification of genes in fibre quality in jute. (a) Manhattan plot showing significant jute locus associated with fibre fineness. Negative log10 value values from a genome‐wide scan are plotted against position on each of seven chromosomes. Dashed line represents the significance threshold. The COBRA family proteins are known to control the correct location of cellulose microfibrils in cell wall and directional elongation of cells in other plants. (b) The expression analysis of CcCOBRA1 in transgenic over‐expression lines of A. thaliana by qRT‐PCR. (c) The plant height of WT and transgenic over‐expression lines in A. thaliana. Error bars, SD (n = 3). **P < 0.01(Welch’s t‐test). (d) The cellulose content of wild types (WT) and transgenic A. thaliana plants. Error bars, SD (n = 3). **P < 0.01(Welch’s t‐test). (e) The transcription level of lignin marker genes, At4CL1 and AtCCOAOMT1, in transgenic lines and WT plants (data are the means ± SD of three biological replicates). (f) Manhattan plot showing significant jute locus associated with cellulose content of bast fibre. Negative log10 value values from a genome‐wide scan are plotted against position on each of seven chromosomes. Dashed line represents the significance threshold. (g) Boxplots for cellulose content based on the genotypes of CcC4H1 in different jute populations. Box edges represent the 0.25 quantile and 0.75 quantile with the median values shown by bold lines. Whiskers extend to data no more than 1.5 times the interquartile range, and remaining data are indicated by dots. Differences between the genotypes were analyzed by Welch’s t‐test. **P < 0.01. Cc, C. capsularis; Co, C. olitorius. (h) Gene structure and genetic variants in a candidate gene CcC4H1. Exons and introns are represented by boxes and lines, respectively. The position of the possibly causal micro‐structure variation is marked. (i) Genotyping of CcC4H1 gene in jute population by Kompetitive Allele Specific PCR (KASP). Blue and red points represent two homozygous genotype, green points represent parental complementary genotype, black squares represent non‐template control (NTC).
Figure 6Selective swept genes related to fibre quality traits in C. capsularis and C. olitorius populations. (a) Genome‐wide selective sweeps in cultivated C. capsularis showing 5% cutoff outlier of absolute CLR statistics across reference genomes, respectively. (b) Genome‐wide selective sweeps in cultivated C. olitorius showing 5% cutoff outlier of absolute CLR statistics across reference genomes, respectively. The red solid line indicates the candidate regions above the 5% cutoff outlier with significant deviations from neutrality. The highlighted signals showing overlapped genes identified in controlling the fibre development using gene family analysis and GWAS mapping.