| Literature DB >> 34736486 |
Shouchuang Wang1,2,3, Yong Xiao1,4, Zhi-Wei Zhou5,6, Jiaqing Yuan7, Hao Guo2, Zhuang Yang2,3, Jun Yang2, Pengchuan Sun8, Lisong Sun2,3, Yuan Deng2,3, Wen-Zhao Xie5, Jia-Ming Song6, Muhammad Tahir Ul Qamar6, Wei Xia2, Rui Liu1, Shufang Gong1, Yong Wang1, Fuyou Wang1, Xianqing Liu2, Alisdair R Fernie9, Xiyin Wang10, Haikuo Fan11, Ling-Ling Chen12, Jie Luo13,14.
Abstract
BACKGROUND: Coconut is an important tropical oil and fruit crop whose evolutionary position renders it a fantastic species for the investigation of the evolution of monocot chromosomes and the subsequent differentiation of ancient plants.Entities:
Mesh:
Year: 2021 PMID: 34736486 PMCID: PMC8567702 DOI: 10.1186/s13059-021-02522-9
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Summary of genome assembly and annotation
| Number of contigs | 2433 | 401 |
| Total size of contigs (bp) | 2,393,894,742 | 2,399,649,694 |
| Longest contig | 34,251,876 | 56,639,188 |
| Contig N50 count | 196 | 50 |
| Contig N50 length (bp) | 2,927,039 | 14,296,645 |
| Contig N90 count | 1044 | 177 |
| Contig N90 length (bp) | 442,494 | 3,844,418 |
| GC content | 37.49 | 37.57 |
| Number of protein-coding genes | 29,897 | 28,111 |
| Mean transcript length (bp) | 1698 | 1709 |
| Mean exon length (bp) | 272 | 272 |
| Mean exon per mRNA | 6.23 | 6.27 |
| Total repetitive sequences size (% of genome) | 2,000,576,416 (83.55%) | 2,015,579,366 (84.00%) |
Fig. 1Cocos nucifera genome assembly, and genomic features. a The circos diagram of Cn. tall and Cn. dwarf. The circles from outer to inner separately represented GC (guanine-cytosine) content (I); gene density (II); transposable element (TE) density (III); SNPs density (IV); InDels density (V); Colinear links (VI). b Genome collinear analysis between C. nucifera and N. colorata genomes (window size of 1 Mb). c Density distributions of the Ks values for homologous genes. d Phylogenetic tree of Cn. tall, Cn. dwarf, and other 15 angiosperm species including their divergence time based on orthologues of the single-gene family. The circles represent the diploid events and the squares represent the triploid events. e Clusters of orthologous and paralogous gene families in C. nucifera, and three palm species
Fig. 2Inference of monocot proto-chromosomes and reconstruction of evolutionary trajectories of the extant coconut (Cn) chromosomes. a Identification of proto-chromosomes through evaluating shared homology between extant chromosomes. The chromosomes of the outgroup monocot, S. polyrhiza (Sp), were compared to those in the other monocots. Inferred proto-chromosomes, A1–A10, were each shown in a specific color. Dotplots show shared gene collinearity, and orthologous correspondence were shown by filled and transparent rectangles. Greek letters were used to show which event produced the chromosomes with arrows pointed to. Cn: C. nucifera; Ac: A. comosus. b Plant phylogeny and reconstructed monocot proto-karyotypes. Proto-chromosomes at specific evolutionary nodes were inferred, such as nine proto-chromosomes before the WGD omega. Chromosomes are denoted with color scheme showing 10 monocot proto-chromosomes. WGD and WGT are shown, and some of which are named with Greek letters. c Reconstruction of evolutionary trajectories from monocot proto-chromosomes to form the extant coconut (Cn) chromosomes. Proto-chromosomes on different evolutionary nodes (A–E) were reconstructed and correspondingly named. Greek letters were used to relate to each polyploidization event. “EEJ” indicates end-to-end joining of chromosomes, “Reorg.” indicates reorganization, and “Ins.” indicates insertion. The sign of “Chr. B” represents mini-chromosome (B chromosome) produced by the “EEJ” process
Fig. 3Comparative genomic analysis between Cn. tall and Cn. dwarf. a Genome collinearity between Cn. tall and Cn. dwarf. The size less than 100 kb of colinear syntenic blocks are filtered out. b PAV distribution across the genomes of Cn. tall and Cn. dwarf. c The upper graph shows the percentage of PAVs with TEs in Cn. tall and Cn. dwarf. the other graph shows the distribution of PAVs in different gene parts. d Sequence comparison between Cn. tall genome and Cn. dwarf genome at chromosome 04 and 12, the figure below shows the expression of genes special presence in Cn. tall in different tissues of coconut. The values on the y-axis correspond to the expression levels of cellulose synthase A (GZ04G0103690) and gibberellin-regulated protein (GZ12G0246340). Cellulose synthase A, gibberellin-regulated protein, and other genes are indicated with blue, red, and grey, respectively. e Gene copy number and domain architecture of genes, which related to differences in Cn. tall (Cn. T), Cn. dwarf (Cn. D), E. guineensis (Eg), M. acuminate (Ma), and O. sativa (Os). The red numbers indicate differences in the number of genes between coconuts and other species
Fig. 4Molecular mechanism of fiber content difference between Cn. tall and Cn. dwarf. a Fruits of Cn. tall and Cn. dwarf with a growth period of 8 months and 12 months. b The average thickness and weight of coconut fiber in the growth period of 8 months and 12 months. c Synthetic pathway of coconut lignin. The red numbers indicate the same number of these genes in Cn. tall and Cn. dwarf, while the purple numbers indicate different numbers. d, e The expression patterns of structural genes (d) and regulatory genes (e) related to lignin metabolism and synthesis in different tissues of Cn. tall and Cn. dwarf. f The phylogenetic tree analysis of the key gene family CESA for coconut cellulose synthesis. g The expression pattern of CESA gene related to cellulose synthesis in different tissues of coconut. h Epidermal cells of leaves of Cn. tall and Cn. dwarf stained by phloroglucinol/HCl
Fig. 5Multi-omics integrated analysis reveals the molecular basis of coconut plant height differences. a The leaf internode phenotype of Cn. tall and Cn. dwarf. b Genome-wide association analysis of the leaf internode phenotype of coconut. The upper panel shows the Manhattan plot of SNP-GWAS for leaf internode, and the lower panel shows copy number variation between Cn. tall and Cn. dwarf in chr12 chromosome, and the right panel is haplotype analysis of leaf spacing. P values were determined using two-tailed Student’s t test. c Images of Cn. tall and Cn. dwarf at seedling stage. d Comparison of height and GA20ox expression level by qRT-PCR between Cn. tall and Cn. dwarf. e Depiction of gibberellins metabolic pathway and gene expression bias between Cn. tall and Cn. dwarf. f Difference of gibberellin content in Cn. tall and Cn. dwarf. at seedling stage. g Optical microscope photographs of cells of Cn. tall and Cn. dwarf. h Differences in leaf cell length of Cn. tall and Cn. dwarf at seedling stage