| Literature DB >> 32893861 |
John P Hamilton1, Grant T Godden2, Emily Lanier3, Wajid Waheed Bhat3, Taliesin J Kinser2,4, Brieanne Vaillancourt1, Haiyan Wang1, Joshua C Wood1, Jiming Jiang1,5,6, Pamela S Soltis2, Douglas E Soltis2,4, Bjoern Hamberger3,6, C Robin Buell1,6,7.
Abstract
BACKGROUND: Plants exhibit wide chemical diversity due to the production of specialized metabolites that function as pollinator attractants, defensive compounds, and signaling molecules. Lamiaceae (mints) are known for their chemodiversity and have been cultivated for use as culinary herbs, as well as sources of insect repellents, health-promoting compounds, and fragrance.Entities:
Keywords: beautyberry; callicarpenal; clerodane; gene cluster; insect repellent; kolavenyl diphosphate; specialized metabolites; terpene synthase
Year: 2020 PMID: 32893861 PMCID: PMC7476102 DOI: 10.1093/gigascience/giaa093
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Figure 1:A, Callicarpa americana L. (beautyberry) plant with fruit. B, Somatic chromosome squash of a root tip cell of C. americana with 2n = 34. Bar = 10 μm.
Metrics of final Callicarpa americana L. genome assembly
| Feature | Metric |
|---|---|
| Canu-derived contigs | |
| N50 Contig size (bp) | 7,510,543 |
| NG50 Contig size (bp) | 6,369,058 |
| L50 Contig count | 25 |
| LG50 Contig count | 27 |
| Total assembly size (bp) | 506,106,333 |
| No. of contigs | 965 |
| Maximum contig length (bp) | 18,804,173 |
| Minimum contig length (bp) | 1,028 |
| Hi-C scaffolded assembly | |
| N50 Scaffold size (bp) | 29,054,287 |
| NG50 Scaffold size (bp) | 28,692,425 |
| L50 Scaffold count | 8 |
| LG50 Scaffold count | 9 |
| Total assembly size (bp) | 506,362,408 |
| No. of scaffolds | 328 |
| Maximum scaffold length (bp) | 39,429,362 |
| Minimum scaffold length (bp) | 1,028 |
| No. of pseudomolecules | 17 |
| Total pseudomolecule size (bp) | 493,744,786 |
| No. of unanchored scaffolds | 311 |
| Total unanchored scaffolds size (bp) | 12,617,622 |
| Pseudomolecules (bp) | |
| Chr01 | 39,429,362 |
| Chr02 | 32,953,817 |
| Chr03 | 32,428,638 |
| Chr04 | 32,381,817 |
| Chr05 | 31,681,419 |
| Chr06 | 31,029,626 |
| Chr07 | 29,370,463 |
| Chr08 | 29,054,287 |
| Chr09 | 28,692,425 |
| Chr10 | 28,677,202 |
| Chr11 | 28,224,296 |
| Chr12 | 27,270,263 |
| Chr13 | 27,197,714 |
| Chr14 | 27,108,606 |
| Chr15 | 23,772,120 |
| Chr16 | 22,946,943 |
| Chr17 | 21,525,788 |
Callicarpa americana L. gene annotation summary
| Working model set | High-confidence model set | |
|---|---|---|
| No. of gene models | 67,826 | 62,993 |
| No. of loci | 36,480 | 32,164 |
| Maximum transcript length (bp) | 16,862 | 15,978 |
| Maximum CDS length (bp) | 16,269 | 15,294 |
| Mean transcript length (bp) | 2,004.6 | 2,096.2 |
| Mean CDS length (bp) | 1,305.6 | 1,355.2 |
| Mean exon length (bp) | 323.5 | 323.8 |
| Mean intron length (bp) | 500.4 | 497.4 |
| Single exon transcripts | 18,140 | 14,496 |
CDS: coding sequence.
Figure 2:Comparative genome analyses with Callicarpa americana L. A. Upset plot showing orthologous groups between C. americana and 5 other angiosperms: Amborella trichopoda Baill [40] (Amborella), Oryza sativa L. (Rice, MSU v7), Arabidopsis thaliana (L.) Heynh (Araport 11 [42]) and 2 Lamiaceae species, Tectona grandis (teak, Tectonoideae [6]) and Salvia splendens Ker Gawl. (scarlet sage; Nepetoideae [4]). Only the 30 largest intersections are shown. B. Syntenic relationship between T. grandis (teak) and C. americana (beautyberry). The upper row shows the 17 C. americana pseudomolecules with syntenic alignments to the 19 T. grandis pseudomolecules.
Figure 3:Whole-genome duplication (WGD) events inferred from the Callicarpa americana L. (beautyberry) genome. Gaussian distributions produced by mixture models in the mixtools R package [49] are shown as overlays on the KS distribution, with red or blue color-coded peaks representing putative WGD events that were either corroborated or not corroborated (i.e., false-positive results), respectively, by results from SiZer analysis ([50]; lower plot). The SiZer plot shows significant increases (blue) or decreases (red), or no significant changes (pink) across the KS distribution at various (log-transformed) bandwidths to distinguish true data features from noise.
Figure 4:Phylogenetic analysis and classification of the Callicarpa americana terpene synthase family [6]. Shown are the distinct terpene synthase gene families TPS-a through TPS-g. Highlighted in boxes are TPSs clustered in proximity on the genomic pseudomolecules. C. americana TPSs are in boldface; red stars indicate functionally characterized members of the TPS-c subfamily; dots on branches indicate bootstrap support ≥80%. The phylogeny was rooted with the bifunctional Physcomitrella patens (moss) PpCPS/EKS. Annotation of C. americana and reference TPSs are given in Supplementary Tables S7 and S8.
Figure 5:Tissue-specific expression of the Callicarpa americana L. terpene synthase gene family. Expression is in transcripts per million. TPS subfamily classification of C. americana TPSs is given in Supplementary Table S8. Red stars indicate functionally characterized members of the TPS-c subfamily.
Figure 6:Activities of functionally characterized Callicarpa americana L. TPS-c. Dotted arrows indicate putative further functionalization by Class I diTPS and cytochromes P450 to diterpene products accumulating in C. americana.
Physically clustered specialized metabolite biosynthetic pathways in Callicarpa americana L. as identified by PlantiSMASH
| Type | No. |
|---|---|
| Alkaloid | 5 |
| Lignan | 2 |
| Lignan-saccharide | 1 |
| Polyketide | 3 |
| Saccharide | 15 |
| Saccharide-polyketide | 1 |
| Saccharide-terpene | 2 |
| Terpene | 9 |
| Terpene-polyketide | 1 |
| Uncharacterized | 8 |
| Total | 47 |