| Literature DB >> 28086860 |
Thomas Brockmöller1, Zhihao Ling1, Dapeng Li1, Emmanuel Gaquerel2, Ian T Baldwin1, Shuqing Xu3.
Abstract
BACKGROUND: Nicotiana attenuata (coyote tobacco) is an ecological model for studying plant-environment interactions and plant gene function under real-world conditions. During the last decade, large amounts of genomic, transcriptomic and metabolomic data have been generated with this plant which has provided new insights into how native plants interact with herbivores, pollinators and microbes. However, an integrative and open access platform that allows for the efficient mining of these -omics data remained unavailable until now. DESCRIPTION: We present the Nicotiana attenuata Data Hub (NaDH) as a centralized platform for integrating and visualizing genomic, phylogenomic, transcriptomic and metabolomic data in N. attenuata. The NaDH currently hosts collections of predicted protein coding sequences of 11 plant species, including two recently sequenced Nicotiana species, and their functional annotations, 222 microarray datasets from 10 different experiments, a transcriptomic atlas based on 20 RNA-seq expression profiles and a metabolomic atlas based on 895 metabolite spectra analyzed by mass spectrometry. We implemented several visualization tools, including a modified version of the Electronic Fluorescent Pictograph (eFP) browser, co-expression networks and the Interactive Tree Of Life (iTOL) for studying gene expression divergence among duplicated homologous. In addition, the NaDH allows researchers to query phylogenetic trees of 16,305 gene families and provides tools for analyzing their evolutionary history. Furthermore, we also implemented tools to identify co-expressed genes and metabolites, which can be used for predicting the functions of genes. Using the transcription factor NaMYB8 as an example, we illustrate that the tools and data in NaDH can facilitate identification of candidate genes involved in the biosynthesis of specialized metabolites.Entities:
Keywords: Co-expression analysis; Metabolomics; Nicotiana attenuata; Phylogenomics; Transcriptomics; genomics
Mesh:
Year: 2017 PMID: 28086860 PMCID: PMC5237228 DOI: 10.1186/s12864-016-3465-9
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Overview of data structure and utilities in the NaDH. The NaDH consists of 10 major utilities, which can be accessed by either gene ID or metabolite spectrum ID
Overview of included species for comparative genomics
| Species | Version | # of genes | URL | Reference |
|---|---|---|---|---|
|
| r2.0 | 33,449 |
| [ |
|
| r1.0 | 27,911 |
| [ |
|
| TAIR 10 | 27,416 |
| [ |
|
| v2.0 | 35,336 |
| [ |
|
| v1.0 | 21,503 |
| [ |
|
| v2.0 | 28,140 |
| [ |
|
| v3.0 | 41,335 |
| [ |
|
| ITAG2.3 | 34,727 |
| [ |
|
| v2.5.1 | 42,035 |
| [ |
|
| v3.4 | 35,119 |
| [ |
|
| Genoscope.12X | 26,346 |
| [ |
The number of detected duplication events in N. attenuata
| Duplication time | # of duplications |
|---|---|
|
| 3,929 |
| Shared among | 2,577 |
| Shared among Solanaceae | 6,760 |
| Shared with | 240 |
| Shared among core eudicots | 8,548 |
| Total | 22,054 |
A large number of duplication events is shared among all Solanaceae, which supports a shared whole-genome triplication event
Overview of RNA-Seq data
| Library ID/ | Tissue | Treatment/development stage | # of expressed genes | Additional note on sampling procedure |
|---|---|---|---|---|
| NA1498ROT | Root | Rosette stage plants, treated with 5 μL 1:1 diluted | 15,499 | Roots of rosette stage plants that were treated three times on leaves were collected for RNA extraction. The treatments were performed at 10 am and 6 pm on the day before sampling and 10 am on the day of sampling. Samples were collected at 11 am. |
| NA1500LET | Leaf | Rosette stage plants, treated with 5 μL 1:1 diluted | 12,179 | Local leaves of rosette stage plants that were treated three times on leaves were collected for RNA extraction. The treatments were performed at10 am, 6 pm on the day before sampling and 10 am on the day of sampling. Samples were collected at 11 am. |
| NA1717LEC | Leaf | Rosette stage plants, no treatment | 11,840 | Rosette stage leaves were collected for RNA extraction. Samples were collected at 11 am. |
| NA1504STT | Stem | Rosette stage plants, treated with 5 μL 1:1 diluted | 14,682 | Stems of rosette stage plants that were treated three times on leaves were collected for RNA extraction. The treatments were performed at 10 am, 6 pm on the day before sampling and 10 am on the day of sampling. Samples were collected at 11 am. |
| NA1505COE | Corolla | Early developmental stage, no treatment | 13,662 | Samples were collected in the afternoon, 60 samples were pooled. |
| NA1515COL | Corolla | Late developmental stage, no treatment | 13,486 | Samples were collected at 6 pm (open flowers) and 9 am (closed flower after opening), 4-10 samples were pooled. |
| NA1506STI | Stigma | Mature stigma, no treatment | 14,485 | Stigma samples were collected in the afternoon, 40 samples were pooled. |
| NA1507POL | Pollen tube | No treatment | 3,490 | Pollen tubes were pooled. |
| NA1508SNP | Style | Mature style without pollination | 13,492 | Styles were collected at 7 am, anthers were removed one day before, and 50 samples were pooled. |
| NA1509STO | Style | Mature style, pollinated with pollens from different genotype | 13,365 | Styles were collected at two hours after pollination, at 7 am. Anthers were removed one day before, and 30 samples were pooled. |
| NA1510STS | Style | Mature style, self-pollinated | 13,533 | Styles were collected at two hours after pollination, at 7 am. Anthers were removed one day before, and 30 samples were pooled. |
| NA1511NEC | Nectary | Mature nectary, no treatment | 12,928 | Samples were collected in the afternoon, 60 samples were pooled. |
| NA1512ANT | Anther | Mature anther no treatment | 11,550 | Samples were collected in the afternoon, 60 samples were pooled. |
| NA1513OVA | Ovary | Mature ovary, no treatment | 13,960 | Samples were collected in the afternoon, 60 samples were pooled. |
| NA1514PED | Pedicel | Mature pedicel, no treatment | 14,550 | Samples collected at 9 am (heading down) and 4 pm (heading up) were pooled. |
| NA1516OFL | Flower | Fully opened flowers, no treatment | 14,390 | Both morning (7 am) and evening (6 pm) flowers were collected, 1 sample of each were pooled. |
| NA1517FLB | Flower bud | Two early developmental stages of flowers, no treatment | 14,543 | Samples were collected at 6 pm, 1 bud and 1 middle stage flower were collected. Sepals were removed from the samples. |
| NA1501SES | Seed | Treated with liquid smoke | 9,227 | 100 mg seeds treated with 1:50 diluted liquid smoke solution for 9-15 min were used for RNA extraction. |
| NA1502SEW | Seed | Treated with water | 8,872 | 100 mg seeds treated with water for 9-15 min were used for total RNA extraction. |
| NA1503SED | Seed | Dry seeds | 8,681 | 100 mg dried seeds directly used for total RNA extraction. |
The raw reads information and methods used for generating these data are available under NCBI accession number PRJNA317743
Fig. 2Examples of visualization on co-expressed genes and metabolites. a Genes that are co-expressed with a transcription factor (TF) NaMYB8 (NIATv7_g41919) (red). Each node represents a co-expressed gene, radiating position of the node represents the most recent duplication events it has experienced and clock-wise position of the node represents the highest expression among four tissues and relative tissue specificity. b Metabolite spectra that are co-expressed with NaAT1 (NIATv7_g11614), a gene from phenolamides pathway. Color indicate different functional classes of metabolite spectra: dark green: phenolamides; pink: different 17-hydrogeranyllinalool diterpene glycosides; blue: quinate conjugates; grey: not annotated metabolite spectra. Node number shows the m/z value of the metabolite spectrum and letters refers to different isotopes. Edges indicate structure similarities among metabolite spectra: yellow: NDP based similarity, red: NL based similarity. NDP was calculated based on shared fragments and NL was calculated based on shared neutral losses
Fig. 3Example of visualizing gene expression and evolution. a Expression of NaDH29 (NIATv7_g06682) in leaves and roots of control or wounding or M. sexta oral secretion induced plants. b Presence and absence of N’,N”-dicaffeoylspermidine (DCS) [m/z: 470.2284287, retention time: 395.616992] different N. attenuata tissues. c Phylogenetic tree of 4CL gene family visualized using iTOL. Gene structure of each gene is shown on the right side. Colors refer to different species
Fig. 4Co-expressed genes and metabolite spectra of the phenolamides pathway. a Genes co-expressed with NaMYB8 (NIATv7_g41919). Each circle indicates one gene and the tree structure visualizes their phylogenetic history and the age of the duplication events: N: Nicotiana specific; S: Solanaceae specific; O: older duplication. The red color indicates that the gene is correlated with NaMYB8. The thickness of the black lines connecting NaMYB8 with other genes indicates the expression correlation coefficient. PAL: Phenylalanine ammonia-lyase; C4H: cinnamate-4-hydroxylase; C3H: coumarate-3-hydroxylase; 4CL: 4-coumarate coenzyme A ligase. b Co-expression of metabolites and genes from the phenolamides pathway. CP: N-caffeoylputrescine, CoP: N-coumaroylputrescine, MCoS: N-coumarylspermidine, MCS: N-caffeoylspermidine, CoCS: N-coumaryl, caffeoyl spermidine, DCS: N’,N”-dicaffeoylspermidine, DFS: N’,N”-di-feruloyl-spermidine, dashed line: unknown reaction, dashed box: metabolite not measurable with used mass spectrometry method. Red lines indicate correlation coefficient higher than 0.6