| Literature DB >> 27756914 |
Zi-Xin Zhang1,2,3,4, Shu-Niu Zhao1,3, Gao-Feng Liu4, Zu-Mei Huang1,3, Zhen-Mu Cao2, Shan-Han Cheng1,2,3, Shi-Sen Lin1,3.
Abstract
The Indian pepper 'Guijiangwang' (Capsicum frutescens L.), one of the world's hottest chili peppers, is rich in capsaicinoids. The accumulation of the alkaloid capsaicin and its analogs in the epidermal cells of the placenta contribute to the pungency of Capsicum fruits. To identify putative genes involved in capsaicin biosynthesis, RNA-Seq was used to analyze the pepper's expression profiles over five developmental stages. Five cDNA libraries were constructed from the total RNA of placental tissue and sequenced using an Illumina HiSeq 2000. More than 19 million clean reads were obtained from each library, and greater than 50% of the reads were assignable to reference genes. Digital gene expression (DGE) profile analysis using Solexa sequencing was performed at five fruit developmental stages and resulted in the identification of 135 genes of known function; their expression patterns were compared to the capsaicin accumulation pattern. Ten genes of known function were identified as most likely to be involved in regulating capsaicin synthesis. Additionally, 20 new candidate genes were identified related to capsaicin synthesis. We use a combination of RNA-Seq and DGE analyses to contribute to the understanding of the biosynthetic regulatory mechanism(s) of secondary metabolites in a nonmodel plant and to identify candidate enzyme-encoding genes.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27756914 PMCID: PMC5069471 DOI: 10.1038/srep34121
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1‘Guijiaowang’ fruits and capsaicinoid content of fruit Placenta at different development stage.
Summary of read mapping.
| Sample ID | Total Reads | Total Base Pairs | Total Mapped | Perfect Match | <=2 bp Mismatch | Unique Match | Multi-positionMatch | Total UnmappedReads |
|---|---|---|---|---|---|---|---|---|
| 10DAP-1 | 20,002,310 | 980,113,190 | 11,056,527 | 7,793,992 | 3,262,535 | 10,612,172 | 444,355 | 8,945,783 |
| (100.00%) | (100.00%) | (55.28%) | (38.97%) | (16.31%) | (53.05%) | (2.22%) | (44.72%) | |
| 10DAP-2 | 22,664,771 | 1,110,573,779 | 12,614,493 | 8,454,100 | 4,160,393 | 12,085,566 | 528,927 | 10,050,278 |
| (100.00%) | (100.00%) | (55.66%) | (37.30%) | (18.36%) | (53.32%) | (2.33%) | (44.34%) | |
| 20DAP-1 | 20,270,262 | 993,242,838 | 10,940,658 | 7,719,242 | 3,221,416 | 10,554,431 | 386,227 | 9,329,604 |
| (100.00%) | (100.00%) | (53.97%) | (38.08%) | (15.89%) | (52.07%) | (1.91%) | (46.03%) | |
| 20DAP-2 | 26,549,007 | 1,300,901,343 | 14,776,901 | 9,918,207 | 4,858,694 | 14,212,830 | 564,071 | 11,772,106 |
| (100.00%) | (100.00%) | (55.66%) | (37.36%) | (18.30%) | (53.53%) | (2.1 2%) | (44.34%) | |
| 30DAP-1 | 22,672,894 | 1,110,971,806 | 1,805,213 | 8,326,551 | 3,478,662 | 11,367,776 | 437,437 | 10,867,681 |
| (100.00%) | (100.00%) | (52.07%) | (36.72%) | (15.34%) | (50.1 4%) | (1.93%) | (47.93%) | |
| 30DAP-2 | 26,246,243 | 1,286,065,907 | 14,341,495 | 9,566,924 | 4,774,571 | 13,774,887 | 566,608 | 11,904,748 |
| (100.00%) | (100.00%) | (54.64%) | (36.45%) | (18.19%) | (52.48%) | (2.16%) | (45.36%) | |
| 40DAP-1 | 19,486,774 | 954,851,926 | 10,985,869 | 7,756,014 | 3,229,855 | 10,546,443 | 439,426 | 8,500,905 |
| (100.00%) | (100.00%) | (56.38%) | (39.80%) | (16.57%) | (54.1 2%) | (2.25%) | (43.62%) | |
| 40DAP-2 | 22,990,103 | 1,126,515,047 | 12,605,059 | 8,506,821 | 4,098,238 | 12,110,211 | 494,848 | 10,385,044 |
| (100.00%) | (100.00%) | (54.83%) | (37.00%) | (17.83%) | (52.68%) | (2.1 5%) | (45.17%) | |
| 50DAP-1 | 19,169,737 | 939,317,113 | 10,307,130 | 7,254,746 | 3,052,384 | 9,976,232 | 330,898 | 8,862,607 |
| (100.00%) | (100.00%) | (53.77%) | (37.84%) | (15.92%) | (52.04%) | (1.73%) | (46.23%) | |
| 50DAP-2 | 25,828,212 | 1,265,582,388 | 14,090,274 | 9,488,068 | 4,602,206 | 13,587,872 | 502,402 | 11,737,938 |
| (100.00%) | (100.00%) | (54.55%) | (36.74%) | (17.82%) | (52.61%) | (1.95%) | (45.45%) |
Figure 2Sequencing saturation analysis of ten sequenced samples.
Detected number of genes with at least one read per library.
| Sample ID | 10DAP-1 | 10DAP-2 | 20DAP-1 | 20DAP-2 | 30DAP-1 | 30DAP-2 | 40DAP-1 | 40DAP-2 | 50DAP-1 | 50DAP-2 | all |
|---|---|---|---|---|---|---|---|---|---|---|---|
| number of genes | 2369 | 23806 | 23067 | 23678 | 23359 | 23394 | 21939 | 21851 | 21812 | 22617 | 28434 |
| union number of genes | 25137 | 24954 | 24891 | 23564 | 23885 |
Figure 3Venn diagram showing the number of gene detected in five different development stages.
Each set represent a stage of development. Numbers in each intersection represent the number of genes detected with at least one read (gene tag) in these disjoint sets (intersections).
Figure 4Statistic chart of DEG among the different development stages.
The number of up-regulated and down-regulated genes between 10 DAP and 20 DAP; 20 DAP and 30DAP; 30 DAP and 40 DAP; 40DAP AND 50DAP are summarized.
Figure 5Distribution of genes coverage from ten sequenced samples.
Figure 6Veen diagram show the number of DGEs identified between consecutive time points.
Figure 7Gene Ontology enrichment analysis of DEGs.
Y-axis (right) represents the number of the genes in each category; Y-axis (left) represents the percentage of a specific category of the genes within that main category.
Figure 8Putative modles of capsaicin biosynthetic pathway.
Figure 9Hierarchical cluster and heat map analysis of 79 capsaicin synthesis genes.
Gray bar indicated that the gene expression is too low to detect.
Figure 10Molecular phylogenetic tree of the amino acid sequences of 21 acyltransferase genes.
20 new candidated genes related to capsaicin synthesis.
| GENE ID | Length | 10DAP_RPKM | 20DAP_RPKM | 30DAP_RPKM | 40DAP_RPKM | 50DAP-1_RPKM | Gene annotation |
|---|---|---|---|---|---|---|---|
| CA10g05810 | 153 | 0.270897565 | 4.632356898 | 32.10711615 | 79.94950869 | 3.258065086 | — |
| CA06g15270 | 1329 | 0.124747411 | 0 | 69.05206492 | 72.28247366 | 0.440541056 | deacetylvindoline O-acetyltransferase-like |
| CA06g14200 | 1251 | 3.50043523 | 4.214128214 | 58.73589544 | 94.95480662 | 2.130973175 | flavin monooxygenase-like protein |
| CA02g15930 | 1512 | 54.61459812 | 33.68706994 | 133.6180502 | 157.7851836 | 0 | probable allantoinase 1-like isoform 1 |
| CA00g86910 | 429 | 0.206619196 | 3.882277667 | 62.5625232 | 111.4971353 | 21.3271175 | primary amine oxidase |
| CA12g06180 | 1365 | 0 | 0 | 3.111355076 | 62.78991238 | 6.689178696 | UDP-glucose:glucosyltransferase |
| CA10g13930 | 1083 | 3.146542825 | 19.44624331 | 278.2872464 | 277.5108439 | 33.72695386 | oxidoreductase |
| CA07g19010 | 996 | 2.420185835 | 0 | 85.37430203 | 85.20375455 | 1.509002015 | heterodimeric geranylgeranyl pyrophosphatesynthase small subunit |
| CA04g13110 | 1200 | 6.322755235 | 22.24629044 | 22.38486546 | 133.9452759 | 0 | alcohol dehydrogenase 1-like |
| CA05g05650 | 573 | 1.549090431 | 0 | 0 | 388.6058543 | 54.8225296 | hypothetical protein PRUPE_ppa015802mg |
| CA04g17720 | 759 | 2.36921944 | 0 | 0 | 376.0728045 | 50.5589510 | conserved hypothetical protein |
| CA02g04610 | 678 | 2.441634465 | 1.235262568 | 28.20991203 | 46.09254969 | 0 | glutathione S-transferase parA |
| CA08g13530 | 2217 | 23.94262013 | 21.88566736 | 28.89121321 | 135.5366216 | 0 | ferric reduction oxidase 7 |
| CA01g18250 | 972 | 5.857889906 | 2.000742733 | 1.035359393 | 50.71081276 | 0 | bifunctional epoxide hydrolase 2-like |
| CA01g05600 | 1131 | 0.794976815 | 1.161030877 | 1.339644492 | 63.33405326 | 0 | inositol 2-dehydrogenase/D-chiro-inositol 3-dehydrogenase-like |
| CA03g22430 | 831 | 0.049876447 | 0.042403273 | 0.227908833 | 14.87811901 | 0.482963625 | trans-resveratrol di-O-methyltransferase-like |
| CA04g21310 | 1053 | 0.168356382 | 0.234244861 | 11.58326028 | 27.8063755 | 1.90267745 | uncharacterized PKHD-type hydroxylase |
| CA07g04930 | 336 | 0.387163579 | 3.863877137 | 31.75933895 | 104.5324016 | 4.871828722 | — |
| CA01g11490 | 690 | 1.901970416 | 1.543666686 | 41.12433866 | 65.65691205 | 0.562322006 | 26.5 kDa heat shock protein |
| CA02g02940 | 1251 | 2.456964293 | 0 | 76.42204095 | 45.73067326 | 1.748725536 | protein ASPARTIC PROTEASE IN GUARD CELL 2-like |
Nucleotide sequences of primers used in real-time quantitative polymerase chain reaction.
| Gene ID | Name | Forward primer(5′-3′) | Reverse primer(5′-3′) | Size (bp) |
|---|---|---|---|---|
| CA04g06670 | Actin | TGTTGGACTCTGGTGATGGTGTG | CCGTTCAGCCGTGGTGGTG | 164 |
| CA05g20790 | PAL | AACTGCTCAACAACAACATTACC | CATCCAACACTTCTCCGTTAGG | 150 |
| CA03g19580 | NADH-GOGAT | TCCAGTCAGGTTGAAGAAGAGAAG | CAGATTGTGTCGTGAGGAGAGG | 187 |
| CA06g25940 | C4H | ACGGTGAGCATTGGAGGAAGATG | CCATTCGTCGCAGATTCAGGATTC | 148 |
| CA03g30250 | HCT | ACTTGTATGGCACGAGGACTAGC | GCAACAGGAGTGGCAGTGAATATC | 134 |
| CA08g09680 | C3H | GGCAATGGCTGAGGTGATC | GTTGGCGTTGGCTCTGTG | 196 |
| CA02g14470 | CCoAOMT | ATCGGGAGAACTATGAGATTGGTC | TTGTCCTTGTCAGCGTCCAC | 166 |
| CA02g19270 | AT1 | AGTTCCTTCACCAAGATTCGTAGG | TCCTGAGTCTGCTGACACCTTAG | 169 |
| CA01g32960 | AT3 | CCATCAAACCACCATGTCCATACC | TCAGTCCGTAGGCATCCATCATAG | 168 |
| CA02g04610 | GST | ACTTTGGTGGTGATGACAATTTGG | GTGAGGATGAGGAAGGGAGTTTG | 182 |
Figure 11Real-time quantitative validation of digital genes expression data.