| Literature DB >> 27840630 |
Xiang Shi1, Haijing Sun1, Yitai Chen2, Hongwei Pan1, Shufeng Wang1.
Abstract
Salix integra is a shrub willow native to northeastern China, Japan, Korea, and Primorsky Krai in the far southeast of Russia, and has been identified as cadmium (Cd)-accumulating trees in recent years. Although many physiological studies have been conducted with these plants, little is known about the molecular basis underlying Cd response in this plant, and this is confirmed by the very few number of gene sequences (only 39 nucleotide sequences) available in public databases. Advances in genomics for Salix are promising for future improvement in identification of new candidate genes involved in metal tolerance and accumulation. Thus, high-throughput transcriptome sequencing is essential for generating enormous transcript sequences from S. integra, especially for the purpose of Cd toxicity-responsive genes discovery. Using Illumina paired-end sequencing, approximately 60.05 million high-quality reads were obtained. De novo assembly yielded 80,105 unigenes with an average length of 703 bp, A total of 50,221 (63%) unigenes were further functionally annotated by comparing their sequences to different proteins and functional domain databases. GO annotation reveals 1849 Cd responsive genes involving in Cd binding, transport, and detoxification and cellular Cd homeostasis, and these genes were highly enriched in plant response to Cd ion and Cd ion transport. By searching against the PlantCyc database, 509 unigenes were assigned to 14 PlantCyc pathways related to Cd transport and cellular detoxification, and many of them are genes encoding heavy metal ATPases (HMAs), nature resistance-associated with microphage proteins (NRAMPs), ATP-binding cassette (ABC) transporters, etc., Comprehensive RT-qPCR analysis of these selected genes in different tissues of S. integra under the control and Cd treatment revealed metallothionein-like protein (MT2A and MT2B), Metal tolerance protein (MTP1), ABCB25, NRAMP5, and ZIP1 may be involved in the Cd transport and detoxification in leaves, while NRAMP2, ZIP8, and NRAMP5 may be related to Cd transport in roots. Our study will enrich the sequence information of S. integra in public database, and would provide some new understanding of the molecular mechanisms of heavy metal tolerance and detoxification in willows.Entities:
Keywords: Cd stress; Cd transportation; Salix integra; de novo assembly; transcription factor; willow
Year: 2016 PMID: 27840630 PMCID: PMC5083712 DOI: 10.3389/fpls.2016.01577
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Summary of Illumina transcriptome sequencing for .
| 60,047,711 | 12,128,689,157 | 45.44 | 0.17 | 90.11 |
Length distribution of assembled transcripts and unigenes.
| 0–100 | 0 | 0 |
| 100–200 | 0 | 0 |
| 200–300 | 31,775 | 27,828 |
| 300–400 | 18,250 | 14,974 |
| 400–500 | 10,899 | 7976 |
| 500–600 | 7646 | 4897 |
| 600–700 | 6207 | 3421 |
| 700–800 | 5180 | 2477 |
| 800–900 | 4841 | 2012 |
| 900–1000 | 4158 | 1513 |
| 1000–1100 | 4047 | 1377 |
| 1100–1200 | 3816 | 1185 |
| 1200–1300 | 3688 | 986 |
| 1300–1400 | 3521 | 956 |
| 1400–1500 | 3372 | 878 |
| 1500–1600 | 3239 | 804 |
| 1600–1700 | 3109 | 779 |
| 1700–1800 | 2890 | 774 |
| 1800–1900 | 2702 | 685 |
| 1900–2000 | 2620 | 658 |
| 2000–2100 | 2326 | 550 |
| 2100–2200 | 2216 | 563 |
| 2200–2300 | 2017 | 477 |
| 2300–2400 | 1883 | 444 |
| 2400–2500 | 1736 | 389 |
| 2500–2600 | 1502 | 301 |
| 2600–2700 | 1423 | 309 |
| 2700–2800 | 1282 | 277 |
| 2800–2900 | 1146 | 252 |
| 2900–3000 | 1106 | 204 |
| >3000 | 10,764 | 2159 |
| Total number | 14,9361 | 80,105 |
| Total length | 171,835,806 | 56,285,368 |
| N50 length | 1971 | 1191 |
| Mean length | 1,150.473055 | 702.6449 |
Figure 1The dependence of unigene lengths on the number of reads assembled into each unigene.
Functional annotation of the unigenes of .
| COG_annotation | 12,221 | 15.3 |
| GO_annotation | 32,570 | 40.7 |
| KEGG_annotation | 10,552 | 13.2 |
| Swissprot_annotation | 29,793 | 37.2 |
| TrEMBL_annotation | 36,923 | 46.1 |
| Nr_annotation | 37,283 | 46.5 |
| Nt_annotation | 41,823 | 52.2 |
| All_annotated | 50,221 | 62.7 |
Figure 2Species distribution in BLAST hits in Nr dababase. 36,041 BLASTX-hit unigenes were calculated.
Figure 3Clusters of orthologous groups (COG) classification. In total, 12,221 of the 80,105 unigenes with Nr hits were grouped into 25 classifications.
Figure 4Functional annotation of assembled sequences based on gene ontology (GO) categorization. The unigenes are summarized into three main categories: cellular component, molecular function and biological process.
GO annotation of unigenes related to Cd responses.
| cadmium ion transport | GO:0015691 | 82 |
| cadmium ion transmembrane transport | GO:0070574 | 17 |
| cellular cadmium ion homeostasis | GO:0006876 | 3 |
| cellular response to cadmium ion | GO:0071276 | 8 |
| detoxification of cadmium ion | GO:0071585 | 4 |
| response to cadmium ion | GO:0046686 | 1750 |
| cadmium ion binding | GO:0046870 | 27 |
| cadmium ion transmembrane transporter activity | GO:0015086 | 55 |
| cadmium-transporting ATPase activity | GO:0015434 | 1 |
| Total | 1849 |
Figure 5Cadmium transportation and detoxification pathways represented in the PlantCyc annotation of the unigenes. In total 13,973 unigenes of the 80,105 unigenes were annotated to 607 pathways.
Frequency of SSRs in .
| 1 | – | – | – | – | – | 1040 | 1345 | 0 | 2423 | 42.19 |
| 2 | – | 570 | 386 | 218 | 187 | 157 | 96 | 0 | 1614 | 28.10 |
| 3 | 911 | 430 | 254 | 24 | 0 | 0 | 0 | 0 | 1619 | 28.19 |
| 4 | 62 | 15 | 0 | 0 | 0 | 0 | 0 | 0 | 77 | 1.34 |
| 5 | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 10 | 0.17 |
| Total | 983 | 1015 | 640 | 242 | 187 | 1197 | 1441 | 38 | 5743 | |
| % | 17.12 | 17.67 | 11.14 | 4.21 | 3.26 | 20.84 | 25.09 | 0.66 | ||
Figure 6RT-qPCR analysis of 15 cadmium transportation and detoxification-related candidate unigenes in . The gene names, sequences and the primers used for RT-qPCR analysis are shown in File S1. Standard error of the mean for three repetitions (five biological replicates included) is represented by the error bars.