| Literature DB >> 24999345 |
Ya-Fen Lin1, Edouard I Severing2, Bas Te Lintel Hekkert3, Elio Schijlen3, Mark G M Aarts1.
Abstract
Noccaea caerulescens is an extremophile plant species belonging to the Brassicaceae family. It has adapted to grow on soils containing high, normally toxic, concentrations of metals such as nickel, zinc, and cadmium. Next to being extremely tolerant to these metals, it is one of the few species known to hyperaccumulate these metals to extremely high concentrations in their aboveground biomass. In order to provide additional molecular resources for this model metal hyperaccumulator species to study and understand the mechanism of adaptation to heavy metal exposure, we aimed to provide a comprehensive database of transcript sequences for N. caerulescens. In this study, 23,830 transcript sequences (isotigs) with an average length of 1025 bp were determined for roots, shoots and inflorescences of N. caerulescens accession "Ganges" by Roche GS-FLEX 454 pyrosequencing. These isotigs were grouped into 20,378 isogroups, representing potential genes. This is a large expansion of the existing N. caerulescens transcriptome set consisting of 3705 unigenes. When translated and compared to a Brassicaceae proteome set, 22,232 (93.2%) of the N. caerulescens isotigs (corresponding to 19,191 isogroups) had a significant match and could be annotated accordingly. Of the remaining sequences, 98 isotigs resembled non-plant sequences and 1386 had no significant similarity to any sequence in the GenBank database. Among the annotated set there were many isotigs with similarity to metal homeostasis genes or genes for glucosinolate biosynthesis. Only for transcripts similar to Metallothionein3 (MT3), clear evidence for an additional copy was found. This comprehensive set of transcripts is expected to further contribute to the discovery of mechanisms used by N. caerulescens to adapt to heavy metal exposure.Entities:
Keywords: Brassicaceae; cadmium; gene expression; metal hyperaccumulation; metal hypertolerance; phytoremediation; zinc
Year: 2014 PMID: 24999345 PMCID: PMC4064536 DOI: 10.3389/fpls.2014.00261
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Summary of the .
| Transcript assembly | Number |
|---|---|
| Total reads | 834,911 |
| Total bases | 3*108 |
| Percentage aligned reads | 86.24 |
| No. assembled reads | 638,451 |
| No. isotigs | 23,836 |
| No. bases | 24,420,501 |
| Average isotig size | 1025 |
| N50 isotig size | 1185 |
| Largest isotig size | 6573 |
| No. isogroups | 20,378 |
| Average no. isotigs/isogroup | 1.2 |
| Largest isotig count | 77 |
| No. isogroups with one isotig | 18,052 |
| Q40 Plus bases | 97.3 % |
The Newbler software was used to assemble raw sequence reads into isotigs (representing putative transcripts) and isogroups (representing putative genes). N50 is the length for which the collection of all isotigs of that length or longer contains at least half of the total of the lengths of the isotigs. Quality value 40 (Q40) means the error rate is below 1 in 10,000 nucleotides.
Figure 1Classification of The number of best-hits of N. caerulescens isotigs to the Brassicaceae proteome (comprising those of Thellungiella (Eutrema) halophila, Arabidopsis thaliana, Arabidopsis lyrata, Brassica rapa, and Capsella rubella) are listed. The 1604 isotigs without any significant hit to Brassicaceae proteome were further compared to the NCBI NR protein database, identifying best-hit similarities to 33 proteins of plant origin and 98 of non-plant origin. The non-hit isotigs were further matched to the available genome sequences of the listed Brassicaceae reference species, identifying another 74 best-hit similarities. Finally, a set of 1399 isotigs remained for which no protein similarity could be found. (B) Venn diagram showing the number of genes in each of the five reference species that correspond to the best-hit in that species for each N. caerulescens isotig sequence.
.
| isogroup05302 | 1413 | Carubv10005538m|PACid:20894480 | 1.49E-152 | AT4G35040 | Basic-leucine zipper (bZIP) transcription factor | |
| isogroup16405 | 559 | AT4G19690.2|PACid:19648297 | 1.31E-110 | AT4G19690 | Iron-regulated transporter 1 | |
| isogroup00440 | 1428 | Thhalv10023475m|PACid:20201432 | 0 | AT1G60960 | Iron regulated transporter 3 | |
| isogroup00441 | 1341 | Carubv10015186m|PACid:20899838 | 2.19E-110 | AT3G12750 | Zinc transporter 1 precursor | |
| isogroup07896 | 1085 | Thhalv10013963m|PACid:20204020 | 0 | AT5G59520 | ZRT/IRT-like protein 2 | |
| isogroup06106 | 1285 | AT1G10970.1|PACid:19650740 | 0 | AT1G10970 | Zinc transporter 4 precursor | |
| isogroup06071 | 1293 | Thhalv10016885m|PACid:20179820 | 5.49E-180 | AT2G30080 | ZIP metal ion transporter family | |
| isogroup09981 | 918 | Thhalv10025584m|PACid:20196090 | 1.39E-169 | AT4G33020 | ZIP metal ion transporter family | |
| isogroup06264 | 1263 | Thhalv10007930m|PACid:20188086 | 1.39E-165 | AT1G31260 | Zinc transporter 10 precursor | |
| isogroup09047 | 988 | AT1G55910.1|PACid:19649515 | 0 | AT1G55910 | Zinc transporter 11 precursor | |
| isogroup00207 | 2104 | AT2G46800.2|PACid:19641115 | 2.92E-85 | AT2G46800 | Zinc transporter of Arabidopsis thaliana | |
| isogroup01278 | 1466 | Thhalv10020889m|PACid:20182053 | 0 | AT3G12100 | Cation efflux family protein | |
| isogroup10156 | 907 | Thhalv10001431m|PACid:20189077 | 9.05E-167 | AT2G47830 | Cation efflux family protein | |
| isogroup03594 | 1841 | Thhalv10011469m|PACid:20184771 | 0 | AT1G51610 | Cation efflux family protein | |
| isogroup05294 | 1416 | Thhalv10006008m|PACid:20190151 | 0 | AT3G58060 | Cation efflux family protein | |
| isogroup07586 | 1117 | 497437|PACid:16043531 | 0 | AT2G39450 | Cation efflux family protein | |
| isogroup17157 | 526 | Thhalv10027647m|PACid:20189919 | 6.90E-11 | AT2G04620 | Cation efflux family protein | |
| isogroup00017 | 2604 | Thhalv10018389m|PACid:20191829 | 0 | AT1G80830 | Natural resistance-associated macrophage protein 1 | |
| isogroup18133 | 488 | Thhalv10011381m|PACid:20184824 | 6.26E-43 | AT1G47240 | NRAMP metal ion transporter 2 | |
| isogroup03577 | 1845 | Bra000573|PACid:22711711 | 0 | AT2G23150 | Natural resistance-associated macrophage protein 3 | |
| isogroup06857 | 1197 | Thhalv10004033m|PACid:20199053 | 0 | AT5G67330 | Natural resistance associated macrophage protein 4 | |
| isogroup13240 | 720 | Thhalv10026903m|PACid:20193643 | 1.53E-159 | AT4G18790 | NRAMP metal ion transporter family protein | |
| isogroup05270 | 1417 | AT5G03280.1|PACid:19671290 | 0 | AT5G03280 | NRAMP metal ion transporter family protein | |
| isogroup05627 | 1367 | Thhalv10012456m|PACid:20203745 | 0 | AT5G03280 | NRAMP metal ion transporter family protein | |
| isogroup11803 | 801 | Thhalv10012456m|PACid:20203745 | 7.09E-37 | AT5G03280 | NRAMP metal ion transporter family protein | |
| isogroup17702 | 504 | Carubv10004142m|PACid:20894021 | 1.03E-48 | AT4G37270 | Heavy metal atpase 1 | |
| isogroup03139 | 2048 | 490893|PACid:16045505 | 0 | AT4G37270 | Heavy metal atpase 1 | |
| isogroup08856 | 1001 | Thhalv10024341m|PACid:20194930 | 3.36E-139 | AT4G30110 | Heavy metal atpase 2 | |
| isogroup10109 | 910 | Bra024107|PACid:22710111 | 4.02E-103 | Bra024107 | Heavy metal atpase 2 | |
| isogroup02580 | 2745 | 857597|PACid:16066225 | 0 | Bra011165 | Heavy metal atpase 2 | |
| isogroup00021 | 4417 | Carubv10012822m|PACid:20897962 | 0 | AT2G19110 | Heavy metal atpase 4 | |
| isogroup07162 | 1157 | AT4G33520.1|PACid:19645606 | 1.41E-96 | AT4G33520 | P-type ATP-ase 1 | |
| isogroup11167 | 839 | Thhalv10000758m|PACid:20202842 | 3.25E-166 | AT5G44790 | Copper-exporting ATPase (RAN1) | |
| isogroup02963 | 2165 | Carubv10000184m|PACid:20908511 | 0 | AT5G21930 | P-type ATPase of Arabidopsis 2 | |
| isogroup02761 | 2382 | Bra013764|PACid:22706007 | 0 | Bra013764 | YELLOW STRIPE like 1 | |
| isogroup01006 | 2083 | Thhalv10012886m|PACid:20203959 | 0 | AT5G53550 | YELLOW STRIPE like 3 | |
| isogroup03348 | 1934 | Thhalv10020160m|PACid:20182600 | 0 | AT3G17650 | YELLOW STRIPE like 5 | |
| isogroup00783 | 2244 | Thhalv10003758m|PACid:20198225 | 0 | AT3G27020 | YELLOW STRIPE like 6 | |
| isogroup17698 | 497 | Thhalv10018216m|PACid:20192307 | 1.52E-84 | AT1G65730 | YELLOW STRIPE like 7 | |
| isogroup12528 | 757 | Thhalv10018216m|PACid:20192307 | 1.23E-138 | AT1G65730 | YELLOW STRIPE like 7 | |
| isogroup05885 | 1322 | Thhalv10013388m|PACid:20203275 | 8.58E-175 | AT5G13740 | Zinc induced facilitator 1 | |
| isogroup13593 | 697 | Bra008805|PACid:22694599 | 3.84E-111 | AT5G13750 | Zinc induced facilitator-like 1 | |
| isogroup14641 | 646 | Thhalv10013417m|PACid:20206568 | 1.30E-136 | AT5G13750 | Zinc induced facilitator-like 1 | |
| isogroup02224 | 478 | Bra021053|PACid:22721993 | 1.81E-60 | AT3G43790 | Zinc induced facilitator-like 2 | |
| isogroup08493 | 1027 | Bra021053|PACid:22721993 | 3.28E-139 | AT3G43790 | Zinc induced facilitator-like 2 | |
| isogroup15107 | 622 | Thhalv10014123m|PACid:20206622 | 3.97E-119 | AT5G04950 | Nicotianamine synthase 1 | |
| isogroup06206 | 1273 | Thhalv10013812m|PACid:20203757 | 0 | AT5G56080 | Nicotianamine synthase 2 | |
| isogroup09071 | 987 | Thhalv10008206m|PACid:20187253 | 0 | AT1G09240 | Nicotianamine synthase 3 | |
| isogroup06921 | 1184 | Thhalv10024166m|PACid:20201706 | 0 | AT1G56430 | Nicotianamine synthase 4 | |
| isogroup04168 | 1656 | 473263|PACid:16034489 | 1.04E-12 | AT1G30400 | Multidrug resistance-associated protein 1 | |
| isotig01105 | 1975 | AT1G30400.2|PACid:19655137 | 3.18E-06 | AT1G30400 | Multidrug resistance-associated protein 1 | |
| isogroup11349 | 828 | Thhalv10006546m|PACid:20187496 | 6.64E-96 | AT1G30400 | Multidrug resistance-associated protein 1 | |
| isotig01102 | 4514 | Thhalv10016133m|PACid:20181262 | 6.34E-06 | AT2G34660 | Multidrug resistance-associated protein 2 | |
| isogroup13185 | 724 | 482430|PACid:16055424 | 3.02E-83 | AT2G34660 | Multidrug resistance-associated protein 2 | |
| isogroup17760 | 504 | Thhalv10019893m|PACid:20181977 | 4.35E-23 | AT3G13080 | Multidrug resistance-associated protein 3 | |
| isogroup02973 | 2159 | Thhalv10019893m|PACid:20181977 | 0 | AT3G13080 | Multidrug resistance-associated protein 3 | |
| isogroup08374 | 1033 | AT3G13080.1|PACid:19658374 | 3.01E-12 | AT3G13080 | Multidrug resistance-associated protein 3 | |
| isotig00322 | 5401 | Bra018722|PACid:22695420 | 0 | AT2G47800 | Multidrug resistance-associated protein 4 | |
| isotig00323 | 5294 | Bra018722|PACid:22695420 | 0 | AT2G47800 | Multidrug resistance-associated protein 4 | |
| isotig00324 | 5063 | Bra018722|PACid:22695420 | 0 | AT2G47800 | Multidrug resistance-associated protein 4 | |
| isotig00326 | 4956 | Bra018722|PACid:22695420 | 0 | AT2G47800 | Multidrug resistance-associated protein 4 | |
| isogroup02478 | 3126 | Carubv10008087m|PACid:20891448 | 0 | AT1G04120 | Multidrug resistance-associated protein 5 | |
| isogroup10208 | 901 | Thhalv10006549m|PACid:20186680 | 0 | AT1G04120 | Multidrug resistance-associated protein 5 | |
| isogroup14209 | 665 | Thhalv10006549m|PACid:20186680 | 3.34E-38 | AT1G04120 | Multidrug resistance-associated protein 5 | |
| isogroup15243 | 615 | Thhalv10019895m|PACid:20181972 | 3.67E-53 | AT3G21250 | Multidrug resistance-associated protein 6 | |
| isogroup07630 | 1109 | Thhalv10019895m|PACid:20181972 | 3.29E-07 | AT3G21250 | Multidrug resistance-associated protein 6 | |
| isogroup20180 | 277 | Bra034706|PACid:22686665 | 9.37E-37 | AT3G13100 | Multidrug resistance-associated protein 7 | |
| isogroup12703 | 742 | Thhalv10019894m|PACid:20183109 | 1.81E-11 | AT3G13100 | Multidrug resistance-associated protein 7 | |
| isogroup14335 | 661 | Bra034706|PACid:22686665 | 3.50E-136 | AT3G13100 | Multidrug resistance-associated protein 7 | |
| isogroup04767 | 1517 | Bra003402|PACid:22702084 | 6.74E-14 | AT3G60160 | Multidrug resistance-associated protein 9 | |
| isogroup09719 | 937 | 486482|PACid:16051413 | 7.50E-07 | AT3G60160 | Multidrug resistance-associated protein 9 | |
| isotig00325 | 5040 | Thhalv10005741m|PACid:20190810 | 0 | AT3G62700 | Multidrug resistance-associated protein 10 | |
| isotig00327 | 4933 | Thhalv10005741m|PACid:20190810 | 0 | AT3G62700 | Multidrug resistance-associated protein 10 | |
| isogroup15495 | 603 | AT3G59140.1|PACid:19663037 | 6.18E-119 | AT3G59140 | Multidrug resistance-associated protein 14 | |
| isogroup18407 | 480 | 486382|PACid:16034938 | 5.72E-91 | AT3G59140 | Multidrug resistance-associated protein 14 | |
| isogroup15392 | 608 | AT1G14870.1|PACid:19649994 | 1.51E-88 | AT1G14870 | Plant Cadmium Resistance 2 | |
| isogroup15934 | 581 | Bra026796|PACid:22713063 | 1.43E-111 | AT1G14870 | Plant Cadmium Resistance 2 | |
| isogroup19618 | 414 | Thhalv10019897m|PACid:20182879 | 1.37E-66 | AT3G16340 | Pleiotropic drug resistance 1 | |
| isogroup02600 | 2692 | Thhalv10019897m|PACid:20182879 | 8.02E-38 | AT3G16340 | Pleiotropic drug resistance 1 | |
| isogroup14595 | 649 | AT2G29940.1|PACid:19639436 | 6.90E-12 | AT2G29940 | Pleiotropic drug resistance 3 | |
| isogroup02662 | 2535 | Thhalv10001880m|PACid:20200369 | 4.23E-33 | AT2G26910 | Pleiotropic drug resistance 4 | |
| isogroup03404 | 1906 | Thhalv10001880m|PACid:20200369 | 3.89E-17 | AT2G26910 | Pleiotropic drug resistance 4 | |
| isogroup08926 | 999 | 870083|PACid:16064851 | 3.50E-09 | AT2G37280 | Pleiotropic drug resistance 5 | |
| isogroup11286 | 836 | Thhalv10016141m|PACid:20179582 | 4.60E-08 | AT2G37280 | Pleiotropic drug resistance 5 | |
| isogroup07295 | 1148 | AT2G36380.1|PACid:19640612 | 6.09E-16 | AT2G36380 | Pleiotropic drug resistance 6 | |
| isogroup11080 | 843 | Thhalv10016140m|PACid:20180787 | 2.61E-09 | AT2G36380 | Pleiotropic drug resistance 6 | |
| isogroup13734 | 691 | Thhalv10016140m|PACid:20180787 | 2.58E-10 | AT2G36380 | Pleiotropic drug resistance 6 | |
| isogroup01130 | 1897 | 471713|PACid:16040487 | 4.48E-14 | AT1G15210 | Pleiotropic drug resistance 7 | |
| isogroup12430 | 764 | Carubv10012050m|PACid:20892263 | 2.05E-11 | AT1G15210 | Pleiotropic drug resistance 7 | |
| isogroup14785 | 638 | Bra026157|PACid:22695455 | 4.48E-126 | Bra026157 | Pleiotropic drug resistance 7 | |
| isogroup14778 | 637 | Bra003137|PACid:22699106 | 1.69E-99 | AT3G53480 | Pleiotropic drug resistance 9 | |
| isogroup01205 | 1707 | Thhalv10016611m|PACid:20180411 | 0 | AT2G38170 | Cation exchanger 1 | |
| isotig04417 | 925 | Thhalv10020744m|PACid:20182084 | 3.06E-164 | AT3G13320 | Cation exchanger 2 | |
| isotig01832 | 974 | Thhalv10020744m|PACid:20182084 | 9.22E-94 | AT3G13320 | Cation exchanger 2 | |
| isogroup16996 | 532 | Bra009640|PACid:22694292 | 5.20E-82 | AT5G01490 | Cation exchanger 4 | |
| isotig04418 | 682 | Thhalv10011487m|PACid:20184135 | 1.59E-79 | AT1G55730 | Cation exchanger 5 | |
| isotig01833 | 876 | Carubv10009154m|PACid:20889697 | 3.20E-93 | AT1G55730 | Cation exchanger 5 | |
| isogroup07282 | 1146 | 488652|PACid:16035533 | 3.74E-179 | AT5G17860 | Calcium exchanger 7 | |
| isogroup15879 | 583 | 488652|PACid:16035533 | 4.17E-79 | AT5G17860 | Calcium exchanger 7 | |
| isogroup06161 | 1278 | 897667|PACid:16066029 | 0 | AT3G14070 | Cation exchanger 9 | |
| isotig03039 | 1670 | Thhalv10003194m|PACid:20208031 | 0 | AT5G44070 | Phytochelatin synthase 1 (PCS1) | |
| isotig03040 | 1095 | Thhalv10007644m|PACid:20187384 | 0 | AT1G03980 | Phytochelatin synthase 2 | |
| isogroup01808 | 910 | Bra029765|PACid:22685892 | 6.28E-24 | Bra029765 | Metallothionein 2A | |
| isogroup17171 | 526 | Thhalv10015167m|PACid:20205299 | 1.01E-16 | AT3G09390 | Metallothionein 2A | |
| isogroup17490 | 513 | Thhalv10021853m|PACid:20182985 | 4.64E-24 | AT3G15353 | Metallothionein 3 | |
| isogroup19215 | 444 | Thhalv10021853m|PACid:20182985 | 6.02E-27 | AT3G15353 | Metallothionein 3 | |
| isogroup16374 | 560 | Thhalv10019396m|PACid:20192145 | 6.62E-42 | AT5G44420 | Plant defensin 1.2 | |
| isogroup19505 | 425 | Bra015811|PACid:22701509 | 5.12E-40 | AT5G44420 | Plant defensin 1.2 |
Gene class abbreviation are: bZIP, basic-leucine zipper; ZIP/IRT, ZRT/IRT-like Protein; MTP, Metal Tolerance Protein; NRAMP, Natural Resistance-Associated Macrophage Protein; HMA, Heavy Metal ATPase; YSL, Yellow Stripe Like protein; ZIF, Zinc Induced Facilitator; NAS, Nicotianamine Synthase; MRP/ABCC, Multidrug Resistance-associated Protein; PCR, Plant Cadmium Resistance; PDR, Pleiotropic Drug Resistance Protein; CAX, Calcium exchanger; PCS, Phytochelatin Synthase; MT, Metallothionein; PDF, Plant Defensin.
These isotigs were both grouped into isogroup00190.
These isotigs were all grouped into isogroup00027.
These isotigs were both grouped into isogroup01669.
These isotigs were both grouped into isogroup00399.
These isotigs were both grouped into isogroup00980.
This isogroup was found to be most similar to AtHMA3.
Figure 2Gene Ontology (GO) classification of . N. caerulescens isogroups were classified into three major functional GO groups: Biological Process (A), Cellular Component (B), and Molecular Function (C) and subsequently sub-classified as indicated.
Figure 3Phylogenetic comparison of . Maximum Likelihood trees, showing the longest isotigs (indicated in red) belonging to different isogroups of the ZRT/IRT-like Protein (ZIP) (A) and Metal Tolerance Protein (MTP) gene families (B) as identified in Table 2, and compared at amino acid sequence level to A. thaliana genes to indicate the most likely orthologs. When all proteins listed in Table 2 were manually examined for N. caerulescens specific gene duplications, this appeared to be the case only for Metallothionein 3 (MT3). The Neighbor Joining tree is shown in (C), displaying the comparison of nucleotide sequences of MT3 isotigs to most similar genes of the previously used five Brassicaceae reference gene sets, indicates that N. caerulescens expresses an additional gene copy. Note that all branches in the Maximum Likelihood and Neighbor Joining trees with 50% or less bootstrap support were collapsed.