Literature DB >> 27995074

De novo transcriptome assembly of the mycoheterotrophic plant Monotropa hypopitys.

Alexey V Beletsky1, Mikhail A Filyushin1, Eugeny V Gruzdev1, Alexander M Mazur1, Egor B Prokhortchouk1, Elena Z Kochieva1, Andrey V Mardanov1, Nikolai V Ravin1, Konstantin G Skryabin1.   

Abstract

Monotropa hypopitys (pinesap) is a non-photosynthetic obligately mycoheterotrophic plant of the family Ericaceae. It obtains the carbon and other nutrients from the roots of surrounding autotrophic trees through the associated mycorrhizal fungi. In order to understand the evolutionary changes in the plant genome associated with transition to a heterotrophic lifestyle, we performed de novo transcriptomic analysis of M. hypopitys using next-generation sequencing. We obtained the RNA-Seq data from flowers, flower bracts and roots with haustoria using Illumina HiSeq2500 platform. The raw data obtained in this study can be available in NCBI SRA database with accession number of SRP069226. A total of 10.3 GB raw sequence data were obtained, corresponding to 103,357,809 raw reads. A total of 103,025,683 reads were filtered after removing low-quality reads and trimming the adapter sequences. The Trinity program was used to de novo assemble 98,349 unigens with an N50 of 1342 bp. Using the TransDecoder program, we predicted 43,505 putative proteins. 38,416 unigenes were annotated in the Swiss-Prot protein sequence database using BLASTX. The obtained transcriptomic data will be useful for further studies of the evolution of plant genomes upon transition to a non-photosynthetic lifestyle and the loss of photosynthesis-related functions.

Entities:  

Keywords:  Monotropa hypopitys; Parasitic plant; RNA-Seq; Transcriptome

Year:  2016        PMID: 27995074      PMCID: PMC5154972          DOI: 10.1016/j.gdata.2016.11.020

Source DB:  PubMed          Journal:  Genom Data        ISSN: 2213-5960


Specifications

Direct link to deposited data

https://www.ncbi.nlm.nih.gov/sra/SRP069226.

Introduction

Monotropa hypopitys (pinesap) is a non-photosynthetic obligately mycoheterotrophic plant of the family Ericaceae, subfamily Monotropoideae [1]. It obtains the carbon and other nutrients from the roots of surrounding autotrophic trees through the associated mycorrhizal fungi [2]. The above-ground part of M. hypopitys plant is up to 35 cm tall unbranched adventitious raceme-like inflorescence with several flowers on the top and scale-like flower bracts which cover most of the inflorescence. Short roots are invested by a net of sheathing mycorrhizas forming haustorium-like structures used to attach the fungal partner. Transition to a heterotrophic lifestyle and the loss of photosynthesis is expected to relax the selective pressure on photosynthetic machinery which becomes unnecessary. The most evident consequence is a reduction in the size and gene content of the chloroplast genome, which correlates with the loss of genes related to photosynthesis [3], [4], [5]. Recently we sequenced the chloroplast genome of M. hypopitys and found that it is highly reduced in size and lacks genes encoding NADH dehydrogenase, photosynthesis-related proteins, the plastid-encoded RNA polymerase and some other functions [6], [7]. Transition to heterotrophy is expected to be associated also with the changes in the nuclear genome of a parasitic plant since it encodes a number of functions related to autotrophic lifestyle. However, these changes are poorly known and limited to few studies of transcriptomes of parasitic plants (e.g. [8], [9], [10]). In this study we carried out sequencing and de novo assembly of transcriptome for M. hypopitys.

Experimental design, materials and methods

Plant materials

Several Monotropa hypopitys plants, including the underground parts with a piece of soil, were collected in the forest in Kaluga region, Russia and quickly transported to the laboratory for isolation of nucleic acids.

RNA isolation, library preparation, and sequencing

Fresh flowers and leaves (often considered as flower bracts) of two individual Monotropa hypopitys plants and two samples of pooled roots with haustoria were used for RNA isolation. Roots were carefully rinsed in water to eliminate attached soil immediately before RNA extraction. Total RNA was isolated from ~ 300 mg tissue for each six samples using the RNeasy Plant Mini kit (Qiagen, Valencia, CA). mRNA library preparation was performed using a NEBNext® mRNA Library Prep Reagent Set for Illumina® according to the manufacturer's instructions (New England BioLabs Inc., Ipswich, MA, USA). The libraries were sequenced by MyGene Co. (Moscow, Russia) using the HiSeq 2500 platform. About 17 million of 100-bp single end reads were generated for each sample.

De novo transcriptome assembly, identification of protein coding regions, and annotation

We obtained a total of 10.3 GB raw sequence data, corresponding to 103,357,809 raw reads (18.5 mln for flowers of plant 1.7 mln for leaves of plant 1, 14.3 mln for flowers of plant 2, 17.7 mln for leaves of plant 2, 16.1 mln for root sample 1, and 18.9 mln for root sample 2). A total of 103,025,683 high quality reads were filtered after removing of adapter sequences and quality trimming with Cutadapt [11] and Sickle (https://github.com/najoshi/sickle), respectively. The information on the transcriptome sequencing and assembly is summarised in Table 1.
Table 1

Summary of the sequencing and de novo assembly.

IndexMonotropa hypopitys
No of assembled reads103,025,683
Number of unigenes98,349
N50 length of unigene (bp)1342
Average unigene length (bp)780
Predicted proteins43,505
Unigens annotated in TrEMBL37,974
Unigens annotated in Swiss-Prot38,416
De novo assembly of transcriptome with clean reads from the combined six RNA-seq datasets was carried out using Trinity 2.1.1 with default parameters [12]. The assembly generated 98,349 unigens ranging in length from 201 to 12,993 bp, with an N50 of 1342 bp. Coding regions prediction in the assembled transcripts was performed by the TransDecoder program (http://transdecoder.github.io) implemented in the Trinity software. As a result, we predicted a total of 43,505 proteins. Trinotate (https://trinotate.github.io/) was used to assign hits from TrEMBL and Swiss-Prot databases (http://www.uniprot.org/uniprot/), and to assign GO terms and pfam domains. 37,974 unigenes were annotated in the TrEMBL protein database using predicted protein sequences and 38,416 unigenes were annotated in the Swiss-Prot database using BLASTX. The relative levels of transcription of protein-coding genes in each of four tissue samples were calculated by mapping of cleaned reads on the assembled transcripts employing Trinity scripts, RSEM [13] and Bowtie 2 program [14]. Cross sample normalization of transcription levels was done using Trinity scripts (TMM method). The data on the relative expression of annotated genes are available in Supplementary File 1. 822 of the assigned proteins were probably derived from root-associated fungi since (i) they have best BLASTP hits in fungal proteomes, (ii) the corresponding transcripts were expressed only in root samples but not in flowers and leaves, and (iii) nucleotide sequences of these genes were not found in the genome sequences derived from the above-ground parts of M. hypopitys plants (to be published elsewhere). In conclusion, we have sequenced and de novo assembled the transcriptome of the non-photosynthetic parasitic plant M. hypopitys. The obtained transcriptome data will be useful for further studies of the evolution of plant genomes upon transition to a non-photosynthetic lifestyle and the loss of photosynthesis-related functions.

Conflict of interest

The authors declare no conflicts of interest in this study.
Organism/cell line/tissueMonotropa hypopitys/flowers, flower bracts, and roots
SexN/A
Sequencer or array typeIllumina HiSeq2500
Data formatRaw data: FASTQ file
Experimental factorsDe novo transcriptome assembly of Monotropa hypopitys
Experimental featuresFlowers and leaves (flower bracts) of two individual Monotropa hypopitys plants and two pooled samples of roots with haustoria were harvested for total RNA extraction, sequencing, de novo transcriptome assembly and annotation
ConsentN/A
Sample source locationSample was collected in Kaluga region, Russia
  10 in total

1.  Transcriptomes of the parasitic plant family Orobanchaceae reveal surprising conservation of chlorophyll synthesis.

Authors:  Norman J Wickett; Loren A Honaas; Eric K Wafula; Malay Das; Kan Huang; Biao Wu; Lena Landherr; Michael P Timko; John Yoder; James H Westwood; Claude W dePamphilis
Journal:  Curr Biol       Date:  2011-12-08       Impact factor: 10.834

2.  Fast gapped-read alignment with Bowtie 2.

Authors:  Ben Langmead; Steven L Salzberg
Journal:  Nat Methods       Date:  2012-03-04       Impact factor: 28.547

3.  Investigating the path of plastid genome degradation in an early-transitional clade of heterotrophic orchids, and implications for heterotrophic angiosperms.

Authors:  Craig F Barrett; John V Freudenstein; Jeff Li; Dustin R Mayfield-Jones; Leticia Perez; J Chris Pires; Cristian Santos
Journal:  Mol Biol Evol       Date:  2014-08-28       Impact factor: 16.240

4.  The loss of photosynthetic pathways in the plastid and nuclear genomes of the non-photosynthetic mycoheterotrophic eudicot Monotropa hypopitys.

Authors:  Nikolai V Ravin; Eugeny V Gruzdev; Alexey V Beletsky; Alexander M Mazur; Egor B Prokhortchouk; Mikhail A Filyushin; Elena Z Kochieva; Vitaly V Kadnikov; Andrey V Mardanov; Konstantin G Skryabin
Journal:  BMC Plant Biol       Date:  2016-11-16       Impact factor: 4.215

5.  Loss of photosynthetic and chlororespiratory genes from the plastid genome of a parasitic flowering plant.

Authors:  C W dePamphilis; J D Palmer
Journal:  Nature       Date:  1990-11-22       Impact factor: 49.962

6.  Function and evolution of a minimal plastid genome from a nonphotosynthetic parasitic plant.

Authors:  K H Wolfe; C W Morden; J D Palmer
Journal:  Proc Natl Acad Sci U S A       Date:  1992-11-15       Impact factor: 11.205

7.  De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis.

Authors:  Brian J Haas; Alexie Papanicolaou; Moran Yassour; Manfred Grabherr; Philip D Blood; Joshua Bowden; Matthew Brian Couger; David Eccles; Bo Li; Matthias Lieber; Matthew D MacManes; Michael Ott; Joshua Orvis; Nathalie Pochet; Francesco Strozzi; Nathan Weeks; Rick Westerman; Thomas William; Colin N Dewey; Robert Henschel; Richard D LeDuc; Nir Friedman; Aviv Regev
Journal:  Nat Protoc       Date:  2013-07-11       Impact factor: 13.491

8.  A full-length enriched cDNA library and expressed sequence tag analysis of the parasitic weed, Striga hermonthica.

Authors:  Satoko Yoshida; Juliane K Ishida; Nasrein M Kamal; Abdelbagi M Ali; Shigetou Namba; Ken Shirasu
Journal:  BMC Plant Biol       Date:  2010-03-30       Impact factor: 4.215

9.  RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome.

Authors:  Bo Li; Colin N Dewey
Journal:  BMC Bioinformatics       Date:  2011-08-04       Impact factor: 3.307

10.  Pscroph, a parasitic plant EST database enriched for parasite associated transcripts.

Authors:  Manuel J Torres; Alexey A Tomilov; Natalya Tomilova; Russell L Reagan; John I Yoder
Journal:  BMC Plant Biol       Date:  2005-11-16       Impact factor: 4.215

  10 in total
  2 in total

1.  The YABBY Genes of Leaf and Leaf-Like Organ Polarity in Leafless Plant Monotropa hypopitys.

Authors:  Anna V Shchennikova; Marya A Slugina; Alexey V Beletsky; Mikhail A Filyushin; Andrey A Mardanov; Olga A Shulga; Elena Z Kochieva; Nikolay V Ravin; Konstantin G Skryabin
Journal:  Int J Genomics       Date:  2018-04-24       Impact factor: 2.326

2.  Annotated genome sequences of the carnivorous plant Roridula gorgonias and a non-carnivorous relative, Clethra arborea.

Authors:  Stefanie Hartmann; Michaela Preick; Silke Abelt; André Scheffel; Michael Hofreiter
Journal:  BMC Res Notes       Date:  2020-09-10
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.