Literature DB >> 22540679

Reference-free transcriptome assembly in non-model animals from next-generation sequencing data.

V Cahais1, P Gayral, G Tsagkogeorga, J Melo-Ferreira, M Ballenghien, L Weinert, Y Chiari, K Belkhir, V Ranwez, N Galtier.   

Abstract

Next-generation sequencing (NGS) technologies offer the opportunity for population genomic study of non-model organisms sampled in the wild. The transcriptome is a convenient and popular target for such purposes. However, designing genetic markers from NGS transcriptome data requires assembling gene-coding sequences out of short reads. This is a complex task owing to gene duplications, genetic polymorphism, alternative splicing and transcription noise. Typical assembling programmes return thousands of predicted contigs, whose connection to the species true gene content is unclear, and from which SNP definition is uneasy. Here, the transcriptomes of five diverse non-model animal species (hare, turtle, ant, oyster and tunicate) were assembled from newly generated 454 and Illumina sequence reads. In two species for which a reference genome is available, a new procedure was introduced to annotate each predicted contig as either a full-length cDNA, fragment, chimera, allele, paralogue, genomic sequence or other, based on the number of, and overlap between, blast hits to the appropriate reference. Analyses showed that (i) the highest quality assemblies are obtained when 454 and Illumina data are combined, (ii) typical de novo assemblies include a majority of irrelevant cDNA predictions and (iii) assemblies can be appropriately cleaned by filtering contigs based on length and coverage. We conclude that robust, reference-free assembly of thousands of genes from transcriptomic NGS data is possible, opening promising perspectives for transcriptome-based population genomics in animals. A Galaxy pipeline implementing our best-performing assembling strategy is provided.
© 2012 Blackwell Publishing Ltd.

Entities:  

Mesh:

Year:  2012        PMID: 22540679     DOI: 10.1111/j.1755-0998.2012.03148.x

Source DB:  PubMed          Journal:  Mol Ecol Resour        ISSN: 1755-098X            Impact factor:   7.090


  71 in total

Review 1.  Next-generation sequencing in clinical virology: Discovery of new viruses.

Authors:  Sibnarayan Datta; Raghvendra Budhauliya; Bidisha Das; Soumya Chatterjee; Vijay Veer
Journal:  World J Virol       Date:  2015-08-12

2.  Transcriptome resources for the white-footed mouse (Peromyscus leucopus): new genomic tools for investigating ecologically divergent urban and rural populations.

Authors:  Stephen E Harris; Rachel J O'Neill; Jason Munshi-South
Journal:  Mol Ecol Resour       Date:  2014-07-16       Impact factor: 7.090

3.  Comparative population genomics in animals uncovers the determinants of genetic diversity.

Authors:  J Romiguier; P Gayral; M Ballenghien; A Bernard; V Cahais; A Chenuil; Y Chiari; R Dernat; L Duret; N Faivre; E Loire; J M Lourenco; B Nabholz; C Roux; G Tsagkogeorga; A A-T Weber; L A Weinert; K Belkhir; N Bierne; S Glémin; N Galtier
Journal:  Nature       Date:  2014-08-20       Impact factor: 49.962

4.  De novo transcriptome assembly for a non-model species, the blood-sucking bug Triatoma brasiliensis, a vector of Chagas disease.

Authors:  A Marchant; F Mougel; C Almeida; E Jacquin-Joly; J Costa; M Harry
Journal:  Genetica       Date:  2014-09-19       Impact factor: 1.082

5.  CPC2: a fast and accurate coding potential calculator based on sequence intrinsic features.

Authors:  Yu-Jian Kang; De-Chang Yang; Lei Kong; Mei Hou; Yu-Qi Meng; Liping Wei; Ge Gao
Journal:  Nucleic Acids Res       Date:  2017-07-03       Impact factor: 16.971

6.  RNA-seq in the tetraploid Xenopus laevis enables genome-wide insight in a classic developmental biology model organism.

Authors:  Nirav M Amin; Panna Tandon; Erin Osborne Nishimura; Frank L Conlon
Journal:  Methods       Date:  2013-06-20       Impact factor: 3.608

7.  Analysis of the transcriptome of the Indonesian coelacanth Latimeria menadoensis.

Authors:  Alberto Pallavicini; Adriana Canapa; Marco Barucca; Jessica Alfőldi; Maria Assunta Biscotti; Francesco Buonocore; Gianluca De Moro; Federica Di Palma; Anna Maria Fausto; Mariko Forconi; Marco Gerdol; Daisy Monica Makapedua; Jason Turner-Meier; Ettore Olmo; Giuseppe Scapigliati
Journal:  BMC Genomics       Date:  2013-08-08       Impact factor: 3.969

8.  Inferring bona fide transfrags in RNA-Seq derived-transcriptome assemblies of non-model organisms.

Authors:  Stanley Kimbung Mbandi; Uljana Hesse; Peter van Heusden; Alan Christoffels
Journal:  BMC Bioinformatics       Date:  2015-02-21       Impact factor: 3.169

9.  Comparative physiological and transcriptomic analyses of photosynthesis in Sphagneticola calendulacea (L.) Pruski and Sphagneticola trilobata (L.) Pruski.

Authors:  Min-Ling Cai; Qi-Lei Zhang; Jun-Jie Zhang; Wen-Qiao Ding; Hong-Ying Huang; Chang-Lian Peng
Journal:  Sci Rep       Date:  2020-10-20       Impact factor: 4.379

10.  SNP detection from de novo transcriptome sequencing in the bivalve Macoma balthica: marker development for evolutionary studies.

Authors:  Eric Pante; Audrey Rohfritsch; Vanessa Becquet; Khalid Belkhir; Nicolas Bierne; Pascale Garcia
Journal:  PLoS One       Date:  2012-12-26       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.