Cutoffs and k-mers: implications from a transcriptome study in allopolyploid plants.

Nicole Gruenheit, Oliver Deusch, Christian Esser, Matthias Becker, Claudia Voelckel, Peter Lockhart.

Abstract

BACKGROUND: Transcriptome analysis is increasingly being used to study the evolutionary origins and ecology of non-model plants. One issue for both transcriptome assembly and differential gene expression analyses is the common occurrence in plants of hybridisation and whole genome duplication (WGD), resulting in allopolyploidy. The divergence of duplicated genes following WGD creates near-identical homeologues that can be problematic for de novo assembly and also for reference-based assembly protocols that use short reads (35-100 bp).
RESULTS: Here we report a successful strategy for the assembly of two transcriptomes made using 75 bp Illumina reads from Pachycladon fastigiatum and Pachycladon cheesemanii. Both are allopolyploid plant species (2n = 20) that originated in the New Zealand Alps about 0.8 million years ago. In a systematic analysis of 19 different coverage cutoffs and 20 different k-mer sizes we showed that (i) none of the genes could be assembled across all of the parameter space, (ii) assembly of each gene required an optimal set of parameter values, and (iii) these parameter values could be explained in part by different gene expression levels and different degrees of similarity between genes.
CONCLUSIONS: To obtain optimal transcriptome assemblies for allopolyploid plants, k-mer size and k-mer coverage need to be considered simultaneously across a broad parameter space. This is important for assembling a maximum number of full-length ESTs and for avoiding chimeric assemblies of homeologous and paralogous gene copies.
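
The joint scan over k-mer size and coverage cutoff described above can be sketched as a simple parameter grid. This is an illustrative sketch only, not the authors' pipeline: the abstract gives the number of settings (20 k-mer sizes, 19 cutoffs) but not their values, so the ranges below are assumptions, and `parameter_grid`, `best_parameters_per_gene` and the "fraction of reference length assembled" score are hypothetical names chosen for the example.

```python
from itertools import product

# Assumed values: only the counts (20 k-mer sizes, 19 cutoffs) come from the abstract.
KMER_SIZES = list(range(25, 65, 2))      # 20 odd k-mer sizes: 25, 27, ..., 63
COVERAGE_CUTOFFS = list(range(2, 21))    # 19 coverage cutoffs: 2, 3, ..., 20


def parameter_grid(kmers=KMER_SIZES, cutoffs=COVERAGE_CUTOFFS):
    """Return every (k-mer size, coverage cutoff) pair to assemble with."""
    return list(product(kmers, cutoffs))


def best_parameters_per_gene(results):
    """Pick the best-scoring parameter pair for each gene.

    `results` maps (k, cutoff) -> {gene: score}, where score is a
    hypothetical quality measure such as the fraction of the reference
    gene length recovered by that assembly. Genes missing from a run
    were not assembled with those parameters.
    """
    best = {}
    for params, per_gene in results.items():
        for gene, score in per_gene.items():
            if gene not in best or score > best[gene][1]:
                best[gene] = (params, score)
    return best
```

In such a scheme each gene keeps its own optimum, which reflects the finding that no single combination of k-mer size and coverage cutoff assembles every gene.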

Year: 2012    PMID: 22417298    PMCID: PMC3378427    DOI: 10.1186/1471-2164-13-92

Source DB: PubMed    Journal: BMC Genomics    ISSN: 1471-2164    Impact factor: 3.969


