Literature DB >> 27458203

SNP calling from RNA-seq data without a reference genome: identification, quantification, differential analysis and impact on the protein sequence.

Hélène Lopez-Maestre1,2, Lilia Brinza3, Camille Marchet4, Janice Kielbassa5, Sylvère Bastien1,2, Mathilde Boutigny1,2, David Monnin1, Adil El Filali1, Claudia Marcia Carareto6, Cristina Vieira1,2, Franck Picard1, Natacha Kremer1, Fabrice Vavre1,2, Marie-France Sagot1,2, Vincent Lacroix7,2.   

Abstract

SNPs (Single Nucleotide Polymorphisms) are genetic markers whose precise identification is a prerequisite for association studies. Methods to identify them are currently well developed for model species, but rely on the availability of a (good) reference genome, and therefore cannot be applied to non-model species. They are also mostly tailored for whole genome (re-)sequencing experiments, whereas in many cases, transcriptome sequencing can be used as a cheaper alternative which already enables to identify SNPs located in transcribed regions. In this paper, we propose a method that identifies, quantifies and annotates SNPs without any reference genome, using RNA-seq data only. Individuals can be pooled prior to sequencing, if not enough material is available from one individual. Using pooled human RNA-seq data, we clarify the precision and recall of our method and discuss them with respect to other methods which use a reference genome or an assembled transcriptome. We then validate experimentally the predictions of our method using RNA-seq data from two non-model species. The method can be used for any species to annotate SNPs and predict their impact on the protein sequence. We further enable to test for the association of the identified SNPs with a phenotype of interest.
© The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Substances:

Year:  2016        PMID: 27458203      PMCID: PMC5100560          DOI: 10.1093/nar/gkw655

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  37 in total

1.  BLAT--the BLAST-like alignment tool.

Authors:  W James Kent
Journal:  Genome Res       Date:  2002-04       Impact factor: 9.043

2.  Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences.

Authors:  Weizhong Li; Adam Godzik
Journal:  Bioinformatics       Date:  2006-05-26       Impact factor: 6.937

3.  Velvet: algorithms for de novo short read assembly using de Bruijn graphs.

Authors:  Daniel R Zerbino; Ewan Birney
Journal:  Genome Res       Date:  2008-03-18       Impact factor: 9.043

4.  Fast gapped-read alignment with Bowtie 2.

Authors:  Ben Langmead; Steven L Salzberg
Journal:  Nat Methods       Date:  2012-03-04       Impact factor: 28.547

5.  Evolutionary relationships of Drosophila mojavensis geographic host races and their sister species Drosophila arizonae.

Authors:  L K Reed; M Nyboer; T A Markow
Journal:  Mol Ecol       Date:  2007-03       Impact factor: 6.185

6.  Population genetics and geographic variation of alcohol dehydrogenase (Adh) paralogs and glucose-6-phosphate dehydrogenase (G6pd) in Drosophila mojavensis.

Authors:  Luciano M Matzkin
Journal:  Mol Biol Evol       Date:  2003-12-05       Impact factor: 16.240

7.  Using cascading Bloom filters to improve the memory usage for de Brujin graphs.

Authors:  Kamil Salikhov; Gustavo Sacomoto; Gregory Kucherov
Journal:  Algorithms Mol Biol       Date:  2014-02-24       Impact factor: 1.405

Review 8.  Sequencing pools of individuals - mining genome-wide polymorphism data without big funding.

Authors:  Christian Schlötterer; Raymond Tobler; Robert Kofler; Viola Nolte
Journal:  Nat Rev Genet       Date:  2014-09-23       Impact factor: 53.242

9.  An integrated map of genetic variation from 1,092 human genomes.

Authors:  Goncalo R Abecasis; Adam Auton; Lisa D Brooks; Mark A DePristo; Richard M Durbin; Robert E Handsaker; Hyun Min Kang; Gabor T Marth; Gil A McVean
Journal:  Nature       Date:  2012-11-01       Impact factor: 49.962

10.  Transcriptome and genome sequencing uncovers functional variation in humans.

Authors:  Tuuli Lappalainen; Michael Sammeth; Marc R Friedländer; Peter A C 't Hoen; Jean Monlong; Manuel A Rivas; Mar Gonzàlez-Porta; Natalja Kurbatova; Thasso Griebel; Pedro G Ferreira; Matthias Barann; Thomas Wieland; Liliana Greger; Maarten van Iterson; Jonas Almlöf; Paolo Ribeca; Irina Pulyakhina; Daniela Esser; Thomas Giger; Andrew Tikhonov; Marc Sultan; Gabrielle Bertier; Daniel G MacArthur; Monkol Lek; Esther Lizano; Henk P J Buermans; Ismael Padioleau; Thomas Schwarzmayr; Olof Karlberg; Halit Ongen; Helena Kilpinen; Sergi Beltran; Marta Gut; Katja Kahlem; Vyacheslav Amstislavskiy; Oliver Stegle; Matti Pirinen; Stephen B Montgomery; Peter Donnelly; Mark I McCarthy; Paul Flicek; Tim M Strom; Hans Lehrach; Stefan Schreiber; Ralf Sudbrak; Angel Carracedo; Stylianos E Antonarakis; Robert Häsler; Ann-Christine Syvänen; Gert-Jan van Ommen; Alvis Brazma; Thomas Meitinger; Philip Rosenstiel; Roderic Guigó; Ivo G Gut; Xavier Estivill; Emmanouil T Dermitzakis
Journal:  Nature       Date:  2013-09-15       Impact factor: 49.962

View more
  31 in total

1.  Single nucleotide variant counts computed from RNA sequencing and cellular traffic into human kidney allografts.

Authors:  Gaurav Thareja; Hua Yang; Shahina Hayat; Franco B Mueller; John R Lee; Michelle Lubetzky; Darshana M Dadhania; Aziz Belkadi; Surya V Seshan; Karsten Suhre; Manikkam Suthanthiran; Thangamani Muthukumar
Journal:  Am J Transplant       Date:  2018-05-15       Impact factor: 8.086

Review 2.  Cancer transcriptome profiling at the juncture of clinical translation.

Authors:  Marcin Cieślik; Arul M Chinnaiyan
Journal:  Nat Rev Genet       Date:  2017-12-27       Impact factor: 53.242

3.  Complementarity of assembly-first and mapping-first approaches for alternative splicing annotation and differential analysis from RNAseq data.

Authors:  Clara Benoit-Pilven; Camille Marchet; Emilie Chautard; Leandro Lima; Marie-Pierre Lambert; Gustavo Sacomoto; Amandine Rey; Audric Cologne; Sophie Terrone; Louis Dulaurier; Jean-Baptiste Claude; Cyril F Bourgeois; Didier Auboeuf; Vincent Lacroix
Journal:  Sci Rep       Date:  2018-03-09       Impact factor: 4.379

4.  Playing hide and seek with repeats in local and global de novo transcriptome assembly of short RNA-seq reads.

Authors:  Leandro Lima; Blerina Sinaimeri; Gustavo Sacomoto; Helene Lopez-Maestre; Camille Marchet; Vincent Miele; Marie-France Sagot; Vincent Lacroix
Journal:  Algorithms Mol Biol       Date:  2017-02-22       Impact factor: 1.405

5.  Identification of QTLs and joint QTL segments of leaflet traits at different canopy layers in an interspecific RIL population of soybean.

Authors:  Jian Zeng; Meng Li; Hongmei Qiu; Yufei Xu; Beibei Feng; Fangyuan Kou; Xianchao Xu; Muhammad Khuram Razzaq; Junyi Gai; Yueqiang Wang; Guangnan Xing
Journal:  Theor Appl Genet       Date:  2022-10-07       Impact factor: 5.574

6.  Scalable, ultra-fast, and low-memory construction of compacted de Bruijn graphs with Cuttlefish 2.

Authors:  Jamshed Khan; Marek Kokot; Sebastian Deorowicz; Rob Patro
Journal:  Genome Biol       Date:  2022-09-08       Impact factor: 17.906

7.  A de novo assembled high-quality chromosome-scale Trifolium pratense genome and fine-scale phylogenetic analysis.

Authors:  Zhenfei Yan; Lijun Sang; Yue Ma; Yong He; Juan Sun; Lichao Ma; Shuo Li; Fuhong Miao; Zixin Zhang; Jianwei Huang; Zengyu Wang; Guofeng Yang
Journal:  BMC Plant Biol       Date:  2022-07-11       Impact factor: 5.260

8.  Thermosensitive sex chromosome dosage compensation in ZZ/ZW softshell turtles, Apalone spinifera.

Authors:  Basanta Bista; Zhiqiang Wu; Robert Literman; Nicole Valenzuela
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2021-07-26       Impact factor: 6.671

9.  Candidate genes for shell colour polymorphism in Cepaea nemoralis.

Authors:  Jesse Kerkvliet; Tjalf de Boer; Menno Schilthuizen; Ken Kraaijeveld
Journal:  PeerJ       Date:  2017-09-18       Impact factor: 2.984

10.  Determinants of the Efficacy of Natural Selection on Coding and Noncoding Variability in Two Passerine Species.

Authors:  Pádraic Corcoran; Toni I Gossmann; Henry J Barton; Jon Slate; Kai Zeng
Journal:  Genome Biol Evol       Date:  2017-11-01       Impact factor: 3.416

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.