Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 SNP calling from RNA-seq data without a reference genome: identification, quantification, differential analysis and impact on the protein sequence.

Literature DB >> 27458203

SNP calling from RNA-seq data without a reference genome: identification, quantification, differential analysis and impact on the protein sequence.

Hélène Lopez-Maestre^1,2, Lilia Brinza³, Camille Marchet⁴, Janice Kielbassa⁵, Sylvère Bastien^1,2, Mathilde Boutigny^1,2, David Monnin¹, Adil El Filali¹, Claudia Marcia Carareto⁶, Cristina Vieira^1,2, Franck Picard¹, Natacha Kremer¹, Fabrice Vavre^1,2, Marie-France Sagot^1,2, Vincent Lacroix^7,2.

Abstract

SNPs (Single Nucleotide Polymorphisms) are genetic markers whose precise identification is a prerequisite for association studies. Methods to identify them are currently well developed for model species, but rely on the availability of a (good) reference genome, and therefore cannot be applied to non-model species. They are also mostly tailored for whole genome (re-)sequencing experiments, whereas in many cases, transcriptome sequencing can be used as a cheaper alternative which already enables to identify SNPs located in transcribed regions. In this paper, we propose a method that identifies, quantifies and annotates SNPs without any reference genome, using RNA-seq data only. Individuals can be pooled prior to sequencing, if not enough material is available from one individual. Using pooled human RNA-seq data, we clarify the precision and recall of our method and discuss them with respect to other methods which use a reference genome or an assembled transcriptome. We then validate experimentally the predictions of our method using RNA-seq data from two non-model species. The method can be used for any species to annotate SNPs and predict their impact on the protein sequence. We further enable to test for the association of the identified SNPs with a phenotype of interest.

Entities: Chemical Disease Species

Mesh：

Substances：
Genetic Markers

Year: 2016 PMID： 27458203 PMCID： PMC5100560 DOI： 10.1093/nar/gkw655

Source DB: PubMed Journal: Nucleic Acids Res ISSN： 0305-1048 Impact factor: 16.971

37 in total

1. BLAT--the BLAST-like alignment tool.

Authors: W James Kent
Journal: Genome Res Date: 2002-04 Impact factor: 9.043

2. Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences.

Authors: Weizhong Li; Adam Godzik
Journal: Bioinformatics Date: 2006-05-26 Impact factor: 6.937

3. Velvet: algorithms for de novo short read assembly using de Bruijn graphs.

Authors: Daniel R Zerbino; Ewan Birney
Journal: Genome Res Date: 2008-03-18 Impact factor: 9.043

4. Fast gapped-read alignment with Bowtie 2.

Authors: Ben Langmead; Steven L Salzberg
Journal: Nat Methods Date: 2012-03-04 Impact factor: 28.547

5. Evolutionary relationships of Drosophila mojavensis geographic host races and their sister species Drosophila arizonae.

Authors: L K Reed; M Nyboer; T A Markow
Journal: Mol Ecol Date: 2007-03 Impact factor: 6.185

6. Population genetics and geographic variation of alcohol dehydrogenase (Adh) paralogs and glucose-6-phosphate dehydrogenase (G6pd) in Drosophila mojavensis.

Authors: Luciano M Matzkin
Journal: Mol Biol Evol Date: 2003-12-05 Impact factor: 16.240

7. Using cascading Bloom filters to improve the memory usage for de Brujin graphs.

Authors: Kamil Salikhov; Gustavo Sacomoto; Gregory Kucherov
Journal: Algorithms Mol Biol Date: 2014-02-24 Impact factor: 1.405

Review 8. Sequencing pools of individuals - mining genome-wide polymorphism data without big funding.

Authors: Christian Schlötterer; Raymond Tobler; Robert Kofler; Viola Nolte
Journal: Nat Rev Genet Date: 2014-09-23 Impact factor: 53.242

9. An integrated map of genetic variation from 1,092 human genomes.

Authors: Goncalo R Abecasis; Adam Auton; Lisa D Brooks; Mark A DePristo; Richard M Durbin; Robert E Handsaker; Hyun Min Kang; Gabor T Marth; Gil A McVean
Journal: Nature Date: 2012-11-01 Impact factor: 49.962

10. Transcriptome and genome sequencing uncovers functional variation in humans.

Authors: Tuuli Lappalainen; Michael Sammeth; Marc R Friedländer; Peter A C 't Hoen; Jean Monlong; Manuel A Rivas; Mar Gonzàlez-Porta; Natalja Kurbatova; Thasso Griebel; Pedro G Ferreira; Matthias Barann; Thomas Wieland; Liliana Greger; Maarten van Iterson; Jonas Almlöf; Paolo Ribeca; Irina Pulyakhina; Daniela Esser; Thomas Giger; Andrew Tikhonov; Marc Sultan; Gabrielle Bertier; Daniel G MacArthur; Monkol Lek; Esther Lizano; Henk P J Buermans; Ismael Padioleau; Thomas Schwarzmayr; Olof Karlberg; Halit Ongen; Helena Kilpinen; Sergi Beltran; Marta Gut; Katja Kahlem; Vyacheslav Amstislavskiy; Oliver Stegle; Matti Pirinen; Stephen B Montgomery; Peter Donnelly; Mark I McCarthy; Paul Flicek; Tim M Strom; Hans Lehrach; Stefan Schreiber; Ralf Sudbrak; Angel Carracedo; Stylianos E Antonarakis; Robert Häsler; Ann-Christine Syvänen; Gert-Jan van Ommen; Alvis Brazma; Thomas Meitinger; Philip Rosenstiel; Roderic Guigó; Ivo G Gut; Xavier Estivill; Emmanouil T Dermitzakis
Journal: Nature Date: 2013-09-15 Impact factor: 49.962

31 in total

1. Single nucleotide variant counts computed from RNA sequencing and cellular traffic into human kidney allografts.

Authors: Gaurav Thareja; Hua Yang; Shahina Hayat; Franco B Mueller; John R Lee; Michelle Lubetzky; Darshana M Dadhania; Aziz Belkadi; Surya V Seshan; Karsten Suhre; Manikkam Suthanthiran; Thangamani Muthukumar
Journal: Am J Transplant Date: 2018-05-15 Impact factor: 8.086

Review 2. Cancer transcriptome profiling at the juncture of clinical translation.

Authors: Marcin Cieślik; Arul M Chinnaiyan
Journal: Nat Rev Genet Date: 2017-12-27 Impact factor: 53.242

3. Complementarity of assembly-first and mapping-first approaches for alternative splicing annotation and differential analysis from RNAseq data.

Authors: Clara Benoit-Pilven; Camille Marchet; Emilie Chautard; Leandro Lima; Marie-Pierre Lambert; Gustavo Sacomoto; Amandine Rey; Audric Cologne; Sophie Terrone; Louis Dulaurier; Jean-Baptiste Claude; Cyril F Bourgeois; Didier Auboeuf; Vincent Lacroix
Journal: Sci Rep Date: 2018-03-09 Impact factor: 4.379

4. Playing hide and seek with repeats in local and global de novo transcriptome assembly of short RNA-seq reads.

Authors: Leandro Lima; Blerina Sinaimeri; Gustavo Sacomoto; Helene Lopez-Maestre; Camille Marchet; Vincent Miele; Marie-France Sagot; Vincent Lacroix
Journal: Algorithms Mol Biol Date: 2017-02-22 Impact factor: 1.405

5. Identification of QTLs and joint QTL segments of leaflet traits at different canopy layers in an interspecific RIL population of soybean.

Authors: Jian Zeng; Meng Li; Hongmei Qiu; Yufei Xu; Beibei Feng; Fangyuan Kou; Xianchao Xu; Muhammad Khuram Razzaq; Junyi Gai; Yueqiang Wang; Guangnan Xing
Journal: Theor Appl Genet Date: 2022-10-07 Impact factor: 5.574