MOTIVATION: Structural variations and in particular copy number variations (CNVs) have dramatic effects of disease and traits. Technologies for identifying CNVs have been an active area of research for over 10 years. The current generation of high-throughput sequencing techniques presents new opportunities for identification of CNVs. Methods that utilize these technologies map sequencing reads to a reference genome and look for signatures which might indicate the presence of a CNV. These methods work well when CNVs lie within unique genomic regions. However, the problem of CNV identification and reconstruction becomes much more challenging when CNVs are in repeat-rich regions, due to the multiple mapping positions of the reads. RESULTS: In this study, we propose an efficient algorithm to handle these multi-mapping reads such that the CNVs can be reconstructed with high accuracy even for repeat-rich regions. To our knowledge, this is the first attempt to both identify and reconstruct CNVs in repeat-rich regions. Our experiments show that our method is not only computationally efficient but also accurate.
MOTIVATION: Structural variations and in particular copy number variations (CNVs) have dramatic effects of disease and traits. Technologies for identifying CNVs have been an active area of research for over 10 years. The current generation of high-throughput sequencing techniques presents new opportunities for identification of CNVs. Methods that utilize these technologies map sequencing reads to a reference genome and look for signatures which might indicate the presence of a CNV. These methods work well when CNVs lie within unique genomic regions. However, the problem of CNV identification and reconstruction becomes much more challenging when CNVs are in repeat-rich regions, due to the multiple mapping positions of the reads. RESULTS: In this study, we propose an efficient algorithm to handle these multi-mapping reads such that the CNVs can be reconstructed with high accuracy even for repeat-rich regions. To our knowledge, this is the first attempt to both identify and reconstruct CNVs in repeat-rich regions. Our experiments show that our method is not only computationally efficient but also accurate.
Authors: Enrique Gonzalez; Hemant Kulkarni; Hector Bolivar; Andrea Mangano; Racquel Sanchez; Gabriel Catano; Robert J Nibbs; Barry I Freedman; Marlon P Quinones; Michael J Bamshad; Krishna K Murthy; Brad H Rovin; William Bradley; Robert A Clark; Stephanie A Anderson; Robert J O'connell; Brian K Agan; Seema S Ahuja; Rosa Bologna; Luisa Sen; Matthew J Dolan; Sunil K Ahuja Journal: Science Date: 2005-01-06 Impact factor: 47.728
Authors: Robert Lucito; John Healy; Joan Alexander; Andrew Reiner; Diane Esposito; Maoyen Chi; Linda Rodgers; Amy Brady; Jonathan Sebat; Jennifer Troge; Joseph A West; Seth Rostan; Ken C Q Nguyen; Scott Powers; Kenneth Q Ye; Adam Olshen; Ennapadam Venkatraman; Larry Norton; Michael Wigler Journal: Genome Res Date: 2003-09-15 Impact factor: 9.043
Authors: Jonathan Sebat; B Lakshmi; Dheeraj Malhotra; Jennifer Troge; Christa Lese-Martin; Tom Walsh; Boris Yamrom; Seungtai Yoon; Alex Krasnitz; Jude Kendall; Anthony Leotta; Deepa Pai; Ray Zhang; Yoon-Ha Lee; James Hicks; Sarah J Spence; Annette T Lee; Kaija Puura; Terho Lehtimäki; David Ledbetter; Peter K Gregersen; Joel Bregman; James S Sutcliffe; Vaidehi Jobanputra; Wendy Chung; Dorothy Warburton; Mary-Claire King; David Skuse; Daniel H Geschwind; T Conrad Gilliam; Kenny Ye; Michael Wigler Journal: Science Date: 2007-03-15 Impact factor: 47.728
Authors: Timothy J Aitman; Rong Dong; Timothy J Vyse; Penny J Norsworthy; Michelle D Johnson; Jennifer Smith; Jonathan Mangion; Cheri Roberton-Lowe; Amy J Marshall; Enrico Petretto; Matthew D Hodges; Gurjeet Bhangal; Sheetal G Patel; Kelly Sheehan-Rooney; Mark Duda; Paul R Cook; David J Evans; Jan Domin; Jonathan Flint; Joseph J Boyle; Charles D Pusey; H Terence Cook Journal: Nature Date: 2006-02-16 Impact factor: 49.962
Authors: Jean Monlong; Patrick Cossette; Caroline Meloche; Guy Rouleau; Simon L Girard; Guillaume Bourque Journal: Nucleic Acids Res Date: 2018-08-21 Impact factor: 16.971
Authors: Mark N Ragheb; Christopher B Ford; Michael R Chase; Philana Ling Lin; Joanne L Flynn; Sarah M Fortune Journal: BMC Genomics Date: 2013-03-05 Impact factor: 3.969
Authors: Loredana M Genovese; Filippo Geraci; Lucia Corrado; Eleonora Mangano; Romina D'Aurizio; Roberta Bordoni; Marco Severgnini; Giovanni Manzini; Gianluca De Bellis; Sandra D'Alfonso; Marco Pellegrini Journal: Front Genet Date: 2018-05-02 Impact factor: 4.599