Literature DB >> 33606251

Statistical Considerations on NGS Data for Inferring Copy Number Variations.

Jie Chen1.   

Abstract

The next-generation sequencing (NGS) technology has revolutionized research in genetics and genomics, resulting in massive NGS data and opening more fronts to answer unresolved issues in genetics. NGS data are usually stored at three levels: image files, sequence tags, and alignment reads. The sizes of these types of data usually range from several hundreds of gigabytes to several terabytes. Biostatisticians and bioinformaticians are typically working with the aligned NGS read count data (hence the last level of NGS data) for data modeling and interpretation.To horn in on the use of NGS technology, researchers utilize it to profile the whole genome to study DNA copy number variations (CNVs) for an individual subject (or patient) as well as groups of subjects (or patients). The resulting aligned NGS read count data are then modeled by proper mathematical and statistical approaches so that the loci of CNVs can be accurately detected. In this book chapter, a summary of most popularly used statistical methods for detecting CNVs using NGS data is given. The goal is to provide readers with a comprehensive resource of available statistical approaches for inferring DNA copy number variations using NGS data.

Entities:  

Keywords:  Bayesian analysis; CNVs; Information criterion; Likelihood ratio test; NGS reads; Read counts; Read depth; Statistical change point analysis

Mesh:

Year:  2021        PMID: 33606251     DOI: 10.1007/978-1-0716-1103-6_2

Source DB:  PubMed          Journal:  Methods Mol Biol        ISSN: 1064-3745


  34 in total

1.  CNAseg--a novel framework for identification of copy number changes in cancer from second-generation sequencing data.

Authors:  Sergii Ivakhno; Tom Royce; Anthony J Cox; Dirk J Evers; R Keira Cheetham; Simon Tavaré
Journal:  Bioinformatics       Date:  2010-10-21       Impact factor: 6.937

2.  BioHMM: a heterogeneous hidden Markov model for segmenting array CGH data.

Authors:  J C Marioni; N P Thorne; S Tavaré
Journal:  Bioinformatics       Date:  2006-03-13       Impact factor: 6.937

3.  Bayesian Hidden Markov Modeling of Array CGH Data.

Authors:  Subharup Guha; Yi Li; Donna Neuberg
Journal:  J Am Stat Assoc       Date:  2008-06-01       Impact factor: 5.033

4.  Global variation in copy number in the human genome.

Authors:  Richard Redon; Shumpei Ishikawa; Karen R Fitch; Lars Feuk; George H Perry; T Daniel Andrews; Heike Fiegler; Michael H Shapero; Andrew R Carson; Wenwei Chen; Eun Kyung Cho; Stephanie Dallaire; Jennifer L Freeman; Juan R González; Mònica Gratacòs; Jing Huang; Dimitrios Kalaitzopoulos; Daisuke Komura; Jeffrey R MacDonald; Christian R Marshall; Rui Mei; Lyndal Montgomery; Kunihiro Nishimura; Kohji Okamura; Fan Shen; Martin J Somerville; Joelle Tchinda; Armand Valsesia; Cara Woodwark; Fengtang Yang; Junjun Zhang; Tatiana Zerjal; Jane Zhang; Lluis Armengol; Donald F Conrad; Xavier Estivill; Chris Tyler-Smith; Nigel P Carter; Hiroyuki Aburatani; Charles Lee; Keith W Jones; Stephen W Scherer; Matthew E Hurles
Journal:  Nature       Date:  2006-11-23       Impact factor: 49.962

5.  High-resolution mapping of copy-number alterations with massively parallel sequencing.

Authors:  Derek Y Chiang; Gad Getz; David B Jaffe; Michael J T O'Kelly; Xiaojun Zhao; Scott L Carter; Carsten Russ; Chad Nusbaum; Matthew Meyerson; Eric S Lander
Journal:  Nat Methods       Date:  2008-11-30       Impact factor: 28.547

6.  Relative impact of nucleotide and copy number variation on gene expression phenotypes.

Authors:  Barbara E Stranger; Matthew S Forrest; Mark Dunning; Catherine E Ingle; Claude Beazley; Natalie Thorne; Richard Redon; Christine P Bird; Anna de Grassi; Charles Lee; Chris Tyler-Smith; Nigel Carter; Stephen W Scherer; Simon Tavaré; Panagiotis Deloukas; Matthew E Hurles; Emmanouil T Dermitzakis
Journal:  Science       Date:  2007-02-09       Impact factor: 47.728

7.  Ultrafast and memory-efficient alignment of short DNA sequences to the human genome.

Authors:  Ben Langmead; Cole Trapnell; Mihai Pop; Steven L Salzberg
Journal:  Genome Biol       Date:  2009-03-04       Impact factor: 13.583

8.  Systematic bias in high-throughput sequencing data and its correction by BEADS.

Authors:  Ming-Sin Cheung; Thomas A Down; Isabel Latorre; Julie Ahringer
Journal:  Nucleic Acids Res       Date:  2011-06-06       Impact factor: 16.971

9.  Detecting common copy number variants in high-throughput sequencing data by using JointSLM algorithm.

Authors:  Alberto Magi; Matteo Benelli; Seungtai Yoon; Franco Roviello; Francesca Torricelli
Journal:  Nucleic Acids Res       Date:  2011-02-14       Impact factor: 16.971

10.  SW-ARRAY: a dynamic programming solution for the identification of copy-number changes in genomic DNA using array comparative genome hybridization data.

Authors:  Thomas S Price; Regina Regan; Richard Mott; Asa Hedman; Ben Honey; Rachael J Daniels; Lee Smith; Andy Greenfield; Ana Tiganescu; Veronica Buckle; Nicki Ventress; Helena Ayyub; Anita Salhan; Susana Pedraza-Diaz; John Broxholme; Jiannis Ragoussis; Douglas R Higgs; Jonathan Flint; Samantha J L Knight
Journal:  Nucleic Acids Res       Date:  2005-06-16       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.