Literature DB >> 34114005

Shall genomic correlation structure be considered in copy number variants detection?

Fei Qin1, Xizhi Luo2, Guoshuai Cai3, Feifei Xiao2.   

Abstract

Copy number variation has been identified as a major source of genomic variation associated with disease susceptibility. With the advent of whole-exome sequencing (WES) technology, massive WES data have been generated, allowing for the identification of copy number variants (CNVs) in the protein-coding regions with direct functional interpretation. We have previously shown evidence of the genomic correlation structure in array data and developed a novel chromosomal breakpoint detection algorithm, LDcnv, which showed significantly improved detection power through integrating the correlation structure in a systematic modeling manner. However, it remains unexplored whether the genomic correlation exists in WES data and how such correlation structure integration can improve the CNV detection accuracy. In this study, we first explored the correlation structure of the WES data using the 1000 Genomes Project data. Both real raw read depth and median-normalized data showed strong evidence of the correlation structure. Motivated by this fact, we proposed a correlation-based method, CORRseq, as a novel release of the LDcnv algorithm in profiling WES data. The performance of CORRseq was evaluated in extensive simulation studies and real data analysis from the 1000 Genomes Project. CORRseq outperformed the existing methods in detecting medium and large CNVs. In conclusion, it would be more advantageous to model genomic correlation structure in detecting relatively long CNVs. This study provides great insights for methodology development of CNV detection with NGS data.
© The Author(s) 2021. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  copy number variation detection; correlation structure

Mesh:

Year:  2021        PMID: 34114005      PMCID: PMC8768456          DOI: 10.1093/bib/bbab215

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   13.994


  36 in total

1.  Integrated detection and population-genetic analysis of SNPs and copy number variation.

Authors:  Steven A McCarroll; Finny G Kuruvilla; Joshua M Korn; Simon Cawley; James Nemesh; Alec Wysoker; Michael H Shapero; Paul I W de Bakker; Julian B Maller; Andrew Kirby; Amanda L Elliott; Melissa Parkin; Earl Hubbell; Teresa Webster; Rui Mei; James Veitch; Patrick J Collins; Robert Handsaker; Steve Lincoln; Marcia Nizzari; John Blume; Keith W Jones; Rich Rava; Mark J Daly; Stacey B Gabriel; David Altshuler
Journal:  Nat Genet       Date:  2008-09-07       Impact factor: 38.330

2.  A very fast and accurate method for calling aberrations in array-CGH data.

Authors:  Matteo Benelli; Giuseppina Marseglia; Genni Nannetti; Roberta Paravidino; Federico Zara; Franca Dagna Bricarelli; Francesca Torricelli; Alberto Magi
Journal:  Biostatistics       Date:  2010-03-05       Impact factor: 5.899

3.  An accurate and powerful method for copy number variation detection.

Authors:  Feifei Xiao; Xizhi Luo; Ning Hao; Yue S Niu; Xiangjun Xiao; Guoshuai Cai; Christopher I Amos; Heping Zhang
Journal:  Bioinformatics       Date:  2019-09-01       Impact factor: 6.937

4.  Origins and functional impact of copy number variation in the human genome.

Authors:  Donald F Conrad; Dalila Pinto; Richard Redon; Lars Feuk; Omer Gokcumen; Yujun Zhang; Jan Aerts; T Daniel Andrews; Chris Barnes; Peter Campbell; Tomas Fitzgerald; Min Hu; Chun Hwa Ihm; Kati Kristiansson; Daniel G Macarthur; Jeffrey R Macdonald; Ifejinelo Onyiah; Andy Wing Chun Pang; Sam Robson; Kathy Stirrups; Armand Valsesia; Klaudia Walter; John Wei; Chris Tyler-Smith; Nigel P Carter; Charles Lee; Stephen W Scherer; Matthew E Hurles
Journal:  Nature       Date:  2009-10-07       Impact factor: 49.962

5.  CMDS: a population-based method for identifying recurrent DNA copy number aberrations in cancer from high-resolution data.

Authors:  Qunyuan Zhang; Li Ding; David E Larson; Daniel C Koboldt; Michael D McLellan; Ken Chen; Xiaoqi Shi; Aldi Kraja; Elaine R Mardis; Richard K Wilson; Ingrid B Borecki; Michael A Province
Journal:  Bioinformatics       Date:  2009-12-23       Impact factor: 6.937

6.  Neurodevelopmental disease genes implicated by de novo mutation and copy number variation morbidity.

Authors:  Bradley P Coe; Holly A F Stessman; Arvis Sulovari; Madeleine R Geisheker; Trygve E Bakken; Allison M Lake; Joseph D Dougherty; Ed S Lein; Fereydoun Hormozdiari; Raphael A Bernier; Evan E Eichler
Journal:  Nat Genet       Date:  2018-12-17       Impact factor: 38.330

7.  Copy number variation detection and genotyping from exome sequence data.

Authors:  Niklas Krumm; Peter H Sudmant; Arthur Ko; Brian J O'Roak; Maika Malig; Bradley P Coe; Aaron R Quinlan; Deborah A Nickerson; Evan E Eichler
Journal:  Genome Res       Date:  2012-05-14       Impact factor: 9.043

8.  Detecting common copy number variants in high-throughput sequencing data by using JointSLM algorithm.

Authors:  Alberto Magi; Matteo Benelli; Seungtai Yoon; Franco Roviello; Francesca Torricelli
Journal:  Nucleic Acids Res       Date:  2011-02-14       Impact factor: 16.971

9.  A global reference for human genetic variation.

Authors:  Adam Auton; Lisa D Brooks; Richard M Durbin; Erik P Garrison; Hyun Min Kang; Jan O Korbel; Jonathan L Marchini; Shane McCarthy; Gil A McVean; Gonçalo R Abecasis
Journal:  Nature       Date:  2015-10-01       Impact factor: 49.962

10.  CONY: A Bayesian procedure for detecting copy number variations from sequencing read depths.

Authors:  Yu-Chung Wei; Guan-Hua Huang
Journal:  Sci Rep       Date:  2020-06-26       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.