Literature DB >> 24603986

Estimating optimal window size for analysis of low-coverage next-generation sequence data.

Arief Gusnanto1, Charles C Taylor1, Ibrahim Nafisah2, Henry M Wood1, Pamela Rabbitts1, Stefano Berri1.   

Abstract

MOTIVATION: Current high-throughput sequencing has greatly transformed genome sequence analysis. In the context of very low-coverage sequencing (<0.1×), performing 'binning' or 'windowing' on mapped short sequences ('reads') is critical to extract genomic information of interest for further evaluation, such as copy-number alteration analysis. If the window size is too small, many windows will exhibit zero counts and almost no pattern can be observed. In contrast, if the window size is too wide, the patterns or genomic features will be 'smoothed out'. Our objective is to identify an optimal window size in between the two extremes.
RESULTS: We assume the reads density to be a step function. Given this model, we propose a data-based estimation of optimal window size based on Akaike's information criterion (AIC) and cross-validation (CV) log-likelihood. By plotting the AIC and CV log-likelihood curve as a function of window size, we are able to estimate the optimal window size that minimizes AIC or maximizes CV log-likelihood. The proposed methods are of general purpose and we illustrate their application using low-coverage next-generation sequence datasets from real tumour samples and simulated datasets.
AVAILABILITY AND IMPLEMENTATION: An R package to estimate optimal window size is available at http://www1.maths.leeds.ac.uk/∼arief/R/win/.
© The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Year:  2014        PMID: 24603986     DOI: 10.1093/bioinformatics/btu123

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  10 in total

1.  DNA copy number profiling using single-cell sequencing.

Authors:  Xuefeng Wang; Hao Chen; Nancy R Zhang
Journal:  Brief Bioinform       Date:  2018-09-28       Impact factor: 11.622

2.  Prediction of tumour pathological subtype from genomic profile using sparse logistic regression with random effects.

Authors:  Özlem Kaymaz; Khaled Alqahtani; Henry M Wood; Arief Gusnanto
Journal:  J Appl Stat       Date:  2020-03-11       Impact factor: 1.416

3.  FocalCall: An R Package for the Annotation of Focal Copy Number Aberrations.

Authors:  Oscar Krijgsman; Christian Benner; Gerrit A Meijer; Mark A van de Wiel; Bauke Ylstra
Journal:  Cancer Inform       Date:  2014-12-01

4.  Using low-coverage whole genome sequencing technique to analyze the chromosomal copy number alterations in the exfoliative cells of cervical cancer.

Authors:  Tong Ren; Jing Suo; Shikai Liu; Shu Wang; Shan Shu; Yang Xiang; Jing He Lang
Journal:  J Gynecol Oncol       Date:  2018-06-29       Impact factor: 4.401

5.  High-Resolution Copy Number Patterns From Clinically Relevant FFPE Material.

Authors:  Anastasia Filia; Alastair Droop; Mark Harland; Helene Thygesen; Juliette Randerson-Moor; Helen Snowden; Claire Taylor; Joey Mark S Diaz; Joanna Pozniak; Jérémie Nsengimana; Jon Laye; Julia A Newton-Bishop; D Timothy Bishop
Journal:  Sci Rep       Date:  2019-06-20       Impact factor: 4.379

Review 6.  Progress in Methods for Copy Number Variation Profiling.

Authors:  Veronika Gordeeva; Elena Sharova; Georgij Arapidi
Journal:  Int J Mol Sci       Date:  2022-02-15       Impact factor: 5.923

7.  Targeted or whole genome sequencing of formalin fixed tissue samples: potential applications in cancer genomics.

Authors:  Sarah Munchel; Yen Hoang; Yue Zhao; Joseph Cottrell; Brandy Klotzle; Andrew K Godwin; Devin Koestler; Peter Beyerlein; Jian-Bing Fan; Marina Bibikova; Jeremy Chien
Journal:  Oncotarget       Date:  2015-09-22

8.  Bacteria pathogens drive host colonic epithelial cell promoter hypermethylation of tumor suppressor genes in colorectal cancer.

Authors:  Xiaoxuan Xia; William Ka Kei Wu; Sunny Hei Wong; Dabin Liu; Thomas Ngai Yeung Kwong; Geicho Nakatsu; Pearlly S Yan; Yu-Ming Chuang; Michael Wing-Yan Chan; Olabisi Oluwabukola Coker; Zigui Chen; Yun Kit Yeoh; Liuyang Zhao; Xiansong Wang; Wing Yin Cheng; Matthew Tak Vai Chan; Paul Kay Sheung Chan; Joseph Jao Yiu Sung; Maggie Haitian Wang; Jun Yu
Journal:  Microbiome       Date:  2020-07-16       Impact factor: 14.650

9.  Detecting differential DNA methylation from sequencing of bisulfite converted DNA of diverse species.

Authors:  Iksoo Huh; Xin Wu; Taesung Park; Soojin V Yi
Journal:  Brief Bioinform       Date:  2019-01-18       Impact factor: 11.622

10.  scDPN for High-throughput Single-cell CNV Detection to Uncover Clonal Evolution During HCC Recurrence.

Authors:  Liang Wu; Miaomiao Jiang; Yuzhou Wang; Biaofeng Zhou; Yunfan Sun; Kaiqian Zhou; Jiarui Xie; Yu Zhong; Zhikun Zhao; Michael Dean; Yong Hou; Shiping Liu
Journal:  Genomics Proteomics Bioinformatics       Date:  2021-07-17       Impact factor: 7.691

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.