Literature DB >> 21985277

A model-based circular binary segmentation algorithm for the analysis of array CGH data.

Fang-Han Hsu1, Hung-I H Chen, Mong-Hsun Tsai, Liang-Chuan Lai, Chi-Cheng Huang, Shih-Hsin Tu, Eric Y Chuang, Yidong Chen.   

Abstract

BACKGROUND: Circular Binary Segmentation (CBS) is a permutation-based algorithm for array Comparative Genomic Hybridization (aCGH) data analysis. CBS accurately segments data by detecting change-points using a maximal-t test; but extensive computational burden is involved for evaluating the significance of change-points using permutations. A recent implementation utilizing a hybrid method and early stopping rules (hybrid CBS) to improve the performance in speed was subsequently proposed. However, a time analysis revealed that a major portion of computation time of the hybrid CBS was still spent on permutation. In addition, what the hybrid method provides is an approximation of the significance upper bound or lower bound, not an approximation of the significance of change-points itself.
RESULTS: We developed a novel model-based algorithm, extreme-value based CBS (eCBS), which limits permutations and provides robust results without loss of accuracy. Thousands of aCGH data under null hypothesis were simulated in advance based on a variety of non-normal assumptions, and the corresponding maximal-t distribution was modeled by the Generalized Extreme Value (GEV) distribution. The modeling results, which associate characteristics of aCGH data to the GEV parameters, constitute lookup tables (eXtreme model). Using the eXtreme model, the significance of change-points could be evaluated in a constant time complexity through a table lookup process.
CONCLUSIONS: A novel algorithm, eCBS, was developed in this study. The current implementation of eCBS consistently outperforms the hybrid CBS 4× to 20× in computation time without loss of accuracy. Source codes, supplementary materials, supplementary figures, and supplementary tables can be found at http://ntumaps.cgm.ntu.edu.tw/eCBSsupplementary.

Entities:  

Year:  2011        PMID: 21985277      PMCID: PMC3224564          DOI: 10.1186/1756-0500-4-394

Source DB:  PubMed          Journal:  BMC Res Notes        ISSN: 1756-0500


  17 in total

1.  A comparison study: applying segmentation to array CGH data for downstream analyses.

Authors:  Hanni Willenbrock; Jane Fridlyand
Journal:  Bioinformatics       Date:  2005-09-13       Impact factor: 6.937

Review 2.  Array comparative genomic hybridization and its applications in cancer.

Authors:  Daniel Pinkel; Donna G Albertson
Journal:  Nat Genet       Date:  2005-06       Impact factor: 38.330

3.  A faster circular binary segmentation algorithm for the analysis of array CGH data.

Authors:  E S Venkatraman; Adam B Olshen
Journal:  Bioinformatics       Date:  2007-01-18       Impact factor: 6.937

Review 4.  Copy number variants and genetic traits: closer to the resolution of phenotypic to genotypic variability.

Authors:  Jacques S Beckmann; Xavier Estivill; Stylianos E Antonarakis
Journal:  Nat Rev Genet       Date:  2007-08       Impact factor: 53.242

5.  Sparse representation and Bayesian detection of genome copy number alterations from microarray data.

Authors:  Roger Pique-Regi; Jordi Monso-Varona; Antonio Ortega; Robert C Seeger; Timothy J Triche; Shahab Asgharzadeh
Journal:  Bioinformatics       Date:  2008-01-18       Impact factor: 6.937

6.  A Bayesian segmentation approach to ascertain copy number variations at the population level.

Authors:  Long Yang Wu; Hugh A Chipman; Shelley B Bull; Laurent Briollais; Kesheng Wang
Journal:  Bioinformatics       Date:  2009-04-23       Impact factor: 6.937

7.  A note on oligonucleotide expression values not being normally distributed.

Authors:  Johanna Hardin; Jason Wilson
Journal:  Biostatistics       Date:  2009-03-10       Impact factor: 5.899

8.  High-resolution, dual-platform aCGH analysis reveals frequent HIPK2 amplification and increased expression in pilocytic astrocytomas.

Authors:  H Deshmukh; T H Yeh; J Yu; M K Sharma; A Perry; J R Leonard; M A Watson; D H Gutmann; R Nagarajan
Journal:  Oncogene       Date:  2008-04-14       Impact factor: 9.867

9.  Identification of somatically acquired rearrangements in cancer using genome-wide massively parallel paired-end sequencing.

Authors:  Peter J Campbell; Philip J Stephens; Erin D Pleasance; Sarah O'Meara; Heng Li; Thomas Santarius; Lucy A Stebbings; Catherine Leroy; Sarah Edkins; Claire Hardy; Jon W Teague; Andrew Menzies; Ian Goodhead; Daniel J Turner; Christopher M Clee; Michael A Quail; Antony Cox; Clive Brown; Richard Durbin; Matthew E Hurles; Paul A W Edwards; Graham R Bignell; Michael R Stratton; P Andrew Futreal
Journal:  Nat Genet       Date:  2008-04-27       Impact factor: 38.330

10.  Integrated genomic analyses of ovarian carcinoma.

Authors: 
Journal:  Nature       Date:  2011-06-29       Impact factor: 49.962

View more
  4 in total

Review 1.  Current analysis platforms and methods for detecting copy number variation.

Authors:  Wenli Li; Michael Olivier
Journal:  Physiol Genomics       Date:  2012-11-06       Impact factor: 3.107

2.  A genomic approach to study down syndrome and cancer inverse comorbidity: untangling the chromosome 21.

Authors:  Jaume Forés-Martos; Raimundo Cervera-Vidal; Enrique Chirivella; Alberto Ramos-Jarero; Joan Climent
Journal:  Front Physiol       Date:  2015-02-04       Impact factor: 4.566

3.  Mapping the micro-proteome of the nuclear lamina and lamina-associated domains.

Authors:  Xianrong Wong; Jevon A Cutler; Victoria E Hoskins; Molly Gordon; Anil K Madugundu; Akhilesh Pandey; Karen L Reddy
Journal:  Life Sci Alliance       Date:  2021-03-23

4.  Reducing confounding and suppression effects in TCGA data: an integrated analysis of chemotherapy response in ovarian cancer.

Authors:  Fang-Han Hsu; Erchin Serpedin; Tzu-Hung Hsiao; Alexander J R Bishop; Edward R Dougherty; Yidong Chen
Journal:  BMC Genomics       Date:  2012-10-26       Impact factor: 3.969

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.