Literature DB >> 27153582

On genomic repeats and reproducibility.

Can Firtina1, Can Alkan1.   

Abstract

RESULTS: Here, we present a comprehensive analysis on the reproducibility of computational characterization of genomic variants using high throughput sequencing data. We reanalyzed the same datasets twice, using the same tools with the same parameters, where we only altered the order of reads in the input (i.e. FASTQ file). Reshuffling caused the reads from repetitive regions being mapped to different locations in the second alignment, and we observed similar results when we only applied a scatter/gather approach for read mapping-without prior shuffling. Our results show that, some of the most common variation discovery algorithms do not handle the ambiguous read mappings accurately when random locations are selected. In addition, we also observed that even when the exact same alignment is used, the GATK HaplotypeCaller generates slightly different call sets, which we pinpoint to the variant filtration step. We conclude that, algorithms at each step of genomic variation discovery and characterization need to treat ambiguous mappings in a deterministic fashion to ensure full replication of results.
AVAILABILITY AND IMPLEMENTATION: Code, scripts and the generated VCF files are available at DOI:10.5281/zenodo.32611. CONTACT: calkan@cs.bilkent.edu.tr SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Mesh:

Year:  2016        PMID: 27153582     DOI: 10.1093/bioinformatics/btw139

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  15 in total

Review 1.  Nanopore sequencing technology and tools for genome assembly: computational analysis of the current state, bottlenecks and future directions.

Authors:  Damla Senol Cali; Jeremie S Kim; Saugata Ghose; Can Alkan; Onur Mutlu
Journal:  Brief Bioinform       Date:  2019-07-19       Impact factor: 11.622

2.  Hercules: a profile HMM-based hybrid error correction algorithm for long reads.

Authors:  Can Firtina; Ziv Bar-Joseph; Can Alkan; A Ercument Cicek
Journal:  Nucleic Acids Res       Date:  2018-11-30       Impact factor: 16.971

3.  Privacy-preserving genotype imputation in a trusted execution environment.

Authors:  Natnatee Dokmai; Can Kockan; Kaiyuan Zhu; XiaoFeng Wang; S Cenk Sahinalp; Hyunghoon Cho
Journal:  Cell Syst       Date:  2021-08-26       Impact factor: 11.091

Review 4.  Revisiting characteristics of oncogenic extrachromosomal DNA as mobile enhancers on neuroblastoma and glioma cancers.

Authors:  Mohsen Karami Fath; Nastaran Karimfar; Andarz Fazlollahpour Naghibi; Shahriyar Shafa; Melika Ghasemi Shiran; Mehran Ataei; Hossein Dehghanzadeh; Mohsen Nabi Afjadi; Tahereh Ghadiri; Zahra Payandeh; Vahideh Tarhriz
Journal:  Cancer Cell Int       Date:  2022-05-25       Impact factor: 6.429

5.  An improved genome assembly uncovers prolific tandem repeats in Atlantic cod.

Authors:  Ole K Tørresen; Bastiaan Star; Sissel Jentoft; William B Reinar; Harald Grove; Jason R Miller; Brian P Walenz; James Knight; Jenny M Ekholm; Paul Peluso; Rolf B Edvardsen; Ave Tooming-Klunderud; Morten Skage; Sigbjørn Lien; Kjetill S Jakobsen; Alexander J Nederbragt
Journal:  BMC Genomics       Date:  2017-01-18       Impact factor: 3.969

Review 6.  Extrachromosomal circular DNA: a new potential role in cancer progression.

Authors:  Tianyi Wang; Haijian Zhang; Youlang Zhou; Jiahai Shi
Journal:  J Transl Med       Date:  2021-06-10       Impact factor: 5.531

7.  Whole-Genome Sequence Accuracy Is Improved by Replication in a Population of Mutagenized Sorghum.

Authors:  Charles Addo-Quaye; Mitch Tuinstra; Nicola Carraro; Clifford Weil; Brian P Dilkes
Journal:  G3 (Bethesda)       Date:  2018-03-02       Impact factor: 3.154

8.  Discovery and genotyping of novel sequence insertions in many sequenced individuals.

Authors:  Pinar Kavak; Yen-Yi Lin; Ibrahim Numanagic; Hossein Asghari; Tunga Güngör; Can Alkan; Faraz Hach
Journal:  Bioinformatics       Date:  2017-07-15       Impact factor: 6.937

9.  EAGLE: Explicit Alternative Genome Likelihood Evaluator.

Authors:  Tony Kuo; Martin C Frith; Jun Sese; Paul Horton
Journal:  BMC Med Genomics       Date:  2018-04-20       Impact factor: 3.063

10.  Metagenomic Composition Analysis of an Ancient Sequenced Polar Bear Jawbone from Svalbard.

Authors:  Diogo Pratas; Morteza Hosseini; Gonçalo Grilo; Armando J Pinho; Raquel M Silva; Tânia Caetano; João Carneiro; Filipe Pereira
Journal:  Genes (Basel)       Date:  2018-09-06       Impact factor: 4.096

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.