Literature DB >> 33710328

Comparative analysis of 7 short-read sequencing platforms using the Korean Reference Genome: MGI and Illumina sequencing benchmark for whole-genome sequencing.

Hak-Min Kim1, Sungwon Jeon2,3, Oksung Chung1, Je Hoon Jun1, Hui-Su Kim2, Asta Blazyte2,3, Hwang-Yeol Lee1, Youngseok Yu1, Yun Sung Cho1, Dan M Bolser4, Jong Bhak1,2,3,4,5.   

Abstract

BACKGROUND: DNBSEQ-T7 is a new whole-genome sequencer developed by Complete Genomics and MGI using DNA nanoball and combinatorial probe anchor synthesis technologies to generate short reads at a very large scale-up to 60 human genomes per day. However, it has not been objectively and systematically compared against Illumina short-read sequencers.
FINDINGS: By using the same KOREF sample, the Korean Reference Genome, we have compared 7 sequencing platforms including BGISEQ-500, DNBSEQ-T7, HiSeq2000, HiSeq2500, HiSeq4000, HiSeqX10, and NovaSeq6000. We measured sequencing quality by comparing sequencing statistics (base quality, duplication rate, and random error rate), mapping statistics (mapping rate, depth distribution, and percent GC coverage), and variant statistics (transition/transversion ratio, dbSNP annotation rate, and concordance rate with single-nucleotide polymorphism [SNP] genotyping chip) across the 7 sequencing platforms. We found that MGI platforms showed a higher concordance rate for SNP genotyping than HiSeq2000 and HiSeq4000. The similarity matrix of variant calls confirmed that the 2 MGI platforms have the most similar characteristics to the HiSeq2500 platform.
CONCLUSIONS: Overall, MGI and Illumina sequencing platforms showed comparable levels of sequencing quality, uniformity of coverage, percent GC coverage, and variant accuracy; thus we conclude that the MGI platforms can be used for a wide range of genomics research fields at a lower cost than the Illumina platforms.
© The Author(s) 2021. Published by Oxford University Press GigaScience.

Entities:  

Keywords:  DNBSEQ-T7; sequencing platform comparison; whole-genome sequencing

Year:  2021        PMID: 33710328      PMCID: PMC7953489          DOI: 10.1093/gigascience/giab014

Source DB:  PubMed          Journal:  Gigascience        ISSN: 2047-217X            Impact factor:   6.524


  27 in total

1.  The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data.

Authors:  Aaron McKenna; Matthew Hanna; Eric Banks; Andrey Sivachenko; Kristian Cibulskis; Andrew Kernytsky; Kiran Garimella; David Altshuler; Stacey Gabriel; Mark Daly; Mark A DePristo
Journal:  Genome Res       Date:  2010-07-19       Impact factor: 9.043

Review 2.  Coming of age: ten years of next-generation sequencing technologies.

Authors:  Sara Goodwin; John D McPherson; W Richard McCombie
Journal:  Nat Rev Genet       Date:  2016-05-17       Impact factor: 53.242

3.  A framework for variation discovery and genotyping using next-generation DNA sequencing data.

Authors:  Mark A DePristo; Eric Banks; Ryan Poplin; Kiran V Garimella; Jared R Maguire; Christopher Hartl; Anthony A Philippakis; Guillermo del Angel; Manuel A Rivas; Matt Hanna; Aaron McKenna; Tim J Fennell; Andrew M Kernytsky; Andrey Y Sivachenko; Kristian Cibulskis; Stacey B Gabriel; David Altshuler; Mark J Daly
Journal:  Nat Genet       Date:  2011-04-10       Impact factor: 38.330

4.  An ethnically relevant consensus Korean reference genome is a step towards personal reference genomes.

Authors:  Yun Sung Cho; Hyunho Kim; Hak-Min Kim; Sungwoong Jho; JeHoon Jun; Yong Joo Lee; Kyun Shik Chae; Chang Geun Kim; Sangsoo Kim; Anders Eriksson; Jeremy S Edwards; Semin Lee; Byung Chul Kim; Andrea Manica; Tae-Kwang Oh; George M Church; Jong Bhak
Journal:  Nat Commun       Date:  2016-11-24       Impact factor: 14.919

5.  Germline and somatic variant identification using BGISEQ-500 and HiSeq X Ten whole genome sequencing.

Authors:  Ann-Marie Patch; Katia Nones; Stephen H Kazakoff; Felicity Newell; Scott Wood; Conrad Leonard; Oliver Holmes; Qinying Xu; Venkateswar Addala; Jenette Creaney; Bruce W Robinson; Shujin Fu; Chunyu Geng; Tong Li; Wenwei Zhang; Xinming Liang; Junhua Rao; Jiahao Wang; Mingyu Tian; Yonggang Zhao; Fei Teng; Honglan Gou; Bicheng Yang; Hui Jiang; Feng Mu; John V Pearson; Nicola Waddell
Journal:  PLoS One       Date:  2018-01-10       Impact factor: 3.240

6.  Discovery and genotyping of structural variation from long-read haploid genome sequence data.

Authors:  John Huddleston; Mark J P Chaisson; Karyn Meltz Steinberg; Wes Warren; Kendra Hoekzema; David Gordon; Tina A Graves-Lindsay; Katherine M Munson; Zev N Kronenberg; Laura Vives; Paul Peluso; Matthew Boitano; Chen-Shin Chin; Jonas Korlach; Richard K Wilson; Evan E Eichler
Journal:  Genome Res       Date:  2016-11-28       Impact factor: 9.043

7.  Comparison of the MGISEQ-2000 and Illumina HiSeq 4000 sequencing platforms for RNA sequencing.

Authors:  Sol A Jeon; Jong Lyul Park; Jong-Hwan Kim; Jeong Hwan Kim; Yong Sung Kim; Jin Cheon Kim; Seon-Young Kim
Journal:  Genomics Inform       Date:  2019-09-27

8.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.

Authors:  Mark D Robinson; Davis J McCarthy; Gordon K Smyth
Journal:  Bioinformatics       Date:  2009-11-11       Impact factor: 6.937

9.  The UCSC genome browser and associated tools.

Authors:  Robert M Kuhn; David Haussler; W James Kent
Journal:  Brief Bioinform       Date:  2012-08-20       Impact factor: 11.622

10.  Mining statistically-solid k-mers for accurate NGS error correction.

Authors:  Liang Zhao; Jin Xie; Lin Bai; Wen Chen; Mingju Wang; Zhonglei Zhang; Yiqi Wang; Zhe Zhao; Jinyan Li
Journal:  BMC Genomics       Date:  2018-12-31       Impact factor: 3.969

View more
  5 in total

1.  Genome sequencing data of extended-spectrum beta-lactamase-producing Escherichia coli INF191/17/A isolates of nosocomial infection.

Authors:  Nik Siti Hanifah Nik Ahmad; Khor Bee Yin; Nik Yusnoraini Yusof
Journal:  Data Brief       Date:  2022-06-23

Review 2.  Introduction to the principles and methods underlying the recovery of metagenome-assembled genomes from metagenomic data.

Authors:  Gleb Goussarov; Mohamed Mysara; Peter Vandamme; Rob Van Houdt
Journal:  Microbiologyopen       Date:  2022-06       Impact factor: 3.904

3.  Accelerating Detection of Variants During COVID-19 Surges by Diverse Technological and Public Health Partnerships: A Case Study From Indonesia.

Authors:  Ariel Pradipta; Meutia Ayuputeri Kumaheri; Lilik Duwi Wahyudi; Anindya Pradipta Susanto; Harryyanto Ishaq Agasi; Anuraj H Shankar; Pratiwi Sudarmono
Journal:  Front Genet       Date:  2022-01-28       Impact factor: 4.599

4.  Accuracy benchmark of the GeneMind GenoLab M sequencing platform for WGS and WES analysis.

Authors:  Chaoyang Li; Xue Fan; Xin Guo; Yongfeng Liu; Miao Wang; Xiao Chao Zhao; Ping Wu; Qin Yan; Lei Sun
Journal:  BMC Genomics       Date:  2022-07-22       Impact factor: 4.547

5.  Benchmarking of ATAC Sequencing Data From BGI's Low-Cost DNBSEQ-G400 Instrument for Identification of Open and Occupied Chromatin Regions.

Authors:  Marina Naval-Sanchez; Nikita Deshpande; Minh Tran; Jingyu Zhang; Majid Alhomrani; Walaa Alsanie; Quan Nguyen; Christian M Nefzger
Journal:  Front Mol Biosci       Date:  2022-07-07
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.