| Literature DB >> 31610628 |
Sol A Jeon1,2, Jong Lyul Park3, Jong-Hwan Kim1, Jeong Hwan Kim1, Yong Sung Kim1,2, Jin Cheon Kim4,5, Seon-Young Kim1,3.
Abstract
Currently, Illumina sequencers are the globally leading sequencing platform in the next-generation sequencing market. Recently, MGI Tech launched a series of new sequencers, including the MGISEQ-2000, which promise to deliver high-quality sequencing data faster and at lower prices than Illumina's sequencers. In this study, we compared the performance of two major sequencers (MGISEQ-2000 and HiSeq 4000) to test whether the MGISEQ-2000 sequencer delivers high-quality sequence data as suggested. We performed RNA sequencing of four human colon cancer samples with the two platforms, and compared the sequencing quality and expression values. The data produced from the MGISEQ-2000 and HiSeq 4000 showed high concordance, with Pearson correlation coefficients ranging from 0.98 to 0.99. Various quality control (QC) analyses showed that the MGISEQ-2000 data fulfilled the required QC measures. Our study suggests that the performance of the MGISEQ-2000 is comparable to that of the HiSeq 4000 and that the MGISEQ-2000 can be a useful platform for sequencing.Entities:
Keywords: HiSeq 4000; MGISEQ-2000; benchmarking
Year: 2019 PMID: 31610628 PMCID: PMC6808641 DOI: 10.5808/GI.2019.17.3.e32
Source DB: PubMed Journal: Genomics Inform ISSN: 1598-866X
Summary statistics of sequencing quality
| Total read bases (bp) | Q20 (%) | Q30 (%) | Uniquely mapped reads (%) | |||||
|---|---|---|---|---|---|---|---|---|
| Illumina | MGI | Illumina | MGI | Illumina | MGI | Illumina | MGI | |
| P1 | 7.45×109 | 2.44×1010 | 97.9 | 98.23 | 94.75 | 92.65 | 93.65 | 95.74 |
| P2 | 7.36×109 | 2.46×1010 | 97.87 | 98.26 | 94.67 | 92.85 | 89.8 | 91.8 |
| P3 | 8.71×109 | 2.40×1010 | 97.72 | 98.09 | 94.36 | 92.25 | 93.75 | 96.6 |
| P4 | 9.35×109 | 1.99×1010 | 97.88 | 98.23 | 94.73 | 92.65 | 92.75 | 94.65 |
Fig. 1.High concordance of RNA-seq data produced using the Illumina and MGI platforms as shown by a principal component analysis plot. RNA from the four samples was sequenced using the HiSeq 4000 (blue dots) and MGISEQ-2000 (red dots) sequencers.
Fig. 2.Scatter plots of gene expression values of the four pairs of samples produced using the HiSeq 4000 and MGISEQ-2000 sequencers. Gene expression values are represented as the base 2 logarithm of counts per million (cpm). The Pearson correlation coefficients of the four samples were between 0.98 and 0.99.
Fig. 3.Differentially expressed genes between the two platforms. Genes with larger than two-fold differences were selected from the four pairs of samples. As only one experiment was performed for each platform, no statistical test was applied. The overlap of the differentially expressed genes is shown.