| Literature DB >> 24651575 |
Shengpei Chen1, Sheng Li2, Weiwei Xie3, Xuchao Li3, Chunlei Zhang3, Haojun Jiang2, Jing Zheng3, Xiaoyu Pan3, Hancheng Zheng3, Jia Sophie Liu4, Yongqiang Deng5, Fang Chen3, Hui Jiang3.
Abstract
Ultra-low coverage sequencing (ULCS) is one of the most promising strategies for sequencing based clinical application. These clinical applications, especially prenatal diagnosis, have a strict requirement of turn-around-time; therefore, the application of ULCS is restricted by current high throughput sequencing platforms. Recently, the emergence of rapid sequencing platforms, such as MiSeq and Ion Proton, brings ULCS strategy into a new era. The comparison of their performance could shed lights on their potential application in large-scale clinic trials. In this study, we performed ULCS (<0.1X coverage) on both MiSeq and Ion Proton platforms for 18 spontaneous abortion fetuses carrying aneuploidy and compared their performance on different levels. Overall basic data and GC bias showed no significant difference between these two platforms. We also found the sex and aneuploidy detection indicated 100% sensitivity and 100% specificity on both platforms. Our study generated essential data from these two rapid sequencing platforms, which provides useful reference for later research and potentially accelerates the clinical applications of ULCS.Entities:
Mesh:
Year: 2014 PMID: 24651575 PMCID: PMC3961333 DOI: 10.1371/journal.pone.0092192
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Sequencing data statistics.
| MiSeq | Ion Proton | |||||||
| Raw | 90 K | Raw | 90 K | |||||
| AVERAGE | S.D. | AVERAGE | S.D. | AVERAGE | S.D. | AVERAGE | S.D. | |
| Reads Number | 254,360 | 36,617 | 180,000 | – | 2,184,945 | 611,149 | 90,000 | – |
| Unique Map Reads | 219,624 | 32,237 | 155,513 | 4,128 | 1,901,086 | 526,637 | 78,400 | 1,342 |
| Unique Map Rate(%) | 86.35 | 2.27 | 86.40 | 2.29 | 87.10 | 1.49 | 87.11 | 1.49 |
| Duplication Rate(%) | 0.08 | 0.01 | 0.07 | 0.01 | 10.10 | 0.64 | 0.62 | 0.19 |
| Unique non-duplication Reads | 214,288 | 31,740 | 151,731 | 4,241 | 1,707,684 | 467,757 | 77,916 | 1,336 |
| Unique non-duplication Rate(%) | 84.24 | 2.33 | 84.30 | 2.36 | 78.30 | 1.46 | 86.57 | 1.49 |
*S.D. represents for Standard Deviation.
**90K paired end reads.
Figure 1Characteristics of sequencing GC bias.
A, Coefficient of Variation of sequencing reads from two platforms on each chromosome. This correlation between CVs of UR% from 18 samples (y-axis) and GC-content of each chromosome (x-axis, sorted by GC content) are shown in this figure. The color-code lines represented data generated on MiSeq (blue) and Ion Proton (red) platform, respectively. B, Average RD in each GC content category. The blue line (MiSeq) and red line (Ion Proton) shows the relative depth (RD, y-axis) at each sequencing GC-content 1 Mbp windows (x-axis). The dash lines represent the RD from 0.80 to 1.20.
Figure 2The linear relation between UR% from two platforms.
The green scatters show the chromosome UR% calculated using data generated on Miseq (x-axis) and Ion Proton (y-axis) platform, respectively. The linear regression (black line) indicated a good consistency between these two platforms, with a slope of 0.9753, inception of 0.0047 and R2 of 0.9909.
Figure 3Performance of aneuploidy detection and Sex estimation using 90 K reads from two different platforms.
A, Aneuploidy detection result. The z-scores (y-axis) of all 22 autosomes (x-axis, sorted by GC content) for the 18 SA samples are shown in this figure. The cut-off value was set as ±3 (dash line). Aneuploidies signals detected by these two platforms were color-coded (red for MiSeq and blue for Ion Proton) respectively, and negative signals were labeled in grey. The curve was fitted between the z-scores and chromosomal GC content. B, Sex detection with UR-X% and UR-Y%. Using UR-X% (x-axis) and UR-Y% (y-axis), these 18 samples were grouped into two clusters by Gaussian finite mixture model fitted by EM algorithm. These color-coded (red for MiSeq and blue for Ion Proton) scatters with different shapes were label according to the fetal genders (circle for XX and triangle for XY). Each cluster is grouped according to sex detection.
Aneuploidy Detection results using UCLS.
| Sample | MiSeq | Ion Proton | Standard Karyotype | ||
| Detection | Consistency | Detection | Consistency | ||
| SAA01 | 47,XX +2 | Yes | 47,XX +2 | Yes | 47,XX +2 |
| SAA02 | 47,XY +3 | Yes | 47,XY +3 | Yes | 47,XY +3 |
| SAA03 | 47,XY +4 | Yes | 47,XY +4 | Yes | 47,XY +4 |
| SAA04 | 47,XX +5 | Yes | 47,XX +5 | Yes | 47,XX +5 |
| SAA05 | 47,XX +6 | Yes | 47,XX +6 | Yes | 47,XX +6 |
| SAA06 | 47,XY +7 | Yes | 47,XY +7 | Yes | 47,XY +7 |
| SAA07 | 47,XY +8 | Yes | 47,XY +8 | Yes | 47,XY +8 |
| SAA08 | 47,XX +9 | Yes | 47,XX +9 | Yes | 47,XX +9 |
| SAA09 | 47,XY +11 | Yes | 47,XY +11 | Yes | 47,XY +11 |
| SAA10 | 47,XY +10 | Yes | 47,XY +10 | Yes | 47,XY +10 |
| SAA11 | 47,XX +12 | Yes | 47,XX +12 | Yes | 47,XX +12 |
| SAA12 | 47,XX +13 | Yes | 47,XX +13 | Yes | 47,XX +13 |
| SAB01 | 47,XY +14 | Yes | 47,XY +14 | Yes | 47,XY +14 |
| SAB02 | 47,XX +17 | Yes | 47,XX +17 | Yes | 47,XX +17 |
| SAB03 | 47,XY +18 | Yes | 47,XY +18 | Yes | 47,XY +18 |
| SAB04 | 47,XY +20 | Yes | 47,XY +20 | Yes | 47,XY +20 |
| SAB05 | 47,XY +21 | Yes | 47,XY +21 | Yes | 47,XY +21 |
| SAB06 | 47,XY +22 | Yes | 47,XY +22 | Yes | 47,XY +22 |
*Consistency: We obtain the consistency by comparing the detection result with the standard karyotype.
Main output comparisons of current benchtop sequencing platforms for ULCS strategy.
| Platform | Reads length | #Base production | #Reads production | Cost | Run time | #Sample per run | Cost per sample |
| MiSeq | Paired-end 150 bp | Up to 5.1 Gbp | 17M | $750[ | 24–27 hours | 14–16 | $46–53 |
| Ion Proton | ∼200 bp | Up to 10 Gbp | 80M | $1000 | 3–4 hours | 70–75 | $14–15 |
paired end reads.
Only include the PI chip ($699), reagent for template ($166) and sequencing ($135), pricing from Invitrogen US territory website (http://www.invitrogen.com/, accessed 2 August 2013).
Assuming that we need 1M raw reads per sample, and 10% pooling variance.