| Literature DB >> 35246132 |
Amy Wing-Sze Leung1, Henry Chi-Ming Leung1, Chak-Lim Wong1, Zhen-Xian Zheng1, Wui-Wang Lui1, Ho-Ming Luk2, Ivan Fai-Man Lo2, Ruibang Luo3, Tak-Wah Lam4.
Abstract
BACKGROUND: The application of long-read sequencing using the Oxford Nanopore Technologies (ONT) MinION sequencer is getting more diverse in the medical field. Having a high sequencing error of ONT and limited throughput from a single MinION flowcell, however, limits its applicability for accurate variant detection. Medical exome sequencing (MES) targets clinically significant exon regions, allowing rapid and comprehensive screening of pathogenic variants. By applying MES with MinION sequencing, the technology can achieve a more uniform capture of the target regions, shorter turnaround time, and lower sequencing cost per sample.Entities:
Keywords: Ensemble variant calling; Medical exome sequencing; MinION sequencing; Target enrichment; Third-generation sequencing
Mesh:
Year: 2022 PMID: 35246132 PMCID: PMC8895767 DOI: 10.1186/s12920-022-01190-3
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Fig. 1Overview of the ECNano workflow diagram. The workflow comprises both wet-lab and bioinformatics components and was completed within three days (Day 1 noon to Day 4 noon)
Information on the four variant calling models of Clair used in Clair-ensemble (CE) for variant detection of ECNano data
| Model | Training set | Specifications* |
|---|---|---|
| CE1-3-4 | hg001 + hg003 + hg004 ONT data | Normal depth WGS model |
| CE1-2-2HD-3-4 | hg001 + hg002 + hg002 very high depth (up to 500x) + hg003 + hg004 ONT data | Very high depth and normal depth WGS mixed model |
| CE1-2 | hg001 + hg002 ONT data | Normal depth WGS model |
| CE1-2-3-4 | hg001 + hg002 + hg003 + hg004 ONT data | Normal depth WGS model |
The models were trained with combinations of different ONT WGS datasets (Additional files in Ref. [16]). * Normal depth models were trained with datasets of maximum 168 × DoC, and the model with mixed high depth data included a dataset of maximum 578 × DoC
Fig. 2Read length distribution of an ECNano target captured HG001 library, sequenced using an ONT MinION sequencer with a single flowcell with 10 Gbp throughput
Fig. 3Depth distribution of the ECNano target positions in a sequencing run using standard reference HG001 samples. Positions with 30X coverage or above are considered to be highly confident for variant calling
Fig. 4Sample depth distribution comparison of target regions in three genes: (upper: NF1; middle: AGRN; bottom: BRCA1) in HG001 with ECNano target enrichment protocol
Fig. 5Performance of Clair-ensemble against other existing ONT variant callers at target positions using an ECNano HG001 dataset. The performance was evaluated in terms of overall (top), SNP (middle), INDEL (bottom). Clair-ensemble was performed with both the built-in resampling method and down-sampling with VariantBam. Other tools evaluated included the original Clair with different ONT models (model details described in Table 1); and LongShot for SNP calling
Fig. 6Middle position showing the target variant(s) in 3 patient samples and the alignment of the adjacent positions: patient sample 1 (top) with homozygous C > T SNP in SLURP1; patient sample 2 (middle) with 10-base insertion in BCAP31; and patient sample 3 (bottom) with heterozygous C > T SNP in UROC1