| Literature DB >> 26883533 |
Thomas W Laver1, Richard C Caswell1, Karen A Moore2, Jeremie Poschmann2, Matthew B Johnson1, Martina M Owens3, Sian Ellard1, Konrad H Paszkiewicz2, Michael N Weedon1.
Abstract
The long-read sequencers from Pacific Bioscience (PacBio) and Oxford Nanopore Technologies (ONT) offer the opportunity to phase mutations multiple kilobases apart directly from sequencing reads. In this study, we used long-range PCR with ONT and PacBio sequencing to phase two variants 9 kb apart in the RET gene. We also re-analysed data from a recent paper which had apparently successfully used ONT to phase clinically important haplotypes at the CYP2D6 and HLA loci. From these analyses, we demonstrate PCR-chimera formation during PCR amplification and reference alignment bias are pitfalls that need to be considered when attempting to phase variants using amplicon-based long-read sequencing technologies. These methodological pitfalls need to be avoided if the opportunities provided by long-read sequencers are to be fully exploited.Entities:
Mesh:
Substances:
Year: 2016 PMID: 26883533 PMCID: PMC4756330 DOI: 10.1038/srep21746
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Graphical representation of the RET gene and the targeted mutations, including primer positions.
Image adapted from UCSC genome browser24.
Sequencing data statistics.
| Total reads analysed | Reads aligned | Error rate (%) | Reads aligned to both snps | |
|---|---|---|---|---|
| MinION1 | 6915 | 1812 | 37.6 | 111 |
| MinION2 | 4365 | 1833 | 34.2 | 81 |
| PacBio1 | 3459 | 2885 | 21.4* | 261 |
| PacBio C29 | 41244 | 20602 | 3.97* | 15200 |
| PacBio C34 | 54294 | 27147 | 3.74* | 21945 |
| PacBio C39 | 26722 | 13361 | 3.85* | 11683 |
*These are based on ROI (read of insert) error rates.
RET haplotypes.
| CC (ref/ref) | CT (ref/alt) | AC (alt/ref) | AT (alt/alt) | Total other haplotypes | |
|---|---|---|---|---|---|
| MinION1 | 36 (32.4%) | 25 (22.5%) | 16 (14.4%) | 3 (2.7%) | 31 (27.9%) |
| MinION2 | 21 (25.9%) | 21 (25.9%) | 6 (7.4%) | 7 (8.6%) | 26 (32.1%) |
| PacBio1 | 84 (32.2%) | 67 (25.7%) | 59 (22.6%) | 39 (14.9%) | 12 (4.6%) |
RET haplotypes with reference bases reversed.
| CC (alt/alt) | CT (alt/ref) | AC (ref/alt) | AT (ref/ref) | Total other haplotypes | |
|---|---|---|---|---|---|
| MinION1 | 2 (1.8%) | 22 (19.8%) | 14 (12.6%) | 34 (30.6%) | 39 (35.1%) |
| MinION2 | 3 (4.9%) | 16 (26.2%) | 8 (13.1%) | 13 (21.3%) | 21 (34.4%) |
| PacBio1 | 28 (12.6%) | 54 (24.3%) | 58 (26.1%) | 74 (33.3%) | 8 (3.6%) |
Figure 2Distance between SNPs within the sequenced region versus abundance of the 3rd and 4th (chimeric) haplotypes.
The line shows a positive linear correlation between the distance apart of the SNPs and the abundance of the 3rd and 4th haplotypes.
Ammar et al. CYP2D6 haplotypes.
| CT (ref/ref) | C- (ref/alt) | TT (alt/ref) | T- (alt/alt) | Total other haplotypes | |
|---|---|---|---|---|---|
| Original reference | 86 (33.2%) | 61 (23.6%) | 52 (20.1%) | 6 (2.3%) | 54 (20.9%) |
Ammar et al. CYP2D6 haplotypes with reference bases reversed.
| CT (alt/alt) | C- (alt/ref) | TT (ref/alt) | T- (ref/ref) | Total other haplotypes | |
|---|---|---|---|---|---|
| Reversed reference | 1 (0.4%) | 42 (18.0%) | 44 (18.9%) | 87 (37.3%) | 59 (25.3%) |
RET reduced PCR haplotypes.
| CC (ref/ref) | CT (ref/alt) | AC (alt/ref) | AT (alt/alt) | Total other haplotypes | |
|---|---|---|---|---|---|
| PacBio C29 | 632 (4.2%) | 7104 (46.7%) | 7021 (46.2%) | 356 (2.3%) | 87 (0.6%) |
| PacBio C34 | 3860 (17.6%) | 7073 (32.2%) | 7556 (34.4%) | 3284 (15.0%) | 172 (0.8%) |
| PacBio C39 | 2552 (21.8%) | 3326 (28.5%) | 3424 (29.3%) | 2281 (19.5%) | 100 (0.9%) |