| Literature DB >> 35217695 |
Sarah Charnaud1,2, Jacob E Munro1,2, Ivo Mueller1,2, Melanie Bahlo3,4, Lucie Semenec1,2,5, Ramin Mazhari1,2, Jessica Brewster1,2, Caitlin Bourke1,2, Shazia Ruybal-Pesántez1,2,6, Robert James1,2, Dulcie Lautu-Gumal1,2, Harin Karunajeewa1,2.
Abstract
The CYP2D6 enzyme is estimated to metabolize 25% of commonly used pharmaceuticals and is of intense pharmacogenetic interest due to the polymorphic nature of the CYP2D6 gene. Accurate allele typing of CYP2D6 has proved challenging due to frequent copy number variants (CNVs) and paralogous pseudogenes. SNP-arrays, qPCR and short-read sequencing have been employed to interrogate CYP2D6, however these technologies are unable to capture longer range information. Long-read sequencing using the PacBio Single Molecule Real Time (SMRT) sequencing platform has yielded promising results for CYP2D6 allele typing. However, previous studies have been limited in scale and have employed nascent data processing pipelines. We present a robust data processing pipeline "PLASTER" for accurate allele typing of SMRT sequenced amplicons. We demonstrate the pipeline by typing CYP2D6 alleles in a large cohort of 377 Solomon Islanders. This pharmacogenetic method will improve drug safety and efficacy through screening prior to drug administration.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35217695 PMCID: PMC8881578 DOI: 10.1038/s42003-022-03102-8
Source DB: PubMed Journal: Commun Biol ISSN: 2399-3642
Comparison of long-read CYP2D6 typing studies.
| Study | Platform | Num. samples typed | Max. multiplex | Phasing | Chimera removal | Fusion calling | Copy num. | End-to-end pipeline available |
|---|---|---|---|---|---|---|---|---|
| Ammar et al.[ | ONT MinION | 1 | 1 | Yes | No | No | qPCR | No |
| Qiao et al.[ | PacBio RS II | 39 | NP | Noa | No | No | qPCR | No |
| Buermans et al.[ | PacBio RS II | 25 | 24 | Yes | Yes | PCR | PCRb | No |
| Liau et al.[ | ONT GridION | 32 | 24 | Yes | No | No | PCRb | No |
| This study | PacBio Sequel I/II | 365 | 95 | Yes | Yes | PCR + Seq | qPCR | Yes |
NP not provided.
aPhasing performed for a subset of samples using additional allele-specific PCR, not long-read sequencing.
bPresence of duplication and deletion PCR products checked, however precise copy numbers of duplications were not attained.
Assigned CYP2D6 alleles of control samples.
| Sample | Reported allelesa | Assigned alleles | Assigned fusions | Copy num. | |
|---|---|---|---|---|---|
| intron-2 | exon-9 | ||||
| NA07439 | *4×N,*41 | *4×2,*41 | – | 3 | 3 |
| NA10005 | *17,*29 | *17,*29 | – | 2 | 2 |
| NA12244 | *35,*41 | *35,*41 | – | 2 | 2 |
| NA17052 | *1,*1 | *1,*1 | – | 2 | 2 |
| NA17058 | *10,*10 | *10,*10 | CYP2D6-D7 (exon-8) | 4 | 2 |
| NA17203 | *4,*35 | *4,*35 | CYP2D6-D7 (intron-1) | 2 | 2 |
| NA17246 | *4,*35 | *4,*35 | CYP2D6-D7 (intron-1) | 2 | 2 |
| NA17252 | *4,*5 | *4,*5 | – | 1 | 1 |
| NA17280 | *2,*3 | *59,*3 | – | 2 | 2 |
| NA17300 | *1,*6 | *1,*6 | – | 2 | 2 |
aReported consensus allele for Pratt et al.[14].
bDeletion allele (*5) inferred due to copy number 1 result from qPCR.
Fig. 1CYP2D6 intron-2 and exon-9 qPCR results.
a Scatterplot showing raw copy numbers for intron-2 and exon-9 qPCR assays for the Solomon Islands Cohort. Axis margins show histograms overlaid with the density from the Gaussian mixture models used to assign copy number states; b Tukey boxplots of raw copy number for control sample technical replicates repeated over several qPCR runs. Shown are seven technical replicates for all samples/markers except NA10005/Exon-9, NA17300/Exon-9, and NA17246/Intron-2 where only six were available, and NA12244/Intron-2 where only four were available.
Fig. 2Fusion detection by breakpoint clustering.
Violin plots overlaid with jitter plots show the distribution of chimeric/fusion read breakpoints for a set of samples selected for illustrative purposes. Chimeric breakpoints are expected to a have a relatively uniform distribution over the length of the amplicon, while true gene fusions should be concentrated within a narrow region. Fusions are called when a sufficient density of breakpoints is detected (see “Methods” and source code for full details).
Fig. 3Frequency and activity of CYP2D6 alleles in the Solomon Islands cohort.
a Frequency of CYP2D6 alleles identified in the Solomon Islands cohort (n = 355), colored by CYP2D6 functional status from the “CYP2D6 Allele Functionality Table” (PharmGKB). Alleles prefixed with “N” represent novel alleles identified in this study that contain variants that are likely to impact protein function (VEP MODERATE and HIGH impact), full allele definitions are available in Supplementary Data 5. b Assigned metabolizer status of individuals in the Solomon Islands cohort (n = 355). In the case an individual contains one or more alleles with unknown function (“?”), the minimum metabolizer status is calculated based on the alleles present with known function.