| Literature DB >> 26072492 |
Ibrahim Numanagić1, Salem Malikić1, Victoria M Pratt2, Todd C Skaar2, David A Flockhart2, S Cenk Sahinalp1.
Abstract
MOTIVATION: CYP2D6 is highly polymorphic gene which encodes the (CYP2D6) enzyme, involved in the metabolism of 20-25% of all clinically prescribed drugs and other xenobiotics in the human body. CYP2D6 genotyping is recommended prior to treatment decisions involving one or more of the numerous drugs sensitive to CYP2D6 allelic composition. In this context, high-throughput sequencing (HTS) technologies provide a promising time-efficient and cost-effective alternative to currently used genotyping techniques. To achieve accurate interpretation of HTS data, however, one needs to overcome several obstacles such as high sequence similarity and genetic recombinations between CYP2D6 and evolutionarily related pseudogenes CYP2D7 and CYP2D8, high copy number variation among individuals and short read lengths generated by HTS technologies.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26072492 PMCID: PMC4542776 DOI: 10.1093/bioinformatics/btv232
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Five known CYP2D6 gene arrangements. The reference strand of human genome was used in all cases. Various number, including zero, of CYP2D6 copies is allowed within the parenthesis. (a) CYP2D6 non-duplicated arrangement consisting one copy of each of CYP2D6, CYP2D7 and CYP2D8. Purple rectangle represents CYP2D6 untranslated region. This region contains several variations important for the detection of some CYP2D6 alleles; (b) typical CYP2D6 duplication arrangement; (c) the deletion arrangement, indicating the absence of CYP2D6 (denoted as *5 allele); (d) CYP2D6/2D7 fusions (*13 family of alleles) lacking CYP2D7. Variable number of copies of CYP2D6 gene might precede fusion alleles; (e) CYP2D7/2D6 fusion cases with presence of CYP2D7. Variable number of copies of CYP2D6 gene might precede fusion alleles in this case as well
Fig. 2.Graphical representation of the steps employed by our framework
Fig. 3.The coverage of the reads mappable to the CYP2D6 and/or CYP2D7 genes is depicted in grey on the flanking regions of CYP2D8 (blue strip). Only two small 0.5 KB regions on the sides accept CYP2D6 and/or CYP2D7 reads
Fig. 4.The ambiguous case where two genotypes *6C/*34 (yellow) and *2D/*6B (green) are equally likely. Key c.2850 C > T and c.4180 G > C are too distant to be resolved with the currently available HTS data
Cypiripi performance for every group
Correctly identified alleles are shown in green, while incorrect estimates are reported in red colour. In case of mismatches, red colour is also applied to the second column items to pinpoint the problematic allele. Unless otherwise specified, genotypes are given for both maternal and paternal chromosome in the format M/P. For set (d), where both chromosomes have the same allelic combination, we only show content of one chromosome for the sake of brevity. Results for set (d) are still reported for each chromosome separately. For ambiguous cases, all optimal genotypes are reported (e.g. 26th sample).
Cypiripi predictions for real data set
| CEPH 1463 trio dataset | |
|---|---|
| ID | Identified |
| NA12877 (father) | *4M/*4M |
| NA12878 (mother) | *3A/*4M |
| NA12882 (son) | *4M/*4M |
NA12878 predictions are coloured in green due to the fact that they match the highly confident SNP calls from Zook . Since the validated predictions are not available for the other two samples, predictions of their genotypes are coloured black.