| Literature DB >> 31206625 |
Seung-Been Lee1, Marsha M Wheeler1, Kenneth E Thummel2,3, Deborah A Nickerson1,3.
Abstract
Variation in the enzymatic activity of pharmacogenes is defined by star alleles (haplotypes) comprised of single-nucleotide variants, small insertion-deletions, and large structural variants. We recently developed Stargazer, a next-generation sequencing-based tool to call star alleles for the clinically important CYP2D6 gene. Here, we present the utility of extending Stargazer to call star alleles for 28 pharmacogenes using whole genome sequencing (WGS) data. We applied Stargazer to WGS data from 70 ethnically diverse samples from the Genetic Testing Reference Materials Coordination Program (GeT-RM). These reference samples were extensively characterized by GeT-RM using multiple pharmacogenetic testing assays. In all 28 genes, Stargazer recalled 100% of star alleles (N = 92) present in GeT-RM's consensus genotypes (N = 1,559). Stargazer also detected star alleles not previously reported by GeT-RM, including complex structural variants. Our results demonstrate that combining WGS data and Stargazer enables automated, accurate, and comprehensive genotyping of pharmacogenes in the human genome.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31206625 PMCID: PMC6896231 DOI: 10.1002/cpt.1552
Source DB: PubMed Journal: Clin Pharmacol Ther ISSN: 0009-9236 Impact factor: 6.875
Star alleles previously reported by GeT‐RM and assessed by Stargazer's analysis of whole genome sequencing data
| Gene | Reference allele | Star alleles found in consensus GeT‐RM genotypes ( | Star alleles only found in nonconsensus GeT‐RM genotypes ( |
|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Structural variant–defined alleles are indicated by “del” (deletion), “dup” (duplication), and “hyb” (hybrid).
GeT‐RM, Genetic Testing Reference Materials Coordination Program.
Figure 1Examples of star alleles with structural variation previously undercalled by Genetic Testing Reference Materials Coordination Program (GeT‐RM). Panels display Stargazer's result for copy number analysis for individual samples (N = 8). Genotypes from GeT‐RM and Stargazer (abbreviated as “G” and “S” for brevity) are also shown, with “()” indicating nonconsensus genotypes. The left and right panels exhibit samples whose structural variant calls are matched and not matched, respectively. Gray dots in each panel indicate the sample's copy number estimates computed from read depth. The navy solid line and the cyan dashed line represent copy number profiles for each haplotype. Thick colored lines represent copy number profiles for different genes for both haplotypes combined. Each panel contains gene names and scaled gene models, in which exons and introns are depicted with colored boxes and black lines, respectively. Reports in the Database of Genomic Variants supported Stargazer's gene deletion calls in NA18942, NA18980, and NA18868.
Star alleles with structural variation previously undercalled by GeT‐RM
| Star allele | Assays |
|
|
|
|
|---|---|---|---|---|---|
|
| [1, 2] | 4 | 2 | 7 | 2 |
|
| [2, 3] | 7 | 0 | 4 | 4 |
|
| [1, 2] | 0 | 32 | 21 | 32 |
|
| [1, 2] | 16 | 18 | 36 | 18 |
GeT‐RM, Genetic Testing Reference Materials Coordination Program.
[1] Affymetrix DMET Plus Array (Affymetrix, Santa Clara, CA); [2] Agena Bioscience iPLEX ADME PGx Pro Panel (Agena Bioscience, San Diego, CA); [3] Agena Bioscience iPLEX ADME CYP2D6 Panel (Agena Bioscience, San Diego, CA).
Star alleles identified by Stargazer's analysis of whole genome sequencing and not previously reported by GeT‐RM
| Gene | Star alleles ( |
|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Structural variant‐defined alleles are indicated by “dup” (duplication) and “hyb” (hybrid).
GeT‐RM, Genetic Testing Reference Materials Coordination Program.
Figure 2Examples of star alleles with structural variation not previously reported by Genetic Testing Reference Materials Coordination Program (GeT‐RM). Panels display Stargazer's results for copy number analysis for individual samples (N = 7). Genotypes from GeT‐RM and Stargazer (abbreviated as “G” and “S” for brevity) are also shown, with “()” indicating nonconsensus genotypes. Gray dots in each panel indicate the sample's copy number estimates computed from read depth. The navy solid line and the cyan dashed line represent copy number profiles for each haplotype. Thick colored lines represent copy number profiles for different genes for both haplotypes combined. Each panel contains gene names and scaled gene models, in which exons and introns are depicted with colored boxes and black lines, respectively. Stargazer's genotype call for HG01190 involving two different structural variants (a CYP2D6 deletion and a CYP2D6/CYP2D7 hybrid) was independently verified previously.19 Reports in the Database of Genomic Variants supported Stargazer's gene duplication calls in NA18861 and NA11908.
New star alleles discovered by Stargazer's analysis of whole genome sequencing data
| Star allele | Description |
|
|
|
|---|---|---|---|---|
|
| Gene duplication of | 0 | 1 | 0 |
|
| Gene duplication in exons 7‐9 (chr10:135350465‐135439323) | 4 | 0 | 0 |
|
| Gene deletion in intron 9 (chr6:160649735‐160654861) | 3 | 0 | 0 |
|
| Gene deletion affecting 3′‐UTR (chr6:160627751‐160638068) | 2 | 0 | 0 |
|
| Gene deletion affecting exons 4‐6 (chr4:69510975‐69528283) | 0 | 0 | 1 |
|
| Nonsense (chr10:96741125A>T; no rsID) | 0 | 1 | 0 |
|
| Nonsense (chr12:21349909C>T; rs183501729) | 0 | 1 | 0 |
|
| Splice site (chr12:21329832G>T; rs77271279) | 2 | 0 | 0 |
|
| In‐frame deletion (chr11:74873754GCACAGAAAA>G; rs72408262) | 0 | 4 | 1 |
Genomic coordinates and nucleotide changes are according to Human Genome version 19.
Figure 3Examples of new star alleles with structural variation. Panels display Stargazer's results for copy number analysis for individual samples (N = 5). Genotypes from Genetic Testing Reference Materials Coordination Program (GeT‐RM) and Stargazer (abbreviated as “G” and “S” for brevity) are also shown, with “()” indicating nonconsensus genotypes. Gray dots in each panel indicate the sample's copy number estimates computed from read depth. The navy solid line and the cyan dashed line represent copy number profiles for each haplotype. Thick colored lines represent copy number profiles for different genes for both haplotypes combined. Each panel contains gene names and scaled gene models, in which exons and introns are depicted with colored boxes and black lines, respectively. CYP2A6*1 + *S6 was identified in only one sample (HG00436) exhibiting both a CYP2A6 deletion (CYP2A6*4) and a duplication in the CYP2A7 paralog (CYP2A6*1 + *S6). Reports in the Database of Genomic Variants supported Stargazer's partial duplication call in NA19143 and partial deletion call in NA19226.