| Literature DB >> 35629091 |
Ari Silver1, Gabriel A Lazarin1, Maxwell Silver1, Meghan Miller1, Michael Jansen1, Christine Wechsberg1, Erin Dekanek1, Stav Grossfeld1, Tim Herpel1, Dinura Gunatilake1, Alexander Bisignano1, Malgorzata Jaremko1.
Abstract
DNA-based screening in individuals without known risk factors potentially identifies those who may benefit from genetic counseling, early medical interventions, and/or avoidance of late or missed diagnoses. While not currently in widespread usage, technological advances in genetic analysis overcome barriers to access by enabling less labor-intensive and more cost-efficient means to discover variants of clinical importance. This study describes the technical validation of a 430-gene next-generation sequencing based assay, GeneCompassTM, indicated for the screening of healthy individuals in the areas of actionable health risks, pharmaceutical drug response, and wellness traits. The test includes genes associated with Mendelian disorders and genetic susceptibility loci, encompassing 14 clinical areas and pharmacogenetic variants. The custom-designed target enrichment capture and bioinformatics pipelines interrogate multiple variant types, including single nucleotide variants, insertions/deletions (indels), copy number variants, and functional haplotypes (star alleles), including tandem alleles and structural variants. Validation was performed against reference DNA from three sources: 1000 Genomes Project (n = 3), Coriell biobank (n = 105), and previously molecularly characterized biological specimens: blood (n = 15) and saliva (n = 11). Analytical sensitivity and specificity for single nucleotide variants (SNVs) were 97.57% and 99.99%, respectively, and for indels were 74.57% and 97.34%, respectively. This study demonstrates the validity of an NGS assay for genetic screening and the broadening of access to preventative genomics.Entities:
Keywords: bioinformatics; genetics; genomic screening; next generation sequencing (NGS); population health
Year: 2022 PMID: 35629091 PMCID: PMC9147210 DOI: 10.3390/jpm12050667
Source DB: PubMed Journal: J Pers Med ISSN: 2075-4426
GeneCompass clinical panel content.
| Monogenic Conditions | Susceptibility Loci | Pharmacogenetic Loci | |||
|---|---|---|---|---|---|
| Genes ( | Conditions ( | Genes ( | Conditions ( | Genes ( | |
| Cardiovascular | 90 | 46 | 8 | 4 | 33 |
| Endocrinology | 4 | 2 | 8 | 4 | 35 |
| Gastroenterology | 2 | 2 | 9 | 4 | 5 |
| Hearing | 13 | 5 | 4 | 4 | 0 |
| Immunology | 1 | 1 | 3 | 3 | 47 |
| Infectious Disease | 0 | 0 | 8 | 9 | 18 |
| Metabolic | 14 | 9 | 0 | 0 | 0 |
| Neurology | 25 | 13 | 11 | 7 | 11 |
| Oncology | 62 | 51 | 8 | 6 | 51 |
| Ophthalmology | 10 | 9 | 4 | 1 | 24 |
| Pain Management | 3 | 4 | 2 | 2 | 17 |
| Psychiatry | 0 | 0 | 9 | 5 | 21 |
| Pulmonology | 1 | 1 | 1 | 1 | 5 |
| Reproductive Health | 39 | 21 | 9 | 5 | 40 |
Figure 1Description of multi-variant star alleles. (A) Possible star allele haplotypes at a genomic region. (B) Possible star allele diplotypes at a genomic region.
Figure 2Aldy’s star allele haplotype calling workflow for variants in CYP2D6.
Figure 3Proper nomenclature for HLA typing results.
Comparison of quality metrics in blood versus saliva specimens.
| Sample Type | Uniformity % | Aligned % | Duplicate % | On Target % | Total Reads | Avg. Base Depth | Base Depth |
|---|---|---|---|---|---|---|---|
| Blood ( | 94.33% | 98.33% | 1.49% | 42.60% | 1,338,982.00 | 46.96× | 88.67% |
| Saliva ( | 93.81% | 97.29% | 1.26% | 43.72% | 1,418,766.73 | 51.18× | 88.91% |
1 Averages of the two sample collection types are reported here.
(a) Intra-precision study results; (b) Intra-precision variant concordance.
| (a) | ||||||
|---|---|---|---|---|---|---|
| Sample | Total Reads | Average Depth | % Aligned | % Duplicate Reads | % On-Target | |
| HG00109A | 2,433,884 | 86.15 | 98.64% | 3.82% | 43.57% | |
| HG00109B | 2,119,180 | 106.02 | 99.13% | 3.05% | 61.59% | |
| HG00109C | 2,174,032 | 108.26 | 99.21% | 2.57% | 61.30% | |
| Average | 2,242,365 | 100.14 | 98.99% | 3.15% | 55.49% | |
| HG00428A | 2,675,926 | 95.56 | 98.36% | 4.05% | 43.96% | |
| HG00428B | 2,152,570 | 107.84 | 98.67% | 3.59% | 61.67% | |
| HG00428C | 2,204,744 | 108.02 | 99.17% | 2.50% | 60.31% | |
| Average | 2,344,413 | 104.00 | 98.73% | 3.38% | 55.31% | |
| HG01350A | 2,694,592 | 93.64 | 98.61% | 4.11% | 42.78% | |
| HG01350B | 2,418,592 | 121.57 | 99.08% | 3.72% | 61.88% | |
| HG01350C | 2,739,276 | 130.64 | 98.99% | 2.55% | 58.71% | |
| Average | 2,617,487 | 115.00 | 98.89% | 3.46% | 54.46% | |
|
| ||||||
|
|
|
|
|
|
|
|
|
| 1329 | 65 | 1375 | 1360 | 1361 | 95.33% |
|
| 1426 | 85 | 1476 | 1464 | 1463 | 94.37% |
|
| 1368 | 82 | 1412 | 1404 | 1422 | 94.34% |
|
|
|
|
|
|
|
|
Each 1000 Genomes Project sample was run in triplicate.
(a) Inter-precision study results; (b) Inter-precision variant concordance.
| (a) | ||||||
|---|---|---|---|---|---|---|
| Batch | Sample | Total Reads | Average Depth | % Aligned | % Duplicate | % On-Target |
| Run 1 | HG00109 | 2,240,126 | 116.88 | 99.20% | 3.14% | 64.23% |
| HG00428 | 2,198,956 | 114.36 | 99.22% | 3.08% | 64.02% | |
| HG01350 | 1,939,004 | 100.91 | 99.24% | 2.47% | 64.07% | |
| Average | 2,126,029 | 111.00 | 99.22% | 2.90% | 64.11% | |
| Run 2 | HG00109 | 2,433,884 | 86.15 | 98.64% | 3.82% | 43.57% |
| HG00428 | 2,675,926 | 95.56 | 98.36% | 4.05% | 43.96% | |
| HG01350 | 2,694,592 | 93.64 | 98.61% | 4.11% | 42.78% | |
| Run 2 Average | 2,601,467 | 91.78 | 98.50% | 4.00% | 43.40% | |
| Run 3 | HG00109 | 2,059,476 | 105.68 | 99.22% | 3.06% | 63.17% |
| HG00428 | 2,059,476 | 105.68 | 99.22% | 3.06% | 63.17% | |
| HG01350 | 2,189,582 | 113.96 | 99.03% | 2.82% | 64.07% | |
| Run 3 Average | 2,102,845 | 108.00 | 99.00% | 3.00% | 63.00% | |
|
| ||||||
|
|
|
|
|
|
|
|
|
| 1317 | 52 | 1348 | 1339 | 1339 | 96.20% |
|
| 1418 | 60 | 1448 | 1454 | 1442 | 95.94% |
|
| 1357 | 74 | 1393 | 1399 | 1391 | 94.83% |
|
|
|
|
|
|
|
|
Each 1000 Genomes Project sample was run in triplicate.
Accuracy, sensitivity, and specificity.
| Reportable Regions | All Regions | |||
|---|---|---|---|---|
| SNV | Indel | SNV | Indel | |
| True positives | 10,164 | 689 | 10,164 | 689 |
| True negatives | 366,657 | 7073 | 366,657 | 7073 |
| False positives | 10 | 53 | 53 | 193 |
| False negatives | 54 | 43 | 253 | 235 |
| Accuracy | 99.98% | 98.78% | 99.92% | 94.77% |
| Sensitivity | 99.47% | 94.13% | 97.57% | 74.57% |
| Specificity | 99.99% | 99.26% | 99.99% | 97.34% |
| Positive predictive value | 99.90% | 92.86% | 99.48% | 78.12% |
| Negative predictive value | 99.99% | 99.40% | 99.93% | 96.78% |
Sample Allele Frequency vs. Population Allele Frequency.
| Targeted Category | Gene Count | T-Test Concordance % |
|---|---|---|
| Disease Susceptibility | 62 | 98.80% |
| Wellness | 91 | 99.48% |
| Drug Response | 81 | 99.81% |
Pharmacogenetic haplotypes detected by assay.
| Gene | Haplotypes Detected |
|---|---|
| CYP2B6 | *4, *6, *9, *18, *22, *34, *36 |
| CYP2C19 | *1, *2, *3, *4, *5, *6, *7, *8, *9, *10, *13, *17 |
| CYP2C8 | *1A, *2, *3 |
| CYP2C9 | *1, *2, *3, *4, *5, *6, *7, *8, *9, *10, *11, *12, *13, *15, *16 |
| CYP2D6 1 | *1, *1 × 2, *1 × 3, *1 × 4, *2, *2 × 2, *2 × 3, *2 × 4, *3, *4, *4 × 2, *5, *6, *7, *8, *8 × 2, *9, *9 × 2, *10, *10 × 2, *11, *12, *13, *14, *15, *17, *17 × 2, *21, *29, *29 × 2, *35, *35 × 2, *36, *40, *41, *41 × 2, *45, *49, *50, *54, *55, *56, *59, *61, *61 × 2, *62, *63, *65, *68, *68 × 2, *69, *72, *74, *82, *83, *83 × 2, *84, *96, *109, *114 |
| CYP3A4 | *1, *2, *13, *15, *22 |
| CYP3A5 | *1, *2, *3, *3B, *6, *7 |
| DPYD | *1, *2A, *6, *9A, *13, rs67376798 |
| NAT2 | *4, *5, *6, *7, *14 |
| NUDT15 | *1, *2, *3, *6, *9 |
| SLCO1B1 | *1A, *1B, *5, *15, *17, *21 |
| TPMT | *1, *2, *3A, *3B, *3C, *4, *8, *24 |
| UGT1A1 | *1, *6, *28, *36, *37, *80 |
1 Additional structural variants may be detected, including hybrid and tandem alleles.
| Sample | Concordant | Discordant | Run 1 Variants | Run 2 Variants | Run 3 Variants | Concordance % |
|---|---|---|---|---|---|---|
| 1317 | 52 | 1348 | 1339 | 1339 | 96.20% | |
| 1418 | 60 | 1448 | 1454 | 1442 | 95.94% | |
| 1357 | 74 | 1393 | 1399 | 1391 | 94.83% | |
1 Each 1000 Genomes Project sample was run in triplicate.