| Literature DB >> 35346344 |
Eric Venner1, Donna Muzny2, Joshua D Smith3, Kimberly Walker2, Cynthia L Neben4, Christina M Lockwood5,6, Phillip E Empey7, Ginger A Metcalf2, Chris Kachulis8, Sana Mian9, Anjene Musick9, Heidi L Rehm8,10, Steven Harrison8, Stacey Gabriel8, Richard A Gibbs2, Deborah Nickerson3, Alicia Y Zhou4, Kimberly Doheny11, Bradley Ozenberger9, Scott E Topper4, Niall J Lennon8.
Abstract
BACKGROUND: The All of Us Research Program (AoURP, "the program") is an initiative, sponsored by the National Institutes of Health (NIH), that aims to enroll one million people (or more) across the USA. Through repeated engagement of participants, a research resource is being created to enable a variety of future observational and interventional studies. The program has also committed to genomic data generation and returning important health-related information to participants.Entities:
Mesh:
Year: 2022 PMID: 35346344 PMCID: PMC8962531 DOI: 10.1186/s13073-022-01031-z
Source DB: PubMed Journal: Genome Med ISSN: 1756-994X Impact factor: 11.117
Fig. 1Overview of All of Us Research Program return of genomic results workflow. Participants are enrolled at a variety of locations including mobile sites, hospitals, and walk-in clinics. Samples are sent to the Biobank (Mayo Clinic) where DNA is extracted and stored. Upon indication from the program to proceed, samples are sent to one of three Genome Centers—Baylor College of Medicine (BCM), Broad Institute (BI), or University of Washington (UW)—for whole-genome sequencing. For participants who have consented to return of genomic results, data are forwarded on to the Clinical Validation Laboratories (CVLs) for pharmacogenomics (PGx) analysis and variant interpretation and orthogonal confirmation for Hereditary Disease Risk (HDR) gene pathogenic or likely pathogenic variants. Reports are generated by the Report and Harmonization Platform (Color) and delivered to participants through the Genetic Counseling Resource (Color). QC quality control
Summary of assessments and numbers of specimens included in each study
| Sample cohort | Assessment | |||||||
|---|---|---|---|---|---|---|---|---|
| Reportable range | Accuracy | Precision | Interlab concordance | Limit of detection | Extraction performance | Molecular index performance | Assay fail rate | |
| Blood-derived patient samples | n/a | 376 | 271 | n/a | n/a | n/a | n/a | 30 |
| Blood-derived healthy donor aliquots | 5 | n/a | 84 | 28 | 21 | 15 | 28 | 28 |
| GIAB reference cell lines | 1 | 18 | n/a | n/a | 37 | n/a | 37 | 37 |
| Cell lines with known HDR variants | 30 | 65 | 48 | 48 | n/a | n/a | 30 | 30 |
| CDC GeT-RM PGx cell lines | 68 | 310 | 186 | 186 | n/a | n/a | 164 | 175 |
| Historical blood-derived WGS data | n/a | n/a | n/a | n/a | n/a | n/a | n/a | 25,028 |
This table summarizes the samples that were used for each set of comparisons that comprise this study. Many samples were sequenced in replicate either across the genome centers or within them and these counts reflect replicates, not unique samples
GIAB Genome in a Bottle [15], HDR hereditary disease risk, CDC United States Centers for Disease Control and Prevention, GeT-RM Genetic Testing Reference Materials Coordination Program [16], PGx pharmacogenomics, WGS whole-genome sequencing
Call concordance of PGx alleles using WGS and orthogonal methods in patient samples
| Gene and star allele | Number of clinical samples | Correct calls | Incorrect calls | Correct calls | Incorrect calls | Correct calls | Incorrect calls | Correct calls | Incorrect calls | Correct calls | Incorrect calls | Correct calls | Incorrect calls | Correct calls | Incorrect calls | Overall concordance |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 1/1 | 0 | - | - | - | - | 1/1 | 0 | - | - | - | - | - | - | 2/2 | |
| 40 | 25/25 | 0 | 15/15 | 0 | - | - | 25/25 | 0 | 15/15 | 0 | - | - | - | - | 80/80 | |
| 1 | 1/1 | - | - | - | - | - | 1/1 | - | - | - | - | - | - | - | 2/2 | |
| 48 | 33/33 | 0 | - | - | 15/15 | 0 | 33/33 | - | - | - | - | - | 15/15 | 0 | 96/96 | |
| 3 | 3/3 | 0 | - | - | - | - | 3/3 | - | - | - | - | - | - | - | 3/3 | |
| 1 | - | - | 1/1 | 0 | - | - | - | - | 1/1 | 0 | - | - | - | - | 2/2 | |
| 3 | 3/3 | 0 | - | - | - | - | 3/3 | - | - | - | - | - | - | - | 3/3 | |
| 1 | 1/1 | 0 | - | - | - | - | 1/1 | - | - | - | - | - | - | - | 1/1 | |
| 6 | 3/3 | 0 | 3/3 | 0 | - | - | 3/3 | 0 | 3/3 | 0 | - | - | - | - | 12/12 | |
| 1 | - | - | - | - | 1/1 | 0 | - | - | - | - | - | - | 1/1 | 0 | 2/2 | |
| 2 | - | - | 2/2 | 0 | - | - | - | - | 2/2 | 0 | - | - | - | - | 4/4 | |
| 2 | 2/2 | 0 | 2/2 | - | - | - | - | - | - | - | 4/4 | |||||
| 3 | 3/3 | 0 | 3/3 | - | - | - | - | - | - | - | 6/6 | |||||
| 2 | - | - | 2/2 | 0 | - | - | - | - | 2/2 | 0 | - | - | - | - | 4/4 | |
| 2 | - | - | 2/2 | 0 | - | - | - | - | 2/2 | 0 | - | - | - | - | 4/4 | |
| 3 | 3/3 | 0 | - | - | - | - | 3/3 | 0 | - | - | - | - | - | - | 6/6 | |
| 1 | 1/1 | 0 | - | - | - | - | 1/1 | 0 | - | - | - | - | - | - | 2/2 | |
| 2 | 2/2 | 0 | - | - | - | - | 2/2 | 0 | - | - | - | - | - | - | 4/4 | |
| 5 | 5/5 | 0 | - | - | - | - | 5/5 | 0 | - | - | - | - | - | - | 10/10 | |
| 1 | 1/1 | 0 | - | - | - | - | 1/1 | 0 | - | - | - | - | - | - | 2/2 | |
| 2 | 2/2 | 0 | - | - | - | - | 2/2 | 0 | - | - | - | - | - | - | 4/4 | |
| 2 | 2/2 | 0 | - | - | - | - | 2/2 | 0 | - | - | - | - | - | - | 4/4 | |
| 9 | 8/8 | 0 | - | - | 1/1 | 0 | 8/8 | 0 | - | - | - | - | 1/1 | 0 | 18/18 | |
| 28 | 22/22 | 0 | - | - | 6/6 | 0 | 22/22 | 0 | - | - | - | - | 6/6 | 0 | 56/56 | |
| 4 | 4/4 | 0 | - | - | 4/4 | 0 | - | - | - | - | - | - | 8/8 | |||
| 3 | - | - | - | - | 3/3 | 0 | - | - | - | - | - | - | 3/3 | 0 | 6/6 | |
| 8 | 7/7 | 0 | - | - | 1/1 | 0 | 7/7 | 0 | - | - | - | - | 1/1 | 0 | 16/16 | |
| 6 | 5/5 | 0 | 1/1 | 0 | - | - | 5/5 | 0 | 1/1 | 0 | - | - | - | - | 12/12 | |
| 9 | 9/9 | 0 | - | - | - | - | 9/9 | 0 | - | - | - | - | - | - | 18/18 | |
| 5 | 5/5 | 0 | - | - | - | - | 5/5 | 0 | - | - | - | - | - | - | 10/10 | |
| 3 | 3/3 | 0 | - | - | - | - | 3/3 | 0 | - | - | - | - | - | - | 6/6 | |
| 70 | 37/37 | 0 | 33/33 | 0 | - | - | 37/37 | 0 | 33/33 | 0 | - | - | - | - | 140/140 | |
| 12 | 4/4 | 0 | 8/8 | 0 | - | - | 4/4 | 0 | 8/8 | 0 | - | - | - | - | 24/24 | |
| 1 | 1/1 | 0 | - | - | - | - | 1/1 | 0 | - | - | - | - | - | - | 2/2 | |
| 3 | 2/2 | 0 | - | - | 1/1 | 0 | 2/2 | 0 | - | - | 1/1 | 0 | - | - | 6/6 | |
| 8 | 3/3 | 0 | 5/5 | 0 | - | - | 3/3 | 0 | 5/5 | 0 | - | - | - | - | 16/16 | |
BI Broad Institute, WGS whole-genome sequencing, BCM Baylor College of Medicine, UW University of Washington, Color Color Health, CVL Clinical Validation Laboratory
Fig. 2Timeline of the IDE application process. The requirements for the investigational device exemption (IDE) content were refined through a series of pre-IDE submissions and responses, in-person meetings, and teleconferences with the United States Food and Drug Administration (FDA) over a period of 19 months. AoURP, All of Us Research Program; CCP, Change Control Policy; IC, informed consent; PGx, pharmacogenomics; Pre-Sub, pre-submission; SR, significant risk; Sup, supplement
Accuracy of variant calling across genomic contexts and variant types in patient samples
| Panel+ | Panel–/WGS+ | Panel+/WGS+ | Panel–/WGS– | PPA [95% CI] | NPA [95% CI] | |
|---|---|---|---|---|---|---|
| All compiled | 76 | 201 | 29,475 | 45,847,148 | 99.74% [99.68–99.81%] | 100% [100–100%] |
| P/LP variants only | 0 | 0 | 145 | 28,474,503 | 100% [100–100%] | 100% [100–100%] |
| SNVs only | 23 | 6 | 28,710 | 45,848,108 | 99.92% [99.87–99.97%] | 100% [100–100%] |
| Insertions only | 9 | 8 | 225 | 29,887,204 | 96.20% [94.30–98.00%] | 100% [100–100%] |
| Deletions only | 33 | 12 | 540 | 42,170,158 | 94.24% [92.65–95.83%] | 100% [100–100%] |
| Segmental duplications | 12 | 2 | 2119 | 38,466,267 | 99.44% [99.10–99.78%] | 100% [100–100%] |
| Known pseudogenes | 9 | 74 | 1075 | 45,676,596 | 99.17% [98.60–99.73%] | 100% [100–100%] |
| Low mappability regions | 18 | 12 | 1069 | 36,912,107 | 98.34% [97.84–98.85%] | 100% [100–100%] |
| Low complexity regions | 24 | 69 | 956 | 38,072,735 | 97.55% [96.35–98.75%] | 100% [100–100%] |
| Low GC regions | 6 | 12 | 197 | 14,792,575 | 97.04% [95.87–98.22%] | 100% [100–100%] |
| High GC regions | 0 | 0 | 23 | 3,891,553 | 100% [100–100%] | 100% [100–100%] |
| Heterozygous variants only | 75 | 147 | 18,295 | 45,858,382 | 99.59% [98.48–99.70%] | 100% [100–100%] |
| Homozygous variants only | 1 | 83 | 11,180 | 45,865,561 | 99.99% [99.97–100%] | 100% [100–100%] |
+ indicates that a variant was present on the panel or the corresponding whole genome sequencing (WGS) sample. – indicates that a variant was not present on the panel or the corresponding WGS sample. Subsequent to the investigational device exemption (IDE) submission, additional review indicated that this number should in fact be a total of 21 panel positive, but WGS negative variants distributed across the various subcategories (some variants were missannotated in the original tables). However, to reflect what was actually submitted as part of the IDE, the original number is listed here. The impact on overall performance measures is small. PPA positive percent agreement, CI confidence interval, NPA negative percent agreement, P/LP pathogenic/likely pathogenic, SNVs single-nucleotide variants