| Literature DB >> 31508243 |
J A Stamm1, D J Carey2, U L Mirshahi2, J P Sugunaraj1, H M Brosius1, M F Murray3, K Manickam4.
Abstract
Doubts have been raised about the value of DNA-based screening for low-prevalence monogenic conditions following reports of testing this approach using available electronic health record (EHR) as the sole phenotyping source. We hypothesized that a better model for EHR-focused examination of DNA-based screening is Cystic Fibrosis (CF) since the diagnosis is proactively sought within the healthcare system. We reviewed CFTR variants in 50,778 exomes. In 24 cases with bi-allelic pathogenic CFTR variants, there were 21 true-positives. We considered three cases "potential" false-positives due to limitations in available EHR phenotype data. This genomic screening exhibited a positive predictive value of 87.5%, negative predictive value of 99.9%, sensitivity of 95.5%, and a specificity of 99.9%. Despite EHR-based phenotyping limitations in three cases, the presence or absence of pathogenic CFTR variants has strong predictive value for CF diagnosis when EHR data is used as the sole phenotyping source. Accurate ascertainment of the predictive value of DNA-based screening requires condition-specific phenotyping beyond available EHR data.Entities:
Keywords: Genetic techniques; Personalized medicine
Year: 2019 PMID: 31508243 PMCID: PMC6726623 DOI: 10.1038/s41525-019-0095-6
Source DB: PubMed Journal: NPJ Genom Med ISSN: 2056-7944 Impact factor: 8.617
Fig. 1EHR-based Genomic Screening for CF. In a database of 50,778 participants, there were 50,754 negatives, namely cases without bi-allelic pathogenic CFTR variants, and 24 positives, namely cases with such variants. Amongst the negatives, eight had EHR data suggesting a CF diagnosis, six of those had no CFTR pathogenic variants, and two had a single pathogenic variant. Open chart review concluded that one of the heterozygotes was a false negative and one was a true negative. The remainder of the negatives, those without bi-allelic CFTR variants or EHR data consistent with CF were also considered true negatives. Open chart review was pursued for all 24 positive cases. The diagnosis of CF was confirmed in 21 cases. In the remaining three cases the diagnosis could not be confirmed due to insufficient available evidence
Details of cases undergoing open chart review
| Patient # | CF Diagnosis | DNA screen positive? (Known pathogenic | Diagnostic Category | Sex | Current age (years) | CF Diagnosis in EHR (age at diagnosis)a | Age of first encounter recorded in Geisinger EHRa | Sweat chloride test (mmol/L) | Clinical genetic testing | Clinical test concordant with WES | Bronchiectasis | Respiratory pseudomonas infection | Pancreatic supplement | Notations |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | Yes | Yes (F508del/F508del) | TP | M | 6 | Yes (0.3) | 0 | 93 | Yes | Yes | ND | ND | Yes | |
| 2 | Yes | Yes (F508del/F508del) | TP | F | 19 | Yes (3) | 3 | 92 | Yes | Yes | ND | ND | Yes | |
| 3 | Yes | Yes (F508del/F508del) | TP | F | 20 | Yes (4) | 4 | 91 | Yes | Yes | Yes | Yes | Yes | |
| 4 | Yes | Yes (F508del/F508del) | TP | F | 21 | Yes (4) | 4 | ND | Yes | Yes | ND | Yes | Yes | Transferred from another center; record of attempted sweat chloride testing reported as QNS |
| 5 | Yes | Yes (F508del/F508del) | TP | M | 27 | Yes (12) | 12 | 112 | Yes | Yes | Yes | Yes | Yes | |
| 6 | Yes | Yes (F508del/F508del) | TP | M | 29 | Yes (14) | 14 | 90 | Yes | Yes | ND | Yes | Yes | |
| 7 | Yes | Yes (F508del/F508del) | TP | F | 30 | Yes (15) | 15 | 82 | Yes | Yes | Yes | Yes | Yes | |
| 8 | Yes | Yes (F508del/F508del) | TP | M | 34 | Yes (29) | 23 | 91 | Yes | Yes | ND | ND | Yes | |
| 9 | Yes | Yes (F508del/F508del) | TP | F | 37 | Yes (18) | 18 | 69 | Yes | Yes | Yes | Yes | Yes | |
| 10 | Yes | Yes (F508del/F508del) | TP | F | 38 | Yes (30) | 30 | ND | Yes | Yes | Yes | Yes | Yes | Transferred from another center; results of sweat chloride testing not available |
| 11 | Yes | Yes (F508del/F508del) | TP | F | 41 | Yes (27) | 27 | 97 | Yes | Yes | Yes | Yes | Yes | |
| 12 | Yes | Yes (F508del/F508del) | TP | M | 45 | Yes (30) | 29 | ND | Yes | Yes | ND | ND | Yes | Transferred from another center; results of sweat chloride testing not available |
| 13 | Yes | Yes (F508del/F508del) | TP | M | 54 | Yes (38) | 38 | 108 | Yes | Yes | Yes | Yes | Yes | |
| 14 | Yes | Yes (F508del/F508del) | TP | F | 55 | Yes (47) | 46 | ND | No | NA | ND | Yes | Yes | Patient’s CF care at outside facility; results of clinical genetic testing and sweat chloride testing not available |
| 15 | Yes | Yes (F508del/Q39X) | TP | M | 37 | Yes (32) | 27 | ND | No | NA | ND | ND | ND | Patient’s CF care at outside facility; results of clinical genetic testing and sweat chloride testing not available |
| 16 | Yes | Yes (F508del/L88fs) | TP | M | 6 | Yes (1) | 0.3 | 73 | Yes | Yes | ND | Yes | Yes | |
| 17 | Yes | Yes (F508del/R117H T5/TG11) | TP | F | 23 | Yes (15) | 7 | ND | Yes | Yes | ND | ND | Yes | No sweat chloride testing done |
| 18 | Yes | Yes (c.489 + 1 G > T/c.3718–2477 C > T) | TP | F | 37 | Yes (19) | 19 | 50 | Yes | Yes | ND | Yes | Yes | |
| 19 | Yes | Yes (R334W/G542X) | TP | F | 22 | Yes (9) | 6 | 103 | Yes | Yes | ND | ND | Yes | |
| 20 | Yes | Yes (F508del/G551D) | TP | F | 29 | Yes (13) | 13 | 90 | Yes | Yes | ND | Yes | Yes | |
| 21 | Yes | Yes (F508del/K710X) | TP | M | 45 | Yes (34) | 33 | 91 | Yes | Yes | ND | ND | Yes | |
| 22 | Yes | No (F508del/ --) | FN | M | 5 | Yes (0.06) | 0 | 66 | Yes | Yes | ND | ND | Yes | Second variant = R1239S, not currently found in CFTR2 or ClinVar databases |
| 23 | No | No (F508del/ --) | TN | F | 7 | Yes (1) | 0.3 | <10, 14 | Yes | Yes | ND | Yes | ND | Likely CRMS; second variant = S1235R, is “not CF-causing” or “benign” in databases |
| 24 | No | Yes (F508del/L206Wb) | FP | M | 62 | No | 46 | ND | No | NA | ND | ND | ND | Unknown; insufficient data |
| 25 | No | Yes (F508del/L206Wb) | FP | M | 74 | No | 60 | ND | No | NA | possible | ND | ND | Possible; insufficient data |
| 26 | No | Yes (F508del/Q1476X) | FP | F | 68 | No | 53 | ND | Yes/NA | No | Yes | Yes | ND | Possible; insufficient data; clinical genetic test in 2009 only reported F508del/− |
CF diagnosis was assigned following open chart review by CF-expert clinicians independent of DNA-screening (see Materials and Methods). DNA-screening was positive or negative based on the presence or absence of bi-allelic pathogenic CFTR variants. DNA-screening positive was determined in 24 out of 26 reviewed cases; DNA-screening negative was determined in 2 out of 26 cases. The screen negative cases came to be reviewed because of the CF diagnostic codes in their EHR. Each case was placed in a diagnostic category, namely: true positive (TP), false positive (FP), true negative (TN), or false negative (FN). Open chart review was done on an additional six cases based on CF-associated ICD data; these charts were not found to have further evidence of CF and there were no pathogenic CFTR variants associated with these cases. CRMS CFTR-related metabolic syndrome, ND no data available, NA not applicable, QNS quantity not sufficient. aAge in years when first recorded in EHR. bCases 24 and 25 found to have same rare pathogen variant (L206W, MAF 0.0003) together with F508del, but the two cases were confirmed through identity by descent to not be genetically related
CFTR variants found in 26 cases that underwent open chart review
| Variant cDNA name | Variant protein name | CFTR2 (8/31/18) | ClinVar Review Statusa | ClinVar link (accessed 9/15/18) | # alleles in compound heterozygous | # alleles in homozygous | # alleles | ||
|---|---|---|---|---|---|---|---|---|---|
| 1 | c.115 C > T | p.Gln39Ter | Q39X | CF causing | Pathogenic *3 |
| 1 | 0 | 1 |
| 2 | c.263 T > G | p.Leu88Ter | L88X | CF causing | Pathogenic *3 |
| 1 | 0 | 1 |
| 3 | c.350 G > A | p.Arg117His | R117H | Varying clinical consequence | Pathogenic *4 |
| 1 | 0 | 1 |
| 4 | c.489 + 1 G > T | NA | NA | CF causing | Pathogenic *4 |
| 1 | 0 | 1 |
| 5 | c.617 T > G | p.Leu206Trp | L206W | CF causing | Pathogenic *3 |
| 2 | 0 | 2 |
| 6 | c.1000 C > T | p.Arg334Trp | R334W | CF causing | Pathogenic *4 |
| 1 | 0 | 1 |
| 7 | c.1521_1523delCTT | p.Phe508del | F508del | CF causing | Pathogenic *4 |
| 10 | 28 | 38 |
| 8 | c.1624G > T | p.Gly542Ter | G542X | CF causing | Pathogenic *4 |
| 1 | 0 | 1 |
| 9 | c.1652G > A | p.Gly551Asp | G551D | CF causing | Pathogenic *4 |
| 1 | 0 | 1 |
| 10 | c.2128 A > T | p.Lys710Ter | K710X | CF causing | Pathogenic *3 |
| 1 | 0 | 1 |
| 11 | c.3705 T > G | p.Ser1235Arg | S1235R | Non CF-causing | Benign |
| 1 | 0 | 1 |
| 12 | c.3717 G > C | p.Arg1239Ser | R1239S | NA | NA | NA | 1 | 0 | 1 |
| 13 | c.3718–2477 C > T | NA | NA | CF causing | Pathogenic *4 |
| 1 | 0 | 1 |
| 14 | c.4426 C > T | p.Gln1476Ter | Q1476X | NA | Pathogenic *2 |
| 1 | 0 | 1 |
|
|
|
| |||||||
Variants #11 (S1235R) and #12 (R1239S) were identified in cases 23 and 22, respectively (see Table 1), in each case forming CFTR compound heterozygotes with F508del; however, neither variant was found as pathogenic variants in either the CFTR2 or ClinVar databases. Variant #14 was missed on clinical testing obtained in 2009 for case 26 (see Table 1). Note that only 42% (5/12) of the pathogenic variants in this table were included in the 2004 ACMG panel #3, 6, 7, 8, 9. The genomic screening used in this manuscript (WES plus microarray) is more sensitive than many of the clinical genetic tests offered in the past. aClinVar review status: *2 = endorsed by two or more submitters providing assertion criteria provided the same interpretation, *3 = endorsed by expert panel, *4 = endorsed by practice guideline
Fig. 2Data concordance in clinical and research CFTR variant results. Twenty of the 22 confirmed CF patients in the cohort also received their care in the Geisinger CF clinical program and had clinical CF genetic testing results available in their EHR. In the 20 cases with DNA data from both sources there was 100% pathogenic variant calling concordance
Screening predictive statistics
| Diagnostic category | ||
|---|---|---|
| ES positive, | 24 (0.05) | |
| CF | TP | 21 |
| No CF | FP | 3a |
| ES negative, | 50,754 (99.95) | |
| No CF | TN | 50,753 |
| CF | FN | 1 |
| Sensitivity, % (95% CI) | 95.5 (77.2–99.9) | |
| Specificity, % (95% CI) | 99.99 (99.98–100.00) | |
| PPV, % (95% CI) | 87.5 (69.2–95.6) | |
| NPV, % (95% CI) | 99.99 (99.98–100.00) | |
| Accuracy, % (95% CI) | 99.99 (99.98–100.00) | |
| Disease prevalence, % (95% CI) | 0.04 (0.03– 0.07) |
Individuals with or without CF are categorized as TP, FP, TN, or FN based on exome sequence variants previously classified as pathogenic. Screening predictive statistics are calculated as described in Methods. ES exome sequence, CF cystic fibrosis, TP true positive, FP false positive, TN true negative, FN false negative, PPV positive predictive value, NPV negative predictive value, 95% CI 95% confidence interval. a“potential” false positive – insufficient EHR data to rule in or rule out diagnosis