| Literature DB >> 29743278 |
Sian Taylor-Phillips1, Chris Stinton2, Lavinia Ferrante di Ruffano3, Farah Seedat2, Aileen Clarke2, Jonathan J Deeks3,4.
Abstract
OBJECTIVE: To understand whether international differences in recommendations of whether to screen for rare diseases using the newborn blood spot test might in part be explained by use of systematic review methods.Entities:
Mesh:
Year: 2018 PMID: 29743278 PMCID: PMC5941220 DOI: 10.1136/bmj.k1612
Source DB: PubMed Journal: BMJ ISSN: 0959-8138
Criteria for defining whether each country undertook a systematic review for each condition, with examples
| Country | Condition | Systematic review used | Rationale for classification |
|---|---|---|---|
| Netherlands | Carnitine acylcarnitine translocase deficiency | No | No methods given, but likely expert consensus. Section 1.3.4 states “the committee believes that this disease should be classified in Category 1 |
| Denmark | Multiple carboxylase deficiency | No | Section 4 states: “[we] assessed the conditions selected for additional analysis, which was based on a review of original literature including treatment options, screening potential and experience.” |
| Canada | Phenylketonuria | Yes | Section 17 outlined the review methods, and included: source searched (Medlline only), search term (phenylketonuria), and date limit. |
| UK | Long chain 3-hydroxyacyl-CoA dehydrogenase deficiency | Yes | “Chapter 5 provides a methodology for the systematic review.” This included the search strategy, resources searched (electronic databases and reference lists of identified articles), search terms, date limit, language restrictions, and number of reviewers; and the inclusion and exclusion criteria. |
Criteria for defining a systematic review: A: describes two parts of the search strategy (eg, search terms, databases, dates), or B: describes any details of systematic evidence selection after a search (eg, inclusion or exclusion criteria, numbers at abstract and full text sift, PRISMA flow diagram). Each country was defined as having undertaken a systematic review for each condition if either criterion A or B, or both, were met.
The review topic could be about any aspect of screening for the disease under consideration (eg, benefits of early detection through screening, disease prevalence, test accuracy, effects of false positive test results, overdiagnosis or any other harm, clinical course).
Scoring system for assessment of evidence for test accuracy, benefit of early treatment, and overdiagnosis
| Score* | Definition | Examples | ||
|---|---|---|---|---|
| Test accuracy | Benefit of earlier treatment | Overdiagnosis | ||
| 0 | Not considered at all | USA, American College of Medical Genetics recommendation to screen for 3-hydroxy-3-methyglutaric aciduria “Screening test: MSMS [tandem mass spectrometry]. Reported in 1990 [references given]” (the references provided refer to how to undertake testing using MS/MS, but provide no details on test accuracy | USA, argininemia: “Treatment is expected to reduce neurological dysfunction [references given].” References refer to treatment effectiveness not benefit of earlier treatment (after screen detection) over later treatment (after symptomatic detection) | Overdiagnosis not mentioned using any form of wording, including asymptomatic phenotypes, penetrance, and any description of people remaining symptom-free to adulthood |
| 1 | Considered in some way (mentioned in at least one document once) | Netherlands, tyrosinemia type I: “It is possible to make the test specific for tyrosinemia type I and greatly reduce the number of false-positives by also measuring the amount of succinyl acetone in the blood specimen [no reference given]” | USA, congenital hypothyroidism: “Some evidence that early intervention optimizes individual outcomes [no reference given]” | USA, 3-methylcrotonyl-CoA carboxylase deficiency: “since newborn screening with MS/MS [tandem mass spectrometry] began, many individuals have been identified with the analytes associated with the condition but without apparent clinical manifestations [no reference given]” |
| 2 | Measured in some way (at least one study or source cited, and for test accuracy at least some numerical estimate given) or acknowledged that data do not exist yet | New Zealand, economic model of screening for severe combined immunodeficiency: table 4 model assumptions “test sensitivity 0.999, test specificity 0.996 [reference given]” | New Zealand, nomination form for removal of 3-methylcrotonyl-CoA carboxylase deficiency 3MCC from the screening panel: “RCTs [randomised controlled trials] are not possible in newborn metabolic screening due to the low incidence of the disorders, and the time period required to generate a statistically significant number of cases in the screening arm versus the control arm. Case studies suggest screening is not effective in reducing mortality or morbidity [reference given]” | Denmark, biotidinase deficiency: “It is unclear whether asymptomatic children with partial biotinidase deficiency need treatment [references given]” |
| 3 | Investigated using systematic methods of collecting evidence (score if detail two parts of search strategy or any details of evidence selection methods) | Spain, findings of a systematic review of biotidinase: “Therefore sensitivity and specificity of the test is estimated at 100% and 99.994%, respectively. These results are very similar to those presented in Kwon & Farrel, 2000 [reference given]” | Canada, systematic review of cystic fibrosis (CF): “Before any screening program is implemented there should be good evidence that people identified in the presymptomatic phase do better than those in whom a diagnosis is made because of symptoms . . . Several cohort studies of screened and unscreened subjects have suggested that early diagnosis does make a difference. In one study [26] in the Netherlands, 88% of screened children but only 60% of unscreened children were still alive at age 11 years. In an earlier study by the same group, screened children were found to have better clinical scores at age 8 years than did unscreened children with CF, but the differences in chest x-ray films, heights and weights were not statistically significant [reference given]” | France, systematic review of medium-chain acyl-CoA dehydrogenase deficiency: “Screening results in the United States, Germany and Australia have revealed the presence of a relatively frequent mutation which was not found in patients exhibiting clinical symptoms [references given]. Studies in vitro have demonstrated that this mutation is associated with a reduction in the enzymatic activity which may not necessarily have any clinical significance [reference given]” |
| 4 | Systematic review and mention external validity (generalisability to local context) or internal validity (bias or confounding) of evidence or hierarchy of evidence | Spain, systematic review of classic galactosemia: “sensitivity of 100% and a specificity of 99.9% in all programs, although these data should be interpreted with caution in the absence of studies to conduct a verification of negative cases [reference given]” | UK, systematic review of maple syrup urine disease: “Other authors provide shorter case-history approaches to identification of improved clinical outcomes from screen detected patients. These include [references given] all of whom compare small numbers of pre-symptomatically detected versus clinically detected cases but without construction of comparative cohorts” | USA, systematic review of Krabbe disease: “Of the seven high-risk cases detected in New York (Table 6), two were considered EIKD [early infantile Krabbe disease] and referred for HSCT [hematopoietic stem cell transplant] because of their GALC [galactosylceramidase] genotypes and the early signs of neurologic disease. One of these patients was homozygous for the 30-kb deletion mutation, while the other patient was heterozygous for the 30-kb deletion and a novel mutation. Dr. Wenger reports that the five remaining children who screened high risk had genotypes considered to put them at a low risk for early onset of disease. Dr. Caggana and Dr. Orsini state that two of these children were lost to follow-up and three are being followed on a quarterly basis by a neurologist. One of these children is known to be asymptomatic and the other two are assumed to be asymptomatic as Dr. Caggana and Dr. Orsini have not heard otherwise |
| 5 | Systematic review and assessed using formal quality assessment | No examples found | Belgium, systematic review of CF provides full quality assessment of the studies in an appendix, with summary: “The studies performed to support CF NBS [newborn screening] is not as strong as one might expect, knowing that there are still two large randomized trials (RCTs) [that] were designed to evaluate CF NBS. The design of the UK RCT (1985-1989) was substandard and this study was therefore not retained in a recent Cochrane review. The Wisconsin RCT (1985-1995) did have a proper design and demonstrated a significant advantage of CF NBS in the field of nutrition and growth (weight and length). However, in [relation to] lung function, no benefit from CF NBS could be demonstrated” | No examples found |
Scores are cumulative—for example, a score of 3 can only be achieved if meeting all criteria to score 1, 2, and 3.
Fig 1Flow of documents through study. One paper was included from Italy, but no national decisions in the analysis, because one paper that will be used in part to underpin the national decisions has been published, but the national review process is incomplete and recommendations are yet to be made
Fig 2Distribution of scores for evaluating test accuracy, benefits of early versus late detection and treatment, and overdiagnosis. A score of zero indicates that these elements were not considered at all, and 5 indicates that they were assessed using a systematic review with formal quality appraisal
Review methods and decisions for each country
| Country | Proportion of decisions, % (No/total No) | Review scores (No*) | ||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Recommended screening | Used systematic review | Test accuracy | Early detection | Overdiagnosis | ||||||||||||||||||
| 0 | 1 | 2 | 3 | 4 | 5 | 0 | 1 | 2 | 3 | 4 | 5 | 0 | 1 | 2 | 3 | 4 | 5 | |||||
| Australia | 100 (1/1) | 0 (0/1) | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | ||
| Belgium | 14 (1/7) | 14 (1/7) | 0 | 1 | 5 | 1 | 0 | 0 | 0 | 3 | 3 | 0 | 0 | 1 | 4 | 2 | 0 | 1 | 0 | 0 | ||
| Canada | 83 (5/6) | 67 (4/6) | 2 | 0 | 0 | 3 | 1 | 0 | 3 | 0 | 1 | 1 | 1 | 0 | 5 | 0 | 1 | 0 | 0 | 0 | ||
| Denmark | 60 (21/35) | 0 (0/35) | 4 | 3 | 28 | 0 | 0 | 0 | 8 | 16 | 11 | 0 | 0 | 0 | 25 | 9 | 1 | 0 | 0 | 0 | ||
| Finland | 0 (0/7) | 100 (7/7) | 1 | 4 | 1 | 1 | 0 | 0 | 0 | 7 | 0 | 0 | 0 | 0 | 6 | 1 | 0 | 0 | 0 | 0 | ||
| France | 33 (1/3) | 100 (3/3) | 0 | 0 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 2 | 0 | 0 | 2 | 0 | 1 | 0 | 0 | ||
| Germany | 100 (1/1) | 100 (1/1) | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | ||
| Japan | 81 (25/31) | 0 (0/31) | 27 | 2 | 2 | 0 | 0 | 0 | 30 | 0 | 1 | 0 | 0 | 0 | 30 | 1 | 0 | 0 | 0 | 0 | ||
| Netherlands | 55 (29/53) | 0 (0/53) | 21 | 13 | 19 | 0 | 0 | 0 | 18 | 23 | 12 | 0 | 0 | 0 | 48 | 3 | 2 | 0 | 0 | 0 | ||
| New Zealand | 13 (1/8) | 75 (6/8) | 0 | 5 | 2 | 0 | 0 | 0 | 5 | 1 | 2 | 0 | 0 | 0 | 7 | 0 | 1 | 0 | 0 | 0 | ||
| Spain | 41 (11/27) | 100 (27/27) | 6 | 0 | 0 | 20 | 1 | 0 | 3 | 0 | 0 | 22 | 2 | 0 | 10 | 5 | 0 | 12 | 0 | 0 | ||
| UK | 75 (6/8) | 63 (5/8) | 0 | 0 | 3 | 4 | 1 | 0 | 0 | 2 | 2 | 1 | 3 | 0 | 2 | 0 | 1 | 5 | 0 | 0 | ||
| USA | 64 (57/89) | 7 (6/89) | 53 | 15 | 15 | 0 | 6 | 0 | 16 | 19 | 49 | 0 | 5 | 0 | 74 | 3 | 10 | 0 | 2 | 0 | ||
Number of included recommendations with each evidence score.
Fig 3Forest plot of the odds of recommending screening in decisions that included compared with did not include evidence from a systematic review. Overall effect estimate from fixed effects meta-analysis with a 0.1 zero cell correction
Number of reviews recommending screening and no screening by scores for test accuracy, benefit of early detection, and overdiagnosis
| Scores | Recommendation | Proportion recommend screening (%) | Spearman correlation coefficient* (95% CI) | P value | |
|---|---|---|---|---|---|
| Screening | No screening | ||||
| Test accuracy: | |||||
| 0 | 41 | 14 | 75 | −0.17 (−0.33 to −0.01) | 0.04 |
| 1 | 10 | 8 | 56 | ||
| 2 | 27 | 16 | 63 | ||
| 3 | 10 | 11 | 48 | ||
| 4 | 2 | 1 | 67 | ||
| 5 | 0 | 0 | |||
| Benefits of early detection: | |||||
| 0 | 27 | 17 | 61 | −0.06 (−0.22 to 0.11) | 0.51 |
| 1 | 23 | 11 | 68 | ||
| 2 | 30 | 7 | 81 | ||
| 3 | 5 | 13 | 28 | ||
| 4 | 4 | 2 | 67 | ||
| 5 | 1 | 0 | 100 | ||
| Overdiagnosis: | |||||
| 0 | 71 | 34 | 68 | −0.13 (−0.29 to 0.03) | 0.12 |
| 1 | 5 | 5 | 50 | ||
| 2 | 7 | 3 | 70 | ||
| 3 | 7 | 8 | 47 | ||
| 4 | 0 | 0 | |||
| 5 | 0 | 0 | |||
Correlation is between scores and whether screening was recommended (only includes reviews of conditions where at least one review recommended screening and one did not).