Literature DB >> 25903221

Signal detection using change point analysis in postmarket surveillance.

Zhiheng Xu¹, Taha Kass-Hout², Colin Anderson-Smits³, Gerry Gray¹.

Abstract

PURPOSE: Signal detection methods have been used extensively in postmarket surveillance to identify elevated risks of adverse events associated with medical products (drugs, vaccines, and devices). However, current popular disproportionality methods ignore useful information such as trends when the data are aggregated over time for signal detection.
METHODS: In this paper, we applied change point analysis (CPA) to trend analysis of medical products in a spontaneous adverse event reporting system. CPA was used to detect the time point at which statistical properties of a sequence of observations change over time. Two CPA approaches, change in mean and change in variance, were demonstrated by an example using neurostimulator adverse event dataset.
RESULTS: Two significant change points associated with upward trends were detected in June 2008 (n = 20, p < 0.001) and May 2011 (n = 51, p = 0.003). Further investigation confirmed battery issues and expansion of the indication for use could be possible causes for the occurrence of these change points. Two time points showed extremely low number of loss of therapy events, two cases in October 2009 and three in November 2009, which could be the result of reporting issues such as underreporting.
CONCLUSION: As a complimentary tool to current signal detection efforts at FDA, CPA can be used to detect changes in the association between medical products and adverse events over time. Detecting these changes could be critical for public health regulation, adverse events surveillance, product recalls, and regulators' understanding of the connection between adverse events and other events regarding regulated products.

Entities: Chemical Disease Species

Keywords: adverse event; change point analysis; pharmacoepidemiology; signal detection

Mesh：

Year: 2015 PMID： 25903221 PMCID： PMC4690504 DOI： 10.1002/pds.3783

Source DB: PubMed Journal: Pharmacoepidemiol Drug Saf ISSN： 1053-8569 Impact factor: 2.890

Introduction

Americans rely on the Food and Drug Administration (FDA) to keep their food and medical products safe and effective. During the approval process for a medical product, such as a vaccine, drug, or medical device, manufacturers conduct rigorous analytical studies or clinical trials, and FDA carries out a thorough pre-market review to evaluate the product’s efficacy and safety performance. However, in their submissions to the FDA, sponsors have only tested their products on a limited number of patients from the population in which the product will ultimately be used. Therefore, it is possible that some rare adverse events from patients in the product’s intended population may not be detected before the product goes to the market. In addition, in some cases, the product may change over time as newer iterations of the device are introduced, the product is used off label, or the indication for use is modified. Therefore, it is important to monitor medical products during the post-approval phase to detect emergent adverse events. To this end, FDA maintains several spontaneous adverse events reporting systems, such as FDA Adverse Event Reporting System (FAERS) for drug and biological products and Manufacturer and User Device Experience (MAUDE) for medical devices. The spontaneous adverse event reporting systems continuously generate large volumes of data. For example, FAERS contains approximately nine million reports and currently receives approximately half a million reports per year.1 As a result, it is not practical to manually review all reports to identify adverse event safety “signals”—real changes and differences in underlying event rates. Neither is it realistic to detect subtle signals through manual review. Furthermore, there is additional difficulty in identifying signals when the total number of patients using a certain product is unknown, making the estimation of adverse event rate impossible. Thus, various data mining techniques have been implemented at FDA to detect possible safety signals, which could reveal the association between approved medical products and adverse events caused by these products. The most commonly used data mining techniques at FDA include proportional reporting ratio (PRR) and Multi-item Gamma Poisson Shrinker (MGPS). For a specific adverse event j and drug/device i, PRR is defined as the ratio of conditional probability of adverse event j given drug/device i and conditional probability of adverse event j given all other drugs/devices except drug/device i.2 The purpose of PRR is to demonstrate the extent to which a specific adverse event is associated with that drug/device as compared with other drugs/devices. DuMouchel3 developed the MGPS method by assuming that counts of reports containing drug i and adverse event j follow a Poisson distribution with unknown parameter λ. A mixture of two Gamma distributions is used as the prior distribution for λ. Five parameters are estimated from the entire data matrix, and the posterior distribution of each λ is used to create “shrinkage” estimates, the empirical Bayes geometric mean (EBGM), which is used to rank all cell counts to determine which cells have unusually large observed counts compared with the expected counts. The lower and upper limits of a 90% confidence interval of the EBGM are denoted as EB05 and EB95, respectively. In general, safety signals will be generated if EB05 > 2, which means the observed count for drug/devices i and adverse event j is at least twice the expected ratio relative to all other drugs/devices and events in the database. A signal can be further refined and investigated to see if the EB05 is larger than the expected ratio of specific drugs/devices or a similar class of drugs/devices in the database. Various commercially available software programs can generate PRR and/or EBGM scores (e.g., Empirica Signal™, PVAnalyser™, SAS™, and MASE™). Most disproportionality methods such as PRR and MGPS look at the aggregated data over time. Such methods are designed to detect a proportional increase in events for a particular drug or device as compared with a comparator set of drugs or devices. However, useful information across the timeline is lost when data are aggregated over time. Specifically, identification of trends or changes over time for a particular product may be difficult to detect.4,5 Also, it is not always clear what constitutes the appropriate set of comparator drugs or devices. If the drug or device is the only available treatment in its class, there may not be a meaningful comparator. In other cases, there may be a wide range of similar or not-so-similar products that could be chosen to be included in a comparator set. In this paper, we applied a time-series method, change point analysis (CPA), to trend analysis using data from the spontaneous adverse event reporting system. CPA is a powerful statistical method in determining whether a change has taken place in time series or sequences. It has been demonstrated to be an effective tool in detecting changes in different application areas such as economics, medicine, agriculture, and machine intelligence.6–8 Recently, CPA has been introduced to public health surveillance. Kass-Hout et al.9,10 applied CPA to the active syndromic surveillance data to detect changes in the incidence of emergency department visits due to daily influenza like illness during the H1N1 pandemic. To the best of our knowledge, CPA has not been used in signal detection efforts at FDA. In this study, we intended to explore CPA method as a complimentary tool in detecting safety signals from MAUDE by evaluating benefits of using CPA in detecting adverse event change points in postmarket safety surveillance. Two different CPA approaches—change in mean and change in variance—were used to investigate trends of adverse events related to a neurostimulator.

Method

Data source

Using the FDA MAUDE database, we retrieved adverse events from one specific neurostimulator and aggregated monthly counts for adverse events related to loss of therapy. Loss of therapy could include several types of events, including battery problems, infection, overstimulation, and so on. The data spanned from 2000 to 2012. Figure1 illustrates monthly counts of loss of therapy during the study period.

Figure 1

The time series of number of loss of therapy for neurostimulator and their detected change points. AE, adverse event

CPA methods

The outcome measure was the monthly count of adverse event reports classified as loss of therapy. The detection of a single change point can be posed as a hypothesis test. The null hypothesis, H0, corresponds to no change point, and the alternative hypothesis, Ha, corresponds to a single change point. The current CPA research focused on developing robust algorithms to detect multiple change points on the mean of a sequence of observation data, including binary segmentation,11 segment neighborhoods,12,13 and the Pruned Exact Linear Time.14 Likelihood ratio and cumulative sum (CUSUM) are two widely used test statistics in detecting changes in mean.15 In this paper, we employed Taylor’s nonparametric CPA method, which uses iterative application of CUSUM and bootstrapping methods to detect changes in time-series data.16 This approach is based on the mean-shift model and assumes that residuals are independent and identically distributed with a mean of zero. For time-series data Y with i = 1, …, N, the mean-shift model is written as where μ is the sample average and ε is the residual term ε = Y − μ for the ith observation. To carry out the nonparametric CPA method, we defined the CUSUMs of residuals as S for i = 1, …, N, where the first set S0 = 0 and the remaining sets were calculated as S = S + ε for i = 1, …, N. Note that by construction, because we were subtracting the overall mean, S = 0 as well. If there was no change point, the time series is stationary, and thus a permuted sample of the residuals can be used to construct an instance of the CUSUMs under the null. Repeated permutation samples can be used to provide a null distribution for a test statistic constructed from the CUSUMs. A potential change point in an interval was identified at location m by searching for the maximum absolute CUSUM of residuals, where . As a statistic, we used the maximum absolute CUSUM difference within a given interval Sdiff = Smax − Smin, where and . On the other hand, when the CUSUM of residuals were plotted, a sudden change in direction of the CUSUM indicated a sudden shift or change in the average, and the place where sudden change occurred was defined as change point. The distribution of 1000 Sdiff was used to determine the p-value for the change point as the percentage of Sdiff values, which were greater than from original time-series data.16 In addition to the changes in mean approach, we used another CPA method for detecting changes in variability. This method was motivated by potential non-stationarity of variability of the data where variance was smaller in some sections but larger in other sections.15 Similar to the changes in mean CUSUM approach, change in variance can be implemented using the sum square error (SSE) approach, which was meaningful in operation, simple in calculation, and useful for testing significance. Let SSE(m) be defined as From the analysis of variance, it was known that the sum of the squared distances of points on a line from their mean can be partitioned, when the points were classified into two groups, 1 to m and m + 1 to N, into two within-group sums of squares SSE1 and SSE2 and a between-groups sum of squares SSE.11 The change point was defined at the value of m that minimizes SSE(m), the sum of the two within-group sums of squares. This can be thought of as a modified application of the k-means clustering algorithm, which was used to partition n observations into k clusters with each observation belonging to the cluster with the nearest mean. In this application, the clusters were restricted to retain the time-series nature of the data. The open-source software r has a package called changepoint, which provides a choice of different CPA algorithms in detecting changes in mean and variances. We used cpt.var function in the r package changepoint to implement the change in variance approach. For the change in mean CUSUM CPA method, we have developed publically available codes in r, sas, and stata format. Those codes can be downloaded from our open-access collaboration website for CPA at https://sites.google.com/site/changepointanalysis.

Results

The CUSUM plot for the sample neurostimulator data is shown in Figure2. Figure2(a) shows the CUSUM direction for entire time-series data, while Figure2(b) and (c) displays the CUSUM trend prior and post the first detected change point on June 2008.

Figure 2

(a) Cumulative sum (CUSUM) plot for the sample neurostimulator data; (b) CUSUM plot for the sample neurostimulator data prior to first change point at June 2008; and (c) CUSUM plot for the sample neurostimulator data after first change point at June 2008 Change points detected using CUSUM for the sample neurostimulator data along with their significance levels are listed in Table1. A p-value of 0.05 was used as the cutoff to screen significant change points. The first candidate change point occurred in June 2008, with a permutation test p-value of <0.001.

Table 1

Change points using cumulative sum method based on changes in mean

Detection order	Change point	Count in change point month	p-value
1	June 2008	20	<0.001
2	May 2011	51	0.003
3	December 2006	2	<0.001
4	November 2007	2	0.005
5	February 2007	1	<0.001

Change points using cumulative sum method based on changes in mean Figure1 shows how the data series were split based on the change points. The June 2008 change point is displayed as symbol “1” in Figure1. Then the data were split into two segments in June 2008, and CPA was implemented independently on each of the resulting segments. The second significant change point occurred in May 2011, which is displayed as symbol “2” in Figure1. This segment was further split in May 2011 change point, and CPA was applied, but no further significant change point was detected. For the left segment (prior to June 2008), CPA picked up three additional significant change points in 2006 and 2007. Because the variability of the data showed two different patterns before and after June 2008, we applied a change in variance CPA approach to detect any significant change points caused by the variability of the data (Table2). The change in variance CPA method detected two change points, June 2008, which was also detected by change in mean method, and April 2011, which is 1 month before the change point detected by change in mean method.

Table 2

Change points using change point analysis method based on changes in variance

Change point	Count in change point month
June 2008	20
April 2011	25

Change points using change point analysis method based on changes in variance

Discussion

Change point analysis is a very useful tool for detecting significant changes in the means or variances of a sequence of observed data. For medical products, the detected change points may provide valuable information for postmarket surveillance. In the neurostimulator example we used in this paper, two significant change points were detected—June 2008 and May 2011. A report based on retrospective analysis shows battery, and other device failure issues also occurred during June 2008. In May 2011, an extended indication for use for this device was approved. As a larger population used this device, more adverse events were reported. It is less intuitive as to observe significant change points before 2008 because the number of reports was relatively small and, for most months, there was no report. One of the limitations of the CUSUM approach is that it is based on the identification of change in mean, and thus, its performance could be affected by the actual numerical value of data. The counts of two losses of therapy could be significant change points if there are no events nearby (i.e., n = 2 for December 2006 and November 2007). However, although statistically significant, a count of two may not be sufficient in providing meaningful clinical implication to call for further investigation. Considering there are millions of adverse event reports in FAERS and MAUDE, it would be ideal to reduce the number of change points, which require further action. We also observed several low numbers of reports from October 2009 to January 2010, for example, two for October 2009 and three for November 2009. Because they existed in the elevated reporting period, the unusual pattern may be due to reporting issues such as underreporting. Underreporting of events is a significant problem in the spontaneous adverse event reporting system. Reporting of adverse events and medication errors by healthcare professionals and consumers is voluntary in the USA. As a result, a significant underreporting of adverse events occurs.17 FDA receives some adverse event, and medication error reports directly from healthcare professionals (e.g., physicians, pharmacists, and nurses) and consumers (e.g., patients, family members, and lawyers). Healthcare professionals and consumers may also report adverse events and/or medication errors to product manufacturers. If a manufacturer receives an adverse event report, it is required to send the report to FDA as specified by regulations. Reports received directly by FDA and reports from manufacturers are entered into FAERS or MAUDE. Underreporting may vary according to the type of product, the seriousness of an event, the population using the product, the product’s time on the market, and other factors. It has been estimated that 94% of adverse drug reactions go undetected by spontaneous reporting systems.17 In addition to underreporting, overreporting of events could also be problematic. A relative increase of reporting for a particular event or syndrome of events may be stimulated by publicity or litigation.18 However, such inflated reporting could lead to a biased estimate of safety signals. Factors that can cause bias include newly published safety alerts and recalls, new regulations, reporting incentives, or deterrents. Furthermore, when the report is from patients who are using multiple medical products, it is hard to correctly identify which medical products caused the adverse event. Both change in mean and change in variance CPA approaches showed similar change points in the neurostimulator adverse event dataset. April 2011 can be considered the last time point before counts of loss of therapy went up in May 2011. The CUSUM method treated the beginning of new segment as the change point, while the change in variance CPA approach used the last time point before the change occurred. We would argue that both change points (April and May 2011) represent the same change that occurred in the data. One difference between the two approaches is that change in mean detected multiple significant change points in 2006 and 2007 while change in variance did not. Compared with change in mean, change in variance is less sensitive to the actual numerical value of the data and could be more robust to outliers. The approaches presented in this paper provide complimentary analyses to findings from disproportionality-based methods. Instead of aggregating data to detect differences in relative reporting rates between products, CPA methods focus on detecting changes within a product over time. Both types of analysis can provide signals that would prompt further investigation of potential problems. CPA should be used as the starting point, not the end point to investigate these changes. In some situations, the actual events may start even before the detected change points. In order to determine the underlying cause of those changes, multiple data sources may be used for the investigation. As the volume of time-series data increases, there is a growing need to maintain situational awareness and be able to efficiently and accurately estimate the location of multiple change points. As a complimentary tool to current signal detection efforts at FDA, CPA can be applied to detect changes in the association between medical products (drugs, vaccines, and devices) and adverse events over time. Detecting these changes could be critical for public health regulation, adverse events surveillance, product recalls, and regulators’ understanding of the connection between adverse events and other events regarding regulated products.

Conflict of Interest

The authors declare no conflict of interest. Signal detection using disproportionality methods may ignore useful information when the data are aggregated over time for signal detection. Change point analysis (CPA), a time-series analysis tool, allows the estimation of the point at which statistical properties of a sequence of observations change. CPA can be used to detect changes in mean or in variance. CPA can be applied to detect changes in the association between medical products (drugs, vaccines, and devices) and adverse events over time. CPA can be a complimentary tool to current signal detection efforts at FDA.

Ethics Statement

This research analyzes the publicly available MAUDE data and does not require ethical approval.

9 in total

1. Use of proportional reporting ratios (PRRs) for signal generation from spontaneous adverse drug reaction reports.

Authors: S J Evans; P C Waller; S Davis
Journal: Pharmacoepidemiol Drug Saf Date: 2001 Oct-Nov Impact factor: 2.890

2. A METHOD FOR CLUSTER ANALYSIS.

Authors: A W EDWARDS; L L CAVALLI-SFORZA
Journal: Biometrics Date: 1965-06 Impact factor: 2.571

3. Estimating the extent of reporting to FDA: a case study of statin-associated rhabdomyolysis.

Authors: Mara McAdams; Judy Staffa; Gerald Dal Pan
Journal: Pharmacoepidemiol Drug Saf Date: 2008-03 Impact factor: 2.890

4. Field sampling for the estimation of wireworm populations.

Authors: D J FINNEY
Journal: Biometrics Date: 1946-02 Impact factor: 2.571

5. Algorithms for the optimal identification of segment neighborhoods.

Authors: I E Auger; C E Lawrence
Journal: Bull Math Biol Date: 1989 Impact factor: 1.758

6. A fast Bayesian change point analysis for the segmentation of microarray data.

Authors: Chandra Erdman; John W Emerson
Journal: Bioinformatics Date: 2008-07-29 Impact factor: 6.937

7. Performance of pharmacovigilance signal-detection algorithms for the FDA adverse event reporting system.

Authors: R Harpaz; W DuMouchel; P LePendu; A Bauer-Mehren; P Ryan; N H Shah
Journal: Clin Pharmacol Ther Date: 2013-02-11 Impact factor: 6.875

8. Application of change point analysis to daily influenza-like illness emergency department visits.

Authors: Taha A Kass-Hout; Zhiheng Xu; Paul McMurray; Soyoun Park; David L Buckeridge; John S Brownstein; Lyn Finelli; Samuel L Groseclose
Journal: J Am Med Inform Assoc Date: 2012-07-03 Impact factor: 4.497

Review 9. Under-reporting of adverse drug reactions : a systematic review.

Authors: Lorna Hazell; Saad A W Shakir
Journal: Drug Saf Date: 2006 Impact factor: 5.228

9 in total

3 in total

1. Augmenting aer2vec: Enriching distributed representations of adverse event report data with orthographic and lexical information.

Authors: Xiruo Ding; Justin Mower; Devika Subramanian; Trevor Cohen
Journal: J Biomed Inform Date: 2021-06-08 Impact factor: 8.000

Review 2. Challenges Associated with the Safety Signal Detection Process for Medical Devices.

Authors: Josep Pane; Katia M C Verhamme; Dorian Villegas; Laura Gamez; Irene Rebollo; Miriam C J M Sturkenboom
Journal: Med Devices (Auckl) Date: 2021-02-24

3. Identifying Actionability as a Key Factor for the Adoption of 'Intelligent' Systems for Drug Safety: Lessons Learned from a User-Centred Design Approach.

Authors: George I Gavriilidis; Vlasios K Dimitriadis; Marie-Christine Jaulent; Pantelis Natsiavas
Journal: Drug Saf Date: 2021-10-21 Impact factor: 5.606

3 in total