| Literature DB >> 34313404 |
Chien-Wei Chiang1, Penyue Zhang2, Macarius Donneyong3, You Chen4, Yu Su5, Lang Li1.
Abstract
Case-control design based high-throughput pharmacoinformatics study using large-scale longitudinal health data is able to detect new adverse drug event (ADEs) signals. Existing control selection approaches for case-control design included the dynamic/super control selection approach. The dynamic/super control selection approach requires all individuals to be evaluated at all ADE case index dates, as the individuals' eligibilities as control depend on ADE/enrollment history. Thus, using large-scale longitudinal health data, the dynamic/super control selection approach requires extraordinarily high computational time. We proposed a random control selection approach in which ADE case index dates were matched by randomly generated control index dates. The random control selection approach does not depend on ADE/enrollment history. It is able to significantly reduce computational time to prepare case-control data sets, as it requires all individuals to be evaluated only once. We compared the performance metrics of all control selection approaches using two large-scale longitudinal health data and a drug-ADE gold standard including 399 drug-ADE pairs. The F-scores for the random control selection approach were between 0.586 and 0.600 compared to between 0.545 and 0.562 for dynamic/super control selection approaches. The random control selection approach was ~ 1000 times faster than dynamic/super control selection approach on preparing case-control data sets. With large-scale longitudinal health data, a case-control design-based pharmacoinformatics study using random control selection is able to generate comparable ADE signals than the existing control selection approaches. The random control selection approach also significantly reduces computational time to prepare the case-control data sets.Entities:
Mesh:
Year: 2021 PMID: 34313404 PMCID: PMC8452297 DOI: 10.1002/psp4.12673
Source DB: PubMed Journal: CPT Pharmacometrics Syst Pharmacol ISSN: 2163-8306
FIGURE 1(a) Algorithm for dynamic/super control selection approach. (b) Algorithm for random control selection approach. ADE, adverse drug event
Pharmacoinformatics approaches: PRR, ROR and IC (a, b, c, and d are the four counts in a two‐by‐two contingency table)
| DPA | Formula | Quantity of estimation and description |
|---|---|---|
| PRR |
|
|
| ROR |
|
|
| IC |
|
|
| By adding 1 on both the numerator and the denominator, infrequent drug‐ADE pairs will have penalized IC values. |
a = count (ADE = Yes and drug = Yes), b = count (ADE = No and drug = Yes), c = count (ADE = Yes and drug = No), and d = count (ADE = No and drug = No).
Abbreviations: IC, information component; PRR, proportional reporting ratio; ROR, reporting odds ratio.
FIGURE 2Precision, recall, and F‐score in MarketScan data analysis. IC, information component; PRR, proportional reporting ratio; ROR, reporting odds ratio
FIGURE 3Performances for 50 independent replications using MarketScan data and acute myocardial infarction as ADE. ADE, adverse drug event; IC, information component; PRR, proportional reporting ratio; ROR, reporting odds ratio
FIGURE 4Actual and projected computation time for random control selection approach and dynamic/super control selection approach using MarketScan data