Literature DB >> 32323626

A multinomial quadrivariate D-vine copula mixed model for meta-analysis of diagnostic studies in the presence of non-evaluable subjects.

Aristidis K Nikoloulopoulos1.   

Abstract

Diagnostic test accuracy studies observe the result of a gold standard procedure that defines the presence or absence of a disease and the result of a diagnostic test. They typically report the number of true positives, false positives, true negatives and false negatives. However, diagnostic test outcomes can also be either non-evaluable positives or non-evaluable negatives. We propose a novel model for the meta-analysis of diagnostic studies in the presence of non-evaluable outcomes, which assumes independent multinomial distributions for the true and non-evaluable positives, and, the true and non-evaluable negatives, conditional on the latent sensitivity, specificity, probability of non-evaluable positives and probability of non-evaluable negatives in each study. For the random effects distribution of the latent proportions, we employ a drawable vine copula that can successively model the dependence in the joint tails. Our methodology is demonstrated with an extensive simulation study and applied to data from diagnostic accuracy studies of coronary computed tomography angiography for the detection of coronary artery disease. The comparison of our method with the existing approaches yields findings in the real data application that change the current conclusions.

Entities:  

Keywords:  Diagnostic tests; multivariate meta-analysis; sensitivity; specificity; summary receiver operating characteristic curves

Mesh:

Year:  2020        PMID: 32323626      PMCID: PMC7682507          DOI: 10.1177/0962280220913898

Source DB:  PubMed          Journal:  Stat Methods Med Res        ISSN: 0962-2802            Impact factor:   3.021


1 Introduction

Diagnostic test accuracy studies observe the result of a gold standard procedure that defines the presence or absence of a disease and the result of a diagnostic test. They typically report the number of true positives (diseased subjects correctly diagnosed), false positives (non-diseased subjects incorrectly diagnosed as diseased), true negatives (non-diseased subjects correctly diagnosed as non-diseased) and false negatives (diseased subjects incorrectly diagnosed as non-diseased). However, diagnostic test outcomes can be non-evaluable.[1] This is the case for coronary computed tomography (CT) angiography studies which have non-evaluable results of index text in various ways such as when transferring a segment/vessel to a patient based evaluation.[2] Synthesis of diagnostic test accuracy studies is the most common medical application of multivariate meta-analysis.[3,4] The purpose of a meta-analysis of diagnostic test accuracy studies is to combine information over different studies and provide an integrated analysis that will have more statistical power to detect an accurate diagnostic test than an analysis based on a single study. Nevertheless, the existence of non-evaluable subjects is an important issue that could lead to biased meta-analytic estimates of index test accuracy.[2,5,6] Schuetz et al.[2] studied different ad-hoc approaches dealing with diagnostic test non-evaluable subjects, such as non-evaluable subjects are excluded from the study, non-evaluable positives (non-evaluable diseased subjects) are taken as true positives and non-evaluable negatives (non-evaluable non-diseased subjects) are taken as false positives, non-evaluable positives are taken as false negatives and non-evaluable negatives are taken as true negatives, and non-evaluable positives as false negatives and non-evaluable negatives as false positives. In all of these approaches, Schuetz et al.[2] used the bivariate generalized linear mixed model (BGLMM)[7] and concluded that excluding the index test non-evaluable subjects leads to overestimation of the meta-analytic estimates of sensitivity and specificity and recommended the intent-to-diagnose approach by treating non-evaluable positives as false negatives and non-evaluable negatives as false positives. Ma et al.[5] used a trivariate generalized mixed model (TGLMM) approach by treating the non-evaluable subjects as missing data under a missing at random assumption (MAR). Ma et al.,[5] with extensive simulation studies, showed that the intent-to-diagnose approach[2] under-estimates both meta-analytic estimates of sensitivity and specificity, while the TGLMM approach under the MAR assumption gives nearly unbiased estimates of sensitivity, specificity and prevalence. Nikoloulopoulos,[6] similar to Ma et al.,[5] extended the vine copula mixed model for trivariate meta-analysis of diagnostic test accuracy studies accounting for disease prevalence[8] to additionally account for non-evaluable subjects. The extended trivariate vine copula mixed model includes the extended TGLMM as a special case and can also model sensitivity, specificity and prevalence on the original scale. Nikoloulopoulos[6] demonstrated that the extended TGLMM leads to biased meta-analytic estimates of sensitivity, specificity and prevalence when the univariate random effects are misspecified and that the extended vine copula mixed model gives nearly unbiased estimates of test accuracy indices and disease prevalence. A recurrent theme underlying the methodologies of Ma et al.[5] and Nikoloulopoulos[6] is the need to make the MAR assumption that cannot be verified based on the observed data. Hence, it is natural to be concerned about robustness or sensitivity of inferences to departures from the MAR assumption. The within-study model assumes that the number of true negatives, false negatives, false positives, true positives, non-evaluable negatives and non-evaluable positives are multinomially distributed, given the latent (random) vector of sensitivity, specificity, disease prevalence, probability of non-evaluable positives and probability of non-evaluable negatives. Under the MAR assumption, the multinomial probability mass function (pmf) decomposes into a product of independent binomial pmfs given the random effects. Hence, the within-study model actually assumes that the number of true negatives, number of true positives, number of diseased subjects, number of non-evaluable negatives and number of non-evaluable positives are conditionally independent and binomially distributed given the random effects. The triplet of latent sensitivity, specificity and prevalence are independent of the missing probabilities, hence the joint likelihood factors into two components, one involving only the sensitivity, specificity and disease prevalence, and the other involving only the probabilities of non-evaluable positives and non-evaluable negatives. Therefore, the methodology of Chu et al.[9] or Nikoloulopoulos[8] is applied to the first likelihood component to infer about the sensitivity, specificity and disease prevalence. Hence, the models in Ma et al.[5] and Nikoloulopoulos[6] extend the BGLMM[7] and the bivariate vine copula mixed model,[10] respectively, to the trivariate case by adding the disease prevalence as a third outcome to indirectly account for the non-evaluable results. On the one hand, the number of diseased subjects are binomially distributed with probability of success the latent prevalence and a support that includes the number of non-evaluable positives and the number of non-evaluable negatives, but the true positives and true negatives are binomially distributed with probability of success the latent sensitivity and specificity, respectively, and a support that does not include either the number of non-evaluable positives or the number of non-evaluable negatives on the other, just like in the BGLMM[7] and the bivariate vine copula mixed model.[10] Note in passing that a special case of the bivariate copula mixed model is the BGLMM, that is, a copula mixed model composed of a bivariate normal (BVN) copula with normal margins. In this paper, in order to remedy this situation of ignoring the non-evaluable subjects in the derivation of the meta-analytic estimates of sensitivity and specificity, we include the number of non-evaluable positives and the number of non-evaluable negatives as separate non-missing response categories. Interestingly, the proposed model extends the bivariate copula mixed model[10] to the quadrivariate case by directly adding the number of non-evaluable positives and number of non-evaluable negatives as a third and fourth outcome, respectively. Hence, it directly utilizes all the available data. The bivariate copula mixed model[10] assumes independent binomial distributions for the true positives and true negatives, conditional on the latent pair of sensitivity and specificity in each study. In the proposed methodology for the meta-analysis of diagnostic tests where we additionally account for non-evaluable outcomes of the diagnostic test, we will assume independent multinomial distributions for the true and non-evaluable positives, and, the true and non-evaluable negatives, conditional on the latent sensitivity, specificity, probability of non-evaluable positives and probability of non-evaluable negatives in each study. For the random effects distribution, we employ a regular vine copula.[11] Regular vine copulas are suitable for high-dimensional data, hence given the low dimension d = 4, where d is the dimension, we use their boundary case namely a drawable vine (D-vine) copula. D-vine copulas have become important in many applications areas such as finance[12,13] and biological sciences,[14,15] to just name a few, in order to deal with dependence in the joint tails. Another boundary case of regular vine copulas is the canonical vine copula, but this parametric family of copulas is only suitable if there exists a (pilot) variable that drives the dependence among the variables,[16,17] which apparently is not the case in this application area. The remainder of the paper proceeds as follows. Section 2 introduces the multinomial quadrivariate D-vine copula mixed model for meta-analysis of diagnostic studies accounting for non-evaluable results and provides computational details for maximum likelihood (ML) estimation. Section 3 studies the small-sample efficiency and robustness of the ML estimation of the multinomial quadrivariate D-vine copula mixed model. Section 4 applies our methodology to data from a published meta-analysis for diagnostic accuracy studies of coronary computed tomography angiography for the detection of coronary artery disease. We conclude with some discussion in Section 5, followed by a brief section with software details.

2 The multinomial quadrivariate D-vine copula mixed model

In this section, we introduce the multinomial quadrivariate D-vine copula mixed model. In Subsections 2.1 and 2.2, a D-vine copula representation of the random effects distribution with normal and beta margins, respectively, is presented. We complete this section with details on maximum likelihood estimation.

2.1 The multinomial quadrivariate D-vine copula mixed model with normal margins

We first introduce the notation used in this paper. The data are , where i is an index for the individual studies, j is an index for the test outcome (0: negative; 1: positive; 2: non-evaluable) and k is an index for the disease outcome (0: non-diseased; 1: diseased). The “classic” 2 × 2 table is extended to a 3 × 2 table (Table 1). Each cell in Table 1 provides the cell frequency corresponding to a combination of index test and disease outcomes in study i.
Table 1.

Data from an individual study in a 3 × 2 table.


Disease (by gold standard)
Test+Total
yi00 yi01 yi0+
+ yi10 yi11 yi1+
Non-evaluable yi20 yi21 yi2+
Total yi+0 yi+1 yi++
Data from an individual study in a 3 × 2 table. The diseased subjects have three possible states: false negative, true positive, and non-evaluable positive. The multinomial observation is therefore the number of diseased subjects where the diagnostic test is in each of its states. Hence, we assume that the false negatives , the true positives , and the non-evaluable positives are multinomially distributed given , viz. where (X1, X3) is the bivariate latent pair of transformed sensitivity and probability of non-evaluable positives and is the inverse multinomial logit link. Note that is shorthand notation for the multinomial distribution, where T is the number of cells, n is the number of observations, and with is the T-dimensional vector of success probabilities. In a similar manner, the non-diseased subjects have also three possible states: true negative, false positive, and non-evaluable negative. Hence, we assume that the true negatives , the false positives , and the non-evaluable negatives are multinomially distributed given , viz. where (X2, X4) is the bivariate latent pair of transformed specificity and probability of non-evaluable negatives. After defining the within-studies model in equations (1) and (2), we next define the between-studies model. The stochastic representation of the between studies model takes the form where is a quadrivariate D-vine copula with dependence parameter vector and is the cumulative distribution function (cdf) of the N() distribution, and is the multinomial logit link. The copula parameter vector has parameters of the random effects model and they are separated from the univariate parameters . The parameters π1 and π2 are those of actual interest denoting the meta-analytic parameters for the sensitivity and specificity, while the parameters π3 and π4 denote the probabilities of non-evaluable positives and negatives, respectively. The univariate parameters denote the variabilities of the random effects. The quadrivariate D-vine copula is built via successive mixing from bivariate pair-copulas on different levels. The pairs at level 1 are , for j = 1, 2, 3, and for level (), the (conditional) pairs are for . That is, for the four-dimensional D-vine, the copulas for variables j and given the variables indexed in between capture the conditional dependence.[13] When all the bivariate pair-copulas are BVN copulas with correlation (copula) parameters (1st level) and partial correlation parameters (2nd and 3rd level), the resulting distribution is the quadrivariate normal with mean vector and variance covariance matrix where , , , , and .[15] Other choices of copulas are better if there is more dependence in joint upper or lower tail. The models in equations (1)–(3) together specify a multinomial quadrivariate D-vine copula mixed model with joint likelihood where is the pmf and is the quadrivariate D-vine density, viz. with where and is shorthand notation for the density and cdf of the distribution, are bivariate copula densities, , and ; are bivariate copula cdfs. Note that a for a four-dimensional D-vine copula density there are 12 different decompositions.[12] To be concrete in the exposition of the theory, we use the decomposition in equation (4); the theory though also applies to the other 11 decompositions. Below we transform the original integral into an integral over a unit hypercube using the inversion method. Hence, the joint likelihood becomes

2.2 The multinomial D-vine copula mixed model with beta margins

In this section, we use the parametrization proposed by Wilson[18] in order the latent sensitivity and specificity to remain on the original scale. The within-study model takes the form The stochastic representation of the between studies model is where is a D-vine copula with dependence parameter vector and is the cdf of the Beta() distribution with π the mean and γ the dispersion parameter. The copula parameter vector has the dependence parameters of the random effects model and they are separated from the univariate parameters . The parameters π1 and π2 are those of actual interest denoting the meta-analytic parameters for the sensitivity and specificity, while the parameters π3 and π4 denote the probabilities of non-evaluable positives and negatives, respectively. The univariate parameters denote the variabilities of the random effects. In contrast with the model in the preceding subsection, the random effects of sensitivity and specificity are on the original scale. The models in equations (5) and (6) together specify a vine copula mixed model with joint likelihood where is as in equation (4) where we use beta instead of normal marginal distributions. Below we transform the integral into an integral over a unit hypercube using the inversion method. Hence, the joint likelihood becomes

2.3 Maximum likelihood estimation and computational details

Estimation of the model parameters can be approached by the standard maximum likelihood (ML) method, by maximizing the logarithm of the joint likelihood. The estimated parameters can be obtained by using a quasi-Newton[19] method applied to the logarithm of the joint likelihood. This numerical method requires only the objective function, i.e. the logarithm of the joint likelihood, while the gradients are computed numerically and the Hessian matrix of the second-order derivatives is updated in each iteration. The standard errors (SEs) of the ML estimates can be also obtained via the gradients and the Hessian computed numerically during the maximization process. For the multinomial quadrivariate D-vine copula mixed model, numerical evaluation of the joint pmf can be achieved with the following steps: Calculate Gauss-Legendre[20] quadrature points and weights in terms of standard uniform. Convert from independent uniform random variables , and to dependent uniform random variables , and that have a D-vine distribution using the algorithm in Nikoloulopoulos[15]: Set where and are conditional copula cdfs and their inverses. 3. Numerically evaluate the joint pmf, e.g. in a quadruple sum With Gauss-Legendre quadrature, the same nodes and weights are used for different functions; this helps in yielding smooth numerical derivatives for numerical optimization via quasi-Newton.

3 Simulations

In this section, we study the small-sample efficiency and robustness of the ML estimation of the multinomial quadrivariate D-vine copula mixed model. In Section 3.1, we gauge the small-sample efficiency of the ML method and investigate the misspecification of the parametric margin or bivariate pair-copulas of the random effects distribution. In Section 3.2, we investigate the mixed model misspecification by using both the proposed model and the extended trivariate vine copula mixed model[6] as true models. We set the sample size and the true univariate and dependence parameters to mimic the data analyzed in Section 4. In each model, we use six different linking copula families: normal, Frank, and Clayton copula along with its rotated versions (see our previous papers on copula mixed models[8,10,21,22] for definitions) to cover different types of dependence structure. To make it easier to compare strengths of dependence, we convert the BVN, Frank, and rotated Clayton estimated parameters to Kendall’s τ’s in via the following relations[23,24] and[25]

3.1 Small-sample efficiency–misspecification of the random effects distribution

We randomly generate samples of size N = 30 from the multinomial quadrivariate D-vine copula mixed model with both normal and beta margins. The simulation process is as below: 1. Simulate from a D-vine distribution . 2. • Convert to normal realizations via • Convert to beta realizations via 3. Simulate the size of diseased and non-diseased subjects n1 and n2, respectively, from a shifted gamma distribution to obtain heterogeneous study sizes,[26] i.e. and round off n1 and n2 to the nearest integers. 4. • For normal margins, draw from and from • For beta margins, draw from and from Tables 2 and 3 contain the resultant biases, root mean square errors (RMSE), and standard deviations (SD), along with the square root of the average theoretical variances (), scaled by 100, for the ML estimates under different pair-copulas and marginal choices from the multinomial D-vine copula mixed model with beta and normal margins, respectively. The true (simulated) pair-copula distributions are the Clayton copulas rotated by for both the and pair-copulas and the Clayton copula rotated by for the pair-copula.
Table 2.

Small sample of sizes N = 30 simulations (103 replications; n = 15) from the multinomial quadrivariate D-vine copula mixed model with beta margins and resultant biases, root mean square errors (RMSE) and standard deviations (SD), along with the square root of the average theoretical variances (), scaled by 100, for the ML estimates under different pair-copula choices and margins.

MarginCopulaπ1 = 0.90π2=0.77π3 = 0.06π4 = 0.11γ1 = 0.09γ2 = 0.08γ3 = 0.37γ4 = 0.15τ12 = 0.82τ23 = –0.52τ34 = 0.26
BiasNormalBVN4.203.49–1.97–1.91–22.3736.2716.07
Beta–0.08–0.030.380.03–0.10–0.21–4.81–0.12–5.016.211.97
NormalFrank4.243.68–1.96–1.86–21.2834.8415.74
Beta0.210.430.11–0.18–0.01–0.17–4.25–0.09–2.585.142.00
NormalCln{180°, 90°}4.203.37–2.00–1.84–15.9030.2215.57
Betaa–0.21–0.160.310.11–0.17–0.28–1.75–0.520.600.711.37
NormalCln{0°, 270°}4.143.52–1.90–1.85–30.1038.7612.30
Beta–0.080.110.530.020.820.49–6.330.14–3.6215.15–2.62
SDNormalBVN1.842.681.591.7424.5014.0628.1817.5824.5227.5825.93
Beta1.952.531.711.672.972.288.294.3510.2614.2717.16
NormalFrank1.892.741.651.8124.7014.0428.4417.8024.9028.5726.45
Beta1.842.371.611.583.002.228.534.348.0214.7117.31
NormalCln{180°, 90°}1.882.671.621.7323.9713.5327.7817.8623.6725.4621.90
Betaa1.982.521.681.672.852.158.894.289.1814.6515.85
NormalCln{0°, 270°}1.882.761.621.7926.2815.8930.7518.6033.7828.3430.21
Beta1.982.631.741.713.592.839.054.5416.1316.2319.53
V¯ NormalBVN1.382.391.171.6616.8610.8525.4014.7315.5515.6215.66
Beta1.341.991.211.461.971.827.924.069.0413.1414.88
NormalFrank1.312.281.121.6216.2110.7624.9414.6613.1313.7514.50
Beta1.181.851.101.361.841.948.254.057.8413.0715.20
NormalCln{180°, 90°}1.362.341.151.6316.4910.3724.4414.1413.5115.5313.77
Betaa1.331.991.211.441.921.838.003.978.0813.4514.33
NormalCln{0°, 270°}1.382.401.181.6616.0410.9227.3414.8413.4712.4416.15
Beta1.221.851.101.362.101.947.844.1410.8312.7516.73
RMSENormalBVN4.594.402.532.5833.1945.5630.51
Beta1.952.531.751.672.972.289.584.3511.4215.5617.28
NormalFrank4.644.592.572.5932.7545.0530.78
Beta1.852.411.611.593.002.239.534.358.4315.5817.42
NormalCln{180°, 90°}4.604.302.582.5328.5239.5226.87
Betaa1.992.521.701.672.852.179.064.319.2014.6715.91
NormalCln{0°, 270°}4.554.472.502.5845.2448.0132.62
Beta1.982.631.811.713.692.8711.044.5416.5422.2019.70

Note: Cln{}: The and pair-copulas are Clayton rotated by ω1 and ω2 degrees, respectively.

BVN: bivariate normal

True model.

Table 3.

Small sample of sizes N = 30 simulations (103 replications; n = 15) from the multinomial quadrivariate D-vine copula mixed model with normal margins and resultant biases, root mean square errors (RMSE) and standard deviations (SD), along with the square root of the average theoretical variances (), scaled by 100, for the ML estimates under different pair-copula choices and margins.

MarginCopulaπ1 = 0.94π2 = 0.79π3 = 0.03π4 = 0.09σ1 = 0.75σ2 = 0.65σ3 = 1.20σ4 = 0.69τ12 = 0.82τ23 = –0.38τ34 = 0.29
BiasNormalBVN–0.64–0.330.610.250.99–1.22–5.03–0.88–6.984.305.50
Beta–6.16–4.214.082.29–15.26–13.2612.79
NormalFrank–0.63–0.170.610.220.82–1.05–5.73–0.86–6.672.535.45
Beta–5.97–3.963.962.25–12.55–14.9212.18
Normal[a]Cln{180°, 90°}–0.63–0.440.570.33–1.13–1.96–2.71–0.97–1.54–2.422.31
Beta–6.37–4.424.102.50–10.10–19.629.71
NormalCln{0°, 270°}–0.72–0.240.710.243.571.36–3.63–0.46–4.0811.784.52
Beta–6.20–4.254.232.37–21.47–6.6110.91
SDNormalBVN2.122.751.831.8418.2911.6223.0614.4017.5417.4219.35
Beta2.992.942.311.945.263.006.513.4210.9818.6322.53
NormalFrank2.202.801.911.8817.9211.6023.4214.5014.4618.5420.16
Beta2.973.002.352.005.233.176.713.4310.6819.3022.22
Normal[a]Cln{180°, 90°}2.142.771.841.8617.7411.4422.7914.3615.4719.0816.82
Beta3.063.032.342.015.163.257.073.5011.4620.3521.01
NormalCln{0°, 270°}2.152.811.851.8619.9213.0824.7215.1322.1619.4724.34
Beta2.993.022.331.985.733.396.603.4616.1321.8330.25
V¯ NormalBVN1.432.451.191.6215.8110.2322.6612.4318.1815.8815.91
Beta1.352.101.171.452.042.116.093.108.0914.8917.57
NormalFrank1.332.301.111.5515.5310.1322.2812.3711.7515.1416.00
Beta1.282.071.131.432.012.296.353.107.7015.8917.29
Normal[a]Cln{180°, 90°}1.412.381.181.5914.929.8821.7112.0414.0616.5314.14
Beta1.312.141.171.451.972.296.583.137.9314.9216.86
NormalCln{0°, 270°}1.392.411.171.6016.2010.5623.2212.6118.5015.0918.85
Beta1.261.951.081.342.202.095.723.158.8918.6320.45
RMSENormalBVN2.222.771.931.8618.3211.6823.6014.4218.8817.9420.12
Beta6.855.134.693.0018.8022.8725.91
NormalFrank2.292.812.001.8917.9311.6524.1114.5315.9318.7120.88
Beta6.674.964.613.0116.4824.3925.34
Normal[a]Cln{180°, 90°}2.232.801.931.8917.7711.6122.9514.3915.5519.2416.98
Beta7.075.364.723.2115.2828.2723.14
NormalCln{0°, 270°}2.272.821.981.8820.2413.1524.9815.1322.5322.7524.76
Beta6.885.224.833.0926.8522.8132.15

Note: Cln{}: The and pair-copulas are Clayton rotated by ω1 and ω2 degrees, respectively.

BVN: bivariate normal.

True model.

Small sample of sizes N = 30 simulations (103 replications; n = 15) from the multinomial quadrivariate D-vine copula mixed model with beta margins and resultant biases, root mean square errors (RMSE) and standard deviations (SD), along with the square root of the average theoretical variances (), scaled by 100, for the ML estimates under different pair-copula choices and margins. Note: Cln{}: The and pair-copulas are Clayton rotated by ω1 and ω2 degrees, respectively. BVN: bivariate normal True model. Small sample of sizes N = 30 simulations (103 replications; n = 15) from the multinomial quadrivariate D-vine copula mixed model with normal margins and resultant biases, root mean square errors (RMSE) and standard deviations (SD), along with the square root of the average theoretical variances (), scaled by 100, for the ML estimates under different pair-copula choices and margins. Note: Cln{}: The and pair-copulas are Clayton rotated by ω1 and ω2 degrees, respectively. BVN: bivariate normal. True model. Conclusions from the values in the tables are the following: ML with the true multinomial D-vine copula mixed model is highly efficient according to the simulated biases, SDs and RMSEs. The ML estimates of the univariate meta-analytic parameters and their SDs are robust under copula misspecification, but are not robust to margin misspecification. The ML estimates of τ’s and their SDs are robust to copula misspecification, but they are not robust to margin misspecification.

3.2 Misspecification of the copula mixed model that accounts for non-evaluable outcomes

We randomly generate samples of size N = 30 from the multinomial quadrivariate D-vine copula mixed model and the extended trivariate vine copula mixed model with both normal and beta margins using the algorithm in Section 3.1 and in Nikoloulopoulos,[6] respectively. We compare the ML estimates of common parameters for both approaches under misspecification and also include in the comparison the bivariate copula mixed model estimates where the non-evaluable positives and negatives are either excluded or included as false negatives and false positives (intention to diagnose approach), respectively. In Section 3.1 and in Nikoloulopoulos,[6] it has been revealed that (a) the estimation of the univariate meta-analytic parameters is a univariate inference, and hence it is the univariate marginal distribution that matters and not the type of the copula, and (b) estimated Kendall’s τ is similar among different families of copulas. Hence, as the ML estimates are nearly not affected by the type of the pair-copula, we provide here the results when all the bivariate copulas are BVN. Tables 4 and 5 contain the resultant biases, RMSEs, and SDs, along with the square root of the average theoretical variances (), scaled by 100, for the ML estimates under different copula mixed models. The true quadrivariate multinomial vine copula mixed model is composed by the Clayton copulas rotated by for both the and pair-copulas and the Clayton copula rotated by for the pair-copula. The true trivariate vine copula mixed model is composed by the Clayton copula for and the Clayton rotated by for both the and pair-copulas.
Table 4.

Small sample of sizes N = 30 simulations (103 replications; n = 15) from the multinomial quadrivariate D-vine and trivariate vine copula mixed models with beta margins and resultant biases, root mean square errors (RMSE) and standard deviations (SD), along with the square root of the average theoretical variances (), scaled by 100, for the ML estimates under different copula mixed models.


True vine copula mixed model

Trivariate

Quadrivariate
Fitted copulamixed modelMarginπ1 = 0.97π2 = 0.85γ1 = 0.03γ2 = 0.06τ = 0.39π1 = 0.90π2 = 0.77γ1 = 0.09γ2 = 0.08τ = 0.82
BiasBivariate[a]Beta0.040.22–0.11–0.1511.377.109.63–5.84–2.25–42.16
Normal[b]0.912.3814.248.2611.78–40.39
Bivariate[c]Beta–3.18–5.461.700.14–2.54–0.08–0.03–0.08–0.22–4.75
Normal[b]–1.59–3.50–1.262.791.47–2.35
TrivariateBeta–0.03–0.09–0.10–0.068.977.109.60–5.81–2.24–42.47
Normal[d]0.862.1310.898.2511.76–40.61
QuadrivariateBeta–3.18–5.461.710.13–1.63–0.08–0.03–0.10–0.21–5.01
Normal–0.46–1.1021.734.203.49–22.37
SDBivariate[a]Beta0.641.911.292.0024.080.811.831.911.9917.07
Normal[b]0.571.9023.4914.4025.410.491.7223.1513.9317.40
Bivariate[c]beta0.771.851.581.9217.451.942.542.942.2210.03
Normal[b]0.801.9419.3812.2218.291.872.8625.1114.079.06
TrivariateBeta0.661.891.272.0222.710.811.831.931.9916.92
Normal[d]0.581.8823.4514.4623.790.481.7223.5114.1017.35
QuadrivariateBeta0.771.871.571.9318.201.952.532.972.2810.25
Normal0.681.9123.3513.8323.921.842.6824.5014.0624.90
V¯ Bivariate[a]Beta0.631.801.342.0227.900.601.611.211.6915.20
Normal[b]0.531.7324.7814.0526.160.451.5219.4011.9115.35
Bivariate[c]beta1.082.091.782.0016.391.311.971.931.798.79
Normal[b]0.932.1019.5511.9917.091.272.0416.0910.478.03
TrivariateBeta0.661.861.372.0625.750.601.601.201.6815.06
Normal[d]0.541.7724.0314.2422.360.451.5219.1311.8215.12
QuadrivariateBeta1.092.091.782.0017.081.341.991.971.829.04
Normal0.812.0124.8113.6122.071.382.3916.8610.8515.55
RMSEBivariate[a]Beta0.641.921.292.0026.637.159.806.143.0045.48
Normal[b]1.083.0429.138.2811.9143.98
Bivariate[c]Beta3.275.772.321.9217.641.942.542.942.2311.10
Normal[b]1.784.0118.343.363.219.36
TrivariateBeta0.671.891.282.0224.427.159.776.123.0045.72
Normal[d]1.042.8426.168.2711.8844.16
QuadrivariateBeta3.275.772.321.9418.271.952.532.972.2811.42
Normal0.822.2132.324.594.4033.19

The non-evaluable outcomes are excluded

The resulting model is the same as the BGLMM.

The non-evaluable positives and negatives are included as false negatives and positives, respectively.

The resulting model is the same as the extended TGLMM.

Table 5.

Small sample of sizes N = 30 simulations (103 replications; n = 15) from the multinomial quadrivariate D-vine and trivariate vine copula mixed models with normal margins and resultant biases, root mean square errors (RMSE) and standard deviations (SD), along with the square root of the average theoretical variances (), scaled by 100, for the ML estimates under different copula mixed models.


True vine copula mixed model

Trivariate

Quadrivariate
Fitted copulamixed modelMarginπ1 = 0.98π2 = 0.88σ1 = 0.90σ2 = 0.73τ = 0.39π1 = 0.94π2 = 0.79σ1 = 0.75σ2 = 0.65τ = 0.82
BiasBivariate[a]Beta–0.85–1.9113.572.696.59–4.63
Normal[b]–0.030.17–6.52–1.3915.813.358.18–1.80–1.96–4.69
Bivariate[c]Beta–4.04–7.58–1.43–5.97–3.98–17.33
Normal[b]–2.46–5.61–0.79–6.12–0.35–1.75–1.9363.8517.11–14.39
TrivariateBeta–0.92–2.1910.972.636.45–7.20
Normal[d]–0.08–0.07–6.28–1.1512.263.318.09–1.64–2.09–7.31
QuadrivariateBeta–4.04–7.57–0.13–6.16–4.21–15.26
Normal–1.39–3.28–8.23–3.6923.31–0.64–0.330.99–1.22–6.98
SDBivariate[a]Beta0.691.941.432.1325.070.541.610.801.5617.37
Normal[b]0.521.7524.4313.6526.480.441.4817.3411.5016.85
Bivariate[c]Beta0.781.831.541.9417.782.952.865.503.0810.51
Normal[b]0.761.8418.2311.7218.502.112.9625.4113.979.93
TrivariateBeta0.721.921.432.1523.290.601.690.861.5517.14
Normal[d]0.541.7324.0013.6324.340.471.5517.8711.4016.62
QuadrivariateBeta0.791.841.531.9618.772.992.945.263.0010.98
Normal0.661.7824.1113.3425.912.122.7518.2911.6217.54
V¯ Bivariate[a]Beta0.591.751.181.9329.790.501.480.711.3819.52
Normal[b]0.501.6723.6013.4030.270.451.4516.2310.6023.19
Bivariate[c]Beta1.062.061.711.9716.681.332.102.012.138.30
Normal[b]0.912.0718.9211.8216.941.342.2215.1511.028.07
TrivariateBeta0.611.781.201.9425.490.511.480.721.3718.94
Normal[d]0.511.7022.8613.2522.970.451.4515.6310.3719.41
QuadrivariateBeta1.062.071.701.9816.831.352.102.042.118.09
Normal0.791.9622.0012.8424.511.432.4515.8110.2318.18
RMSEBivariate[a]Beta1.102.7228.512.746.7917.98
Normal[b]0.521.7525.2813.7230.843.388.3117.4311.6617.49
Bivariate[c]Beta4.117.8017.836.664.9020.27
Normal[b]2.585.9118.2513.2218.502.743.5368.7322.0917.49
TrivariateBeta1.172.9125.742.706.6718.59
Normal[d]0.551.7324.8113.6727.253.358.2317.9411.5918.16
QuadrivariateBeta4.127.7918.776.855.1318.80
Normal1.543.7325.4713.8434.852.222.7718.3211.6818.88

The non-evaluable outcomes are excluded.

The resulting model is the same as the BGLMM.

The non-evaluable positives and negatives are included as false negatives and positives, respectively.

The resulting model is the same as the extended TGLMM.

Small sample of sizes N = 30 simulations (103 replications; n = 15) from the multinomial quadrivariate D-vine and trivariate vine copula mixed models with beta margins and resultant biases, root mean square errors (RMSE) and standard deviations (SD), along with the square root of the average theoretical variances (), scaled by 100, for the ML estimates under different copula mixed models. The non-evaluable outcomes are excluded The resulting model is the same as the BGLMM. The non-evaluable positives and negatives are included as false negatives and positives, respectively. The resulting model is the same as the extended TGLMM. Small sample of sizes N = 30 simulations (103 replications; n = 15) from the multinomial quadrivariate D-vine and trivariate vine copula mixed models with normal margins and resultant biases, root mean square errors (RMSE) and standard deviations (SD), along with the square root of the average theoretical variances (), scaled by 100, for the ML estimates under different copula mixed models. The non-evaluable outcomes are excluded. The resulting model is the same as the BGLMM. The non-evaluable positives and negatives are included as false negatives and positives, respectively. The resulting model is the same as the extended TGLMM. Conclusions from the values in the tables are the following: The bivariate copula mixed model where the non-evaluable outcomes are disregarded and the extended trivariate vine copula mixed model showed similar performance. Both led to unbiased (biased) and efficient (inefficient) estimates when the true model is the trivariate (quadrivariate multinomial) vine copula mixed model. The bivariate copula mixed model where the non-evaluable positives and negatives included as false negatives and false positives, respectively, and the multinomial D-vine copula mixed model with beta margins showed similar performance. Both led to unbiased (biased) and efficient (inefficient) estimates when the true model is the quadrivariate multinomial vine copula mixed model with beta margins (trivariate vine copula mixed model or quadrivariate multinomial vine copula mixed model with normal margins).

4 Meta-analysis of coronary computed tomography angiography studies

We apply the multinomial quadrivariate D-vine copula mixed model for the meta-analysis of diagnostic accuracy studies accounting for non-evaluable subjects to data on 30 studies from a systematic review for diagnostic accuracy studies of coronary computed tomography angiography for the detection of coronary artery disease.[27] We fit the multinomial quadrivariate D-vine copula mixed model for all different decompositions of the D-vine copula density, for both beta and normal margins and different pair-copulas at the level 1; for levels 2 and 3, we use BVN copulas. In cases when fitting the multinomial quadrivariate D-vine copula mixed model, the resultant estimate of one of the conditional dependence parameters was close to the right boundary of its parameter space (that is clear indication that the model with a full structure provides more dependence structure than it is actually required[8]), we used a truncated model, i.e. we captured the strongest dependence in the first tree and then just used the independence copulas in lower order trees, i.e. conditional independence. Joe et al.[28] showed that in order for a vine copula to have (tail) dependence for all bivariate margins, it is only necessary for the bivariate copulas in level 1 to have (tail) dependence and it is not necessary for the conditional bivariate copulas in levels 2 and 3, to have tail dependence. Hence, one can either use BVN or independence copulas at levels 2 and 3 without sacrificing the tail dependence of the vine copula distribution. In Table 6, we present the results from the decomposition of the vine copula density in equation (4), as a different decompositions led to similar results due to the small sample size. This is consistent with our previous studies on vine copula mixed models.[6,8] Since the number of parameters is not the same between the models, we use the Akaike information criterion (AIC), that is −2× log-likelihood +2× (#model parameters) as a rough diagnostic measure for goodness of fit between the models. The AICs showed that a (truncated) multinomial quadrivariate D-vine copula mixed model with Clayton copulas rotated by for both the and pair-copulas and the Clayton copula rotated by for the pair-copula and beta margins (Table 6) provides the best fit.
Table 6.

AICs, ML estimates, and standard errors (SE) of the multinomial quadrivariate D-vine copula mixed models for diagnostic accuracy studies of coronary computed tomography angiography.


BVN

Frank

Cln{180°, 90°}[a]

Cln{180°, 270°}
Est.SEEst.SEEst.SEEst.SE
Normal margins
π10.940.010.950.010.940.020.940.02
π20.800.030.800.030.790.030.790.03
π30.040.010.030.010.030.010.040.01
π40.090.020.090.020.090.020.090.02
σ10.890.200.910.190.750.170.830.17
σ20.720.150.650.130.650.120.670.13
σ31.320.361.370.361.200.311.190.33
σ40.800.230.700.210.690.190.730.19
τ120.540.220.490.200.820.190.820.18
τ23–0.160.20–0.310.17–0.380.24–0.040.15
τ340.220.230.110.240.290.170.370.17
τ13|20.430.340.670.23
τ24|30.110.22–0.030.24
τ14|23–0.390.32–0.360.49
 AIC4013.224010.804007.724009.36
Beta margins
π10.900.020.900.020.900.010.890.01
π20.760.030.770.020.770.020.760.02
π30.060.010.060.010.060.010.070.01
π40.110.020.110.020.110.020.110.02
γ10.080.030.090.030.090.030.100.03
γ20.090.030.090.020.080.020.090.02
γ30.320.120.320.130.370.120.280.12
γ40.150.070.160.070.150.070.150.06
τ120.710.110.740.080.820.080.790.07
τ23–0.350.17–0.340.12–0.520.14–0.230.10
τ340.230.220.200.210.260.180.210.17
τ13|2–0.660.38
τ24|3–0.100.20
τ14|23–0.020.57
 AIC4009.424005.934002.174004.92

Note: Cln{}: The and pair-copulas are Clayton rotated by ω1 and ω2 degrees, respectively.

AIC: akaike information criterion; BVN: bivariate normal.

Best fit.

AICs, ML estimates, and standard errors (SE) of the multinomial quadrivariate D-vine copula mixed models for diagnostic accuracy studies of coronary computed tomography angiography. Note: Cln{}: The and pair-copulas are Clayton rotated by ω1 and ω2 degrees, respectively. AIC: akaike information criterion; BVN: bivariate normal. Best fit. In real data (in contrast with the simulated data in Section 3), the truth is unknown, so it is important to compare between the proposed and other existing approaches in terms of point estimation and variance. First, in order to reveal if the use of the proposed model is worthy, when a standard bivariate analysis (either ignoring the non-evaluable outcomes or including the non-evaluable positives and negatives as false negatives and positives, respectively) is easy, we also fit the bivariate copula mixed model[10] with both beta and normal margins and different bivariate copulas. According to the likelihood principle, a bivariate copula mixed model with a Clayton and Clayton copula rotated by 180° (to model the association between the latent sensitivity and specificity) and beta margins provides the best fit for both different ad-hoc approaches to handle the non-evaluable outcomes (Table 7). It is revealed that a bivariate copula mixed model with the sensitivity and specificity on the original scale provides better fit than the BGLMM,[7] which models the sensitivity and specificity on a transformed scale.
Table 7.

AICs, ML estimates, and standard errors (SE) of the best fitted bivariate copula and extended trivariate vine copula mixed models with beta margins for diagnostic accuracy studies of coronary computed tomography angiography.


Bivariate

Trivariate

Clayton[a]

Clayton 180°b

Clayton {0°,90°}
Est.SEEst.SEEst.SE
π 1 0.970.010.900.010.970.01
π 2 0.850.020.770.020.850.02
π 3 0.490.03
γ 1 0.030.010.090.030.030.01
γ 2 0.060.020.080.020.060.02
γ 3 0.110.02
τ 12 0.420.190.820.080.390.20
τ 13 0.020.23
τ23|1 –0.280.17
AIC244.82321.91492.26

AIC: akaike information criterion.

The non-evaluable outcomes are excluded.

The non-evaluable positives and negatives are included as false negatives and positives, respectively.

AICs, ML estimates, and standard errors (SE) of the best fitted bivariate copula and extended trivariate vine copula mixed models with beta margins for diagnostic accuracy studies of coronary computed tomography angiography. AIC: akaike information criterion. The non-evaluable outcomes are excluded. The non-evaluable positives and negatives are included as false negatives and positives, respectively. Then, in order to compare the proposed approach with the ones that use the MAR assumption, we fit the extended trivariate vine copula mixed model[6] with both beta and normal margins and different pair-copulas. According to the likelihood principle, a vine copula mixed model composed of a Clayton copula to model the association between the sensitivity and specificity, a Clayton copula rotated by 90° to model both the associations between the specificity and prevalence and between the sensitivity and prevalence given the specificity, and beta margins provides the best fit (Table 7). It is revealed that an extended trivariate vine copula mixed model with the sensitivity, specificity, and disease prevalence on the original scale provides better fit than the extended TGLMM,[5] which models the sensitivity, specificity, and disease prevalence on a transformed scale. It has been shown that the trivariate analysis does not change the conclusions from the bivariate analysis excluding the non-evaluable outcomes. It is also apparent that the results from the quadrivariate analysis differentiate from the ones from bivariate (excluding the non-evaluable outcomes) and trivariate analyses which are fairly similar. The meta-analytic estimates of sensitivity and specificity from the latter approaches are blown, because in both approaches it is assumed that i.e. their support ignores the number of non-evaluable positives and the number of non-evaluable negatives . The conclusions from the quadrivariate analysis with the latent proportions on the original scale are quite similar with the ones from the bivariate analysis where the non-evaluable positives and negatives are included as false negatives and positives, respectively. These results are consistent with the findings in the simulations in Section 3.2. Note in passing that comparing the AIC values among the quadrivariate, trivariate and bivariate copula mixed models is inconclusive as they use a different number of responses. Although typically the focus of meta-analysis has been to derive the summary-effect estimates, there is increasing interest in drawing predictive inference. Summary receiver operating characteristic curves (SROC) can be deduced from the D-vine copula mixed model with the sensitivity and specificity on the original scale through the quantile regression techniques developed for the bivariate copula mixed model.[10] SROC essentially shows the effect of different model (random effect distribution) assumptions, since it is an inference that depends on the joint distribution. An SROC curve has been deduced for the bivariate copula mixed model[10] through a median regression curve of X1 on X2. For the copula mixed model, the model parameters (including dependence parameters), the choice of the copula, and the choice of the margin affect the shape of the SROC curve.[10] However, there is no priori reason to regress X1 on X2 instead of the other way around, so a median regression curve of X2 on X1 has also been provided. Rucker and Schumacher[29] stated that instead of summarizing data using an SROC, it might be preferable to give confidence regions. Hence, in addition to using just median regression curves, quantile regression curves with a focus on high (q = 0.99) and low quantiles (q = 0.01), which are strongly associated with the upper and lower tail dependence imposed from each parametric family of copulas, have been proposed.[10] These can been seen as confidence regions of the median regression SROC curve. Figure 1 demonstrates the SROC curves with a confidence region and summary operating points (a pair of the model-based sensitivity and specificity; shown by the black square) from the best fitted multinomial quadrivariate D-vine copula mixed model, the best fitted trivariate vine copula mixed model, and the best fitted bivariate copula mixed models along with the study estimates (shown by the circles in Figure 1). For the upper panel graphs, the sensitivity and specificity at study i (point estimates) have been calculated with the typical definitions of sensitivity and specificity, viz. respectively, as only patients with positive or negative results are considered, while for the lower panel graphs, the sensitivity and specificity at study i have been calculated with the definitions of sensitivity and specificity in Simel et al.,[30] viz. respectively, since the number of non-evaluable positives contributes to the diseased population and the number of non-evaluable negatives contributes to the non-diseased population.
Figure 1.

SROC curves with a confidence region and summary operating points (a pair of the model-based sensitivity and specificity) from the best fitted multinomial quadrivariate D-vine, extended trivariate vine and bivariate copula mixed models along with the study estimates.

: summary point; : study estimate; black and grey lines represent the quantile regression curves and , respectively; for q = 0.5 solid lines and for dotted lines (confidence region).aThe non-evaluable outcomes are excluded.bThe non-evaluable positives and negatives are included as false negatives and positives, respectively.

SROC curves with a confidence region and summary operating points (a pair of the model-based sensitivity and specificity) from the best fitted multinomial quadrivariate D-vine, extended trivariate vine and bivariate copula mixed models along with the study estimates. : summary point; : study estimate; black and grey lines represent the quantile regression curves and , respectively; for q = 0.5 solid lines and for dotted lines (confidence region).aThe non-evaluable outcomes are excluded.bThe non-evaluable positives and negatives are included as false negatives and positives, respectively.

5 Discussion

Motivated by the existence of non-evaluable results in diagnostic test accuracy studies, this paper proposed a multinomial quadrivariate D-vine copula mixed model for meta-analysis of diagnostic test accuracy studies accounting for non-evaluable subjects. Our general statistical model allows for selection of pair-copulas independently among a variety of parametric copula families, i.e. there are no constraints in the choices of bivariate parametric families of copulas and can also operate on the original scale of sensitivity and specificity. For the random effects, we have used a quadrivariate D-vine copula distribution or a truncated at level 1 quadrivariate D-vine copula (conditional independence), which allows flexible (tail) dependence.[28] We have proposed a numerically stable ML estimation technique based on Gauss-Legendre quadrature; the crucial step is to convert from independent to dependent quadrature points that follow a quadrivariate D-vine distribution. In an era of evidence-based medicine, decision makers need high-quality procedures such as the one developed in this article to support decisions about whether or not to use a diagnostic test in a specific clinical situation. The multinomial quadrivariate vine-copula mixed model is not an ad-hoc[2] but rather a sophisticated approach that utilizes all the available data in decision making and can satisfy the intention-to-diagnose principle. Using an intention to diagnose principle, i.e. a conservative approach, ensures that both the sensitivity and specificity are not overestimated. Hence, it formally enables decision makers to be more cautious in solely relying to the overly optimistic meta-analytic estimates of sensitivity and specificity derived from the extended trivariate vine copula mixed model that indirectly accounts for the non-evaluable outcomes.
  18 in total

1.  A mixed effect model for bivariate meta-analysis of diagnostic test accuracy studies using a copula representation of the random effects distribution.

Authors:  Aristidis K Nikoloulopoulos
Journal:  Stat Med       Date:  2015-08-02       Impact factor: 2.373

2.  A vine copula mixed effect model for trivariate meta-analysis of diagnostic test accuracy studies accounting for disease prevalence.

Authors:  Aristidis K Nikoloulopoulos
Journal:  Stat Methods Med Res       Date:  2015-08-11       Impact factor: 3.021

3.  Bayesian bivariate meta-analysis of diagnostic test studies using integrated nested Laplace approximations.

Authors:  M Paul; A Riebler; L M Bachmann; H Rue; L Held
Journal:  Stat Med       Date:  2010-05-30       Impact factor: 2.373

4.  Letter to the editor.

Authors:  Gerta Rücker; Martin Schumacher
Journal:  Biostatistics       Date:  2009-07-06       Impact factor: 5.899

5.  The influence of uninterpretability on the assessment of diagnostic tests.

Authors:  C B Begg; R A Greenes; B Iglewicz
Journal:  J Chronic Dis       Date:  1986

6.  Hybrid copula mixed models for combining case-control and cohort studies in meta-analysis of diagnostic tests.

Authors:  Aristidis K Nikoloulopoulos
Journal:  Stat Methods Med Res       Date:  2016-12-21       Impact factor: 3.021

7.  Intermediate, indeterminate, and uninterpretable diagnostic test results.

Authors:  D L Simel; J R Feussner; E R DeLong; D B Matchar
Journal:  Med Decis Making       Date:  1987 Apr-Jun       Impact factor: 2.583

8.  A D-vine copula-based model for repeated measurements extending linear mixed models with homogeneous correlation structure.

Authors:  Matthias Killiches; Claudia Czado
Journal:  Biometrics       Date:  2018-03-22       Impact factor: 2.571

9.  A trivariate meta-analysis of diagnostic studies accounting for prevalence and non-evaluable subjects: re-evaluation of the meta-analysis of coronary CT angiography studies.

Authors:  Xiaoye Ma; Muhammad Fareed K Suri; Haitao Chu
Journal:  BMC Med Res Methodol       Date:  2014-12-04       Impact factor: 4.615

Review 10.  Use of 3x2 tables with an intention to diagnose approach to assess clinical performance of diagnostic tests: meta-analytical evaluation of coronary CT angiography studies.

Authors:  Georg M Schuetz; Peter Schlattmann; Marc Dewey
Journal:  BMJ       Date:  2012-10-24
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.