Literature DB >> 17919329

Accuracy of telepsychiatric assessment of new routine outpatient referrals.

Surendra P Singh1, Dinesh Arya, Trish Peters.   

Abstract

BACKGROUND: Studies on the feasibility of telepsychiatry tend to concentrate only on a subset of clinical parameters. In contrast, this study utilises data from a comprehensive assessment. The main objective of this study is to compare the accuracy of findings from telepsychiatry with those from face to face interviews.
METHOD: This is a primary, cross-sectional, single-cluster, balanced crossover, blind study involving new routine psychiatric referrals. Thirty-seven out of forty cases fulfilling the selection criteria went through a complete set of independent face to face and video assessments by the researchers who were blind to each other's findings.
RESULTS: The accuracy ratio of the pooled results for DSM-IV diagnoses, risk assessment, non-drug and drug interventions were all above 0.76, and the combined overall accuracy ratio was 0.81. There were substantial intermethod agreements for Cohen's kappa on all the major components of evaluation except on the Risk Assessment Scale where there was only weak agreement.
CONCLUSION: Telepsychiatric assessment is a dependable method of assessment with a high degree of accuracy and substantial overall intermethod agreement when compared with standard face to face interview for new routine outpatient psychiatric referrals.

Entities:  

Mesh:

Year:  2007        PMID: 17919329      PMCID: PMC2194760          DOI: 10.1186/1471-244X-7-55

Source DB:  PubMed          Journal:  BMC Psychiatry        ISSN: 1471-244X            Impact factor:   3.630


Background

Verbal information and visual cues are major and primary ingredients of psychiatric assessment. The sounds and images transmitted through video-conferencing are equivalent to these two parameters respectively. Other factors such as empathy and rapport are also crucial and their influence on the outcome of assessment is well understood but not well quantified. The assumption that video-conferencing would provide results equivalent to those from face-to-face psychiatric interview is related to these corollaries and requires testing and quantification. Trust and confidence in using this technology can be greatly enhanced if this assumption is proved true. In view of rapid developments in hardware, wireless technology and data-transmission, psychiatric intervention through video-conferencing (telepsychiatry) can be an effective mode of service delivery, especially for remotely located population clusters. Meeting mental health needs for remotely and sparsely populated communities has been a challenge to service providers due to various factors including resource constrains and difficulty in recruitment of mental health staff. Attempts have been made to address these concerns through the use of new and emerging technologies. When such new methods are used for clinical assessment, there is bound to be inherent uncertainties as to whether these new methods are as reliable, sensitive and accurate as existing methods of clinical assessment. Although several studies [1-4] can be found on pilot projects and the feasibility of telepsychiatry, it seems that to date none have attempted to test a predetermined hypothesis for a complete set of clinical parameters in adult psychiatry. Earlier reviews [5-7] were complemented by evaluation of psychiatric assessment using the telephony system [8], video recording [9] and then video-conferencing [10,11]. A recent study [12] demonstrated usefulness of telepsychiatry as a valuable clinical and research tool. However, most of these studies focussed narrowly on the diagnostic aspect of psychiatric assessment. Other studies have attempted to deal with psychopathology [13,14], cost and feasibility [15,16], user satisfaction [3,17], acceptability [18] and psychological intervention [19,20]. The authors of this study failed to identify studies of reasonable quality on complete and comprehensive assessments of new psychiatric referrals in a general adult outpatient clinic. This study attempts to detect the level of intermethod agreement between telepsychiatric assessment and face-to-face interview for routine new outpatient referrals to the general adult psychiatric unit. It is anticipated that there is a high level of agreement between conclusions drawn from psychiatric interviews through video-conferencing (V) and the standard method of face-to-face psychiatric assessment (S) for diagnosis, risk assessment and clinical intervention. This study aims to test this assumption.

Methods

Setting and participants

The study was conducted at Hawkes Bay Health Care, which provides a National Health Service to the Hawkes Bay and Chatham Island areas of New Zealand (NZ). The study was approved by the Hawkes Bay Ethics Committee, New Zealand. The sample consisted of consecutive new adult psychiatric referrals to the Napier Community Mental Health Team (NCMHT). They belonged to the 19 to 65 age group, were not under care of the NCMHT and had not received care for any mental health issue from this unit for a period of at least 6 months at the time of referral. Cases requiring urgent assessment or home visit were excluded. In clinical practice, the outcome of the standard method of face-to-face assessment (S) is always supposed to be accurate. Accordingly, using method S as the gold standard, the results from V can be classified as 'accurate' if the outcome is identical to that from S for a given attribute, or as 'inaccurate' if there is disagreement between methods S and V. For the purpose of this study, the accuracy ratio (AR) is defined as the risk ratio (RR) between the accurate outcomes of video-assessment (V) and the results from face-to-face assessment (S). Assuming an AR of 0.95 or above for face-to-face assessment and results of video-assessment at a significance level of 0.05 and a power of 0.8, a sample size of 34 for each method would suffice to detect a difference of 15% or more between these two methods of assessment [21]. Accordingly, a sample of 40 participants based on single stage cluster sampling was considered to be adequate for this two-way, within subjects, crossed balanced design. The data derived from this study was also used for calculation of Cohen's kappa (CK) and its bootstrap confidence interval. From the 40 consecutive new psychiatric referrals fulfilling above criteria, two cases declined to participate and one case could not be located. A written informed consent was obtained from all remaining 37 cases and they all went through their complete intended assessments. The referral period extended from 26 February 2001 to 15 May 2001 and the assessments were completed between 23 March 2001 and 17 May 2001.

Assessment procedure

The assessment order for each method and for each psychiatrist was predetermined using a method of random allocation. The whole list was randomly divided into the two sub-lists, then participants were randomly allocated to have their first assessment either by researcher R1 or R2. The randomly selected half of the cases of each researcher (R1 and R2) had their first assessment by method S and the remaining half had their first assessment by method V. The second assessment (S or V as appropriate) of each individual case was subsequently completed by the other researcher (R1 or R2). The details of the randomisations and assessment procedures have been displayed in the Figure 1.
Figure 1

The sample randomisation. The numbers in the boxes are the serial numbers of the sample cases. Those crossed are drop-outs.

The sample randomisation. The numbers in the boxes are the serial numbers of the sample cases. Those crossed are drop-outs. None of the researchers had prior experience of conducting formal telepsychiatric interviews for clinical care. Prior to initiating the research assessments, the researchers spent one session to familiarise with the equipment, and two sessions practising on known cases to evaluate and compare their findings in order to enhance their interrater agreement. All face-to-face interviews were conducted at Hastings, while video assessment was carried out from Wairoa, 140 km away. All participants underwent both methods of assessment; each participant having one assessment on video by one psychiatrist and one face-to-face interview by the other psychiatrist. The interviewers utilised their own usual practice of clinical interview to resemble a standard outpatient setting. The main confounding variables likely to influence the level of agreement are; bias between the researchers doing the assessments, duration of interview, use of interpreter, order-effect (effect on the second interview due to practice or residual memory from the first interview) and the time interval between the two methods of interview. The influence of such biases was minimised by adopting a crossover design and assigning an equal number of cases to each of the interviewing psychiatrists, to each interview methods, and to each order of assessments (S followed by V and V followed by S). The researchers were not aware of each others findings while assessing an individual participant. Both assessments for each participant were completed on the same date and each assessment lasted up to 60 minutes. If an interpreter was involved, he/she had to attend both sessions for that given case. Video-Conferencing Units were available at Wairoa and Hastings. Both centres were equipped with a PictureTel Venue 2000 model 50 with 29 inch colour TV and were linked with a 384 KB (128 KB × 3) bandwidth ISDN line. Scanning and zooming of each of these video-conferencing units could be remotely controlled by the interviewer or by the interviewee. The Picture-in-picture (PIP) facility was not used on the interviewee side to prevent unnecessary distraction during the interview.

Diagnostic tools, scales and data

Diagnoses on the DSM-IV axes were based on the method described in the Decision Trees for Differential Diagnosis of the Handbook [22] with assistance from the manual when required. A Risk Assessment Schedule (RAS) was adopted from the guidelines for assessment of risk factors identified by the NZ Ministry of Health [23]. This scale has not been tested for its reliability and validity and is included in the appendix for information [Appendix-I]. A List of Psychiatric Intervention (LIPI, Appendix-II) was developed to record options of admission/discharge/follow up, investigations, psychological intervention and community support. Primarily, this is a list of clinical decisions to select if applicable. The details of any pharmacological intervention were also recorded in a structured format. The full diagnostic code for the DSM-IV-Axis-1, the presence or absence of diagnoses on Axis2 and Axis-3, the applicability or non-applicability of DSM-IV-Axis-4 questions and the score for DSM-IV-Axis-5 Global Assessment of Functioning (GAF) was recorded for each assessment. To confer uniformity, the numerical score of GAF was changed into ordinal type ranging from lower to high categories (A to E) based on a class interval method fulfilling the transformation criteria [24]. The RAS original scoring options of 'NIL' and 'LOW' were merged to 'low', and 'HIGH' and 'VERY HIGH' to 'serious'. This produced three distinct 'low', 'medium' and 'serious' risk categories for the purpose of statistical analysis. Possible responses for items of LIPI scale were dichotomous in nature excepting drug-related outcomes. Clinical decisions for investigations, psychological intervention and community support were summarised on a group wise basis. All medications were classified into nine types for eleven indications, resulting in five drug related initiatives. Minor adjustments were made to present the data table in an n-by-n format as a pre-requisite for kappa calculation. One DSM-IV Axis-1 diagnosis of disorganised schizophrenia (V, 295.10) was changed to paranoid schizophrenia (295.30) giving a concordant entry; one case of cyclothymic disorder, (S, 301.13) was changed to bipolar disorder (296.56) giving a discordant entry; and one case of factitious disorder (V, 300.16) was changed to somatoform disorder (300.81) giving a concordant pair. If the number of total diagnoses for a given case differed between methods of assessment, a category of 'NIL' was introduced to reflect the lack of identification of an equivalent diagnosis by the corresponding method. This led to the introduction of 2 concordant pairs and one discordant pair by the first method of adjustment and 3 discordant pairs arising from 'NIL' categories from the second method of adjustment. The resulting preponderance of discordant pairs over concordant pairs is likely to influence the interpretation against the research hypothesis, rather than in favour of it.

Statistical evaluation

The test statistics of AR, Risk Difference (RD) and CK were calculated and summarised in accordance with methods described in the standard texts [24-26]. For the purpose of comparisons using AR and RD, an assumption is made that all outcomes from face-to-face assessment are 100% accurate. While using the asymptotic method of computation, some of the upper confidence interval of CK may exceed the permitted value of 1. Techniques like Bias Corrected Accelerated Bootstrap Confidence Interval (BCaCI) [27] and exact p estimate [28] have been advocated to resolve this paradox. Accordingly, this study applied non-parametric BCaCI methodology using 50,000 bootstrap samples with replacement. The re-sampling was performed in a manner that retained the structural consistency of each subgroup. The techniques of bootstrapping and re-sampling are well established statistical methods and yet are little known in medical literature. The required software codes were developed for these models by the principal author (SPS) using R (version 2.4.1) [29] and were tested against other packages (SPSS, SAS and S-Plus) before data analysis. R is an open source statistical language software from the R Foundation of Statistical Computing, Vienna (ISBN 3-900051-07-0, 2006).

Results

Of 37 participants, 20 were female and 17 male. Ethnically, there were 27 participants of European descent, 8 were of Maori origin and 2 were from other groups. Their ages ranged between 19.21 and 63.29 years; with an average age of 35.40 years and a standard deviation of 12.46 years. The presence of statistical significance for the results of ARs in the Table 1 is based on two sided p value of 0.05 or less. The primary data from rows 1 to 30 in Table 1 and from rows 1 to 28 in Table 2 have been re-used to summarise in the remaining rows of their respective tables. This has invariably lead to multiple comparisons and interpretation of the results should reflect this limitation.
Table 1

Comparison of results of telepsychiatric assessments and face-to-face interviews

SampleAccuracy Ratio StatisticsRisk Difference Statistics
Primary attributes and accuracyAISizeARLCIUCIpRDLCIUCIp
01DSM-IV Axis 1495540.910.830.990.025-0.09-0.17-0.020.019
02DSM-IV Axis 2343370.920.841.010.083-0.08-0.170.010.071
03DSM-IV Axis 3362380.950.881.020.157-0.05-0.120.020.146
04DSM-IV Axis 4 Q1316370.840.730.970.014-0.16-0.28-0.040.007
05DSM-IV Axis 4 Q2316370.840.730.970.014-0.16-0.28-0.040.007
06DSM-IV Axis 4 Q3370371.001.001.00NA0.000.000.00NA
07DSM-IV Axis 4 Q4361370.970.921.030.317-0.03-0.080.030.311
08DSM-IV Axis 4 Q5370371.001.001.00NA0.000.000.00NA
09DSM-IV Axis 4 Q6325370.860.760.980.025-0.14-0.25-0.020.016
10DSM-IV Axis 4 Q7370371.001.001.00NA0.000.000.00NA
11DSM-IV Axis 4 Q8343370.920.841.010.083-0.08-0.170.010.071
12DSM-IV Axis 4 Q9136370.030.000.190.000-0.97-1.03-0.920.000
13DSM-IV Axis 52611370.700.570.870.001-0.30-0.44-0.150.000
14Risk to Self Q12710370.730.600.890.002-0.27-0.41-0.130.000
15Risk to Self Q2298370.780.660.930.005-0.22-0.35-0.080.001
16Risk to Self Q3289370.760.630.910.003-0.24-0.38-0.100.001
17Risk to Self Q4289370.760.630.910.003-0.24-0.38-0.100.001
18Risk to Others Q1289370.760.630.910.003-0.24-0.38-0.100.001
19Risk to Others Q2352370.950.881.020.157-0.05-0.130.020.146
20Risk to Others Q3334370.890.801.000.046-0.11-0.21-0.010.034
21Risk to Others Q4325370.860.760.980.025-0.14-0.25-0.020.016
22Risk to Others Q5334370.890.801.000.046-0.11-0.21-0.010.034
23Risk to Others Q6298370.780.660.930.005-0.22-0.35-0.080.001
24Admit-Discharge-Follow-up343370.920.841.010.083-0.08-0.170.010.071
25Investigations2512370.680.540.840.001-0.32-0.48-0.170.000
26Psychological Input2611370.700.570.870.001-0.30-0.44-0.150.000
27Community Support298370.780.660.930.005-0.22-0.35-0.080.001
28Drug Type6411750.850.780.940.001-0.15-0.23-0.070.000
29Drug Action5817750.770.680.870.000-0.23-0.32-0.130.000
30Drug Indication4926750.650.550.770.000-0.35-0.45-0.240.000

Main Attributes

31DSM-IV Axis 1 (1)495540.910.830.990.025-0.09-0.17-0.020.019
32DSM-IV Axis 2 (2)343370.920.841.010.083-0.08-0.170.010.071
33DSM-IV Axis 3 (3)362380.950.881.020.157-0.05-0.120.020.146
34DSM-IV Axis 4 (4:12)276573330.830.790.870.000-0.17-0.21-0.130.000
35DSM-IV Axis 5 (13)2611370.700.570.870.001-0.30-0.44-0.150.000
36Risk to Self (14:17)112361480.760.690.830.000-0.24-0.31-0.170.000
37Risk to Others (18:23)190322220.860.810.900.000-0.14-0.19-0.100.000
38Admit-Discharge-Follow-up (24)343370.920.841.010.083-0.08-0.170.010.071
39Investigations (25)2512370.680.540.840.001-0.32-0.48-0.170.000
40Psychological Input (26)2611370.700.570.870.001-0.30-0.44-0.150.000
41Community Support (27)298370.780.660.930.005-0.22-0.35-0.080.001
42Drug Type (28)6411750.850.780.940.001-0.15-0.23-0.070.000
43Drug Action (29)5817750.770.680.870.000-0.23-0.32-0.130.000
44Drug Indication (30)4926750.650.550.770.000-0.35-0.45-0.240.000

Major Attributes

45DSM Diagnosis (1:13)421784990.840.810.880.000-0.16-0.19-0.120.000
46Risks (14:23)302683700.820.780.860.000-0.18-0.22-0.140.000
47Non-Drug Intervention (24:27)114341480.770.710.840.000-0.23-0.30-0.160.000
48Drug Intervention (28:30)171542250.760.710.820.000-0.24-0.30-0.180.000

Overall

49Overall Result (1:30)100823412420.810.790.830.000-0.19-0.21-0.170.000

A: Accurate outcome, I: Inaccurate Outcome, AR: Accuracy Ratio (Risk Ratio), LCI: Lower 95% Confidence Interval, UCI: Upper 95% Confidence Interval, RD: Risk Difference when the sample statistics compared with Accuracy (Risk) of 1 for the face-to-face interview. All approximated p values are more than zero.

Numbers in brackets represent index number of primary attributes serialised in the previous rows.

Drug Types- Atypical Antipsychotic, Mood Stabilizer, Tricyclic Antidepressant, Newer Antidepressant, Other Antidepressant, Benzodiazepine Group, Non-Benzodiazepine Anxolytic, Anti-Parkinsonian, and 'No Medication'.

Drug related actions were grouped into 'New Prescription', 'Continuation of previously prescribed medication with no change', and 'Adjustment of previously prescribed medication'.

Drug Indications were categorised for Positive Psychotic Symptoms, Manic symptoms, Bipolar Disorder, Depression, Anxiety and Associated Symptoms, Drugs for Side Effects (e.g. Anti-parkinsonian), Hypnotics, and Use of Drugs' sedative effect for anxiety control and sleep.

Table 2

Cohen's Kappa results of intermethod and interviewers assessments

Standard vs VideoInterviewers
SNPrimary AttributesSizeGroupCKLCIUCICKLCIUCI
1DSM-IV Axis 154260.900.281.000.900.331.00
2DSM-IV Axis 23720.62-0.040.870.630.210.87
3DSM-IV Axis 33820.840.490.930.840.490.93
4DSM-IV Axis 4 Q13720.570.250.820.570.220.83
5DSM-IV Axis 4 Q23720.17-0.110.720.18-0.090.68
6DSM-IV Axis 4 Q33720.850.660.910.850.660.91
7DSM-IV Axis 4 Q43720.870.650.940.870.650.94
8DSM-IV Axis 4 Q53720.66NANA0.66NANA
9DSM-IV Axis 4 Q63720.550.180.840.540.130.84
10DSM-IV Axis 4 Q83720.36-0.070.790.37-0.080.54
11DSM-IV Axis 53750.890.810.940.890.810.94
12Risk to self Q13730.660.390.860.650.390.86
13Risk to self Q23730.680.340.860.680.370.86
14Risk to self Q33730.55-0.110.730.550.220.80
15Risk to self Q43730.45-0.110.680.450.120.79
16Risk to others Q13730.620.270.860.620.270.86
17Risk to others Q23730.820.301.000.820.371.00
18Risk to others Q3372-0.04-0.10-0.03-0.04-0.10-0.03
19Risk to others Q43730.55-0.110.780.55-0.060.78
20Risk to others Q53720.44-0.070.840.44-0.060.84
21Risk to others Q63730.41-0.130.660.41-0.120.66
22Admit-Discharge-Follow up3730.760.370.930.760.370.93
23Investigations3720.29-0.030.590.30-0.020.59
24Psychological Input3720.16-0.170.530.18-0.120.54
25Community Support3720.550.220.780.560.250.78
26Drug Type7590.830.610.900.820.520.90
27Drug Action7550.670.520.790.660.520.79
28Drug Indication75110.590.350.740.590.350.76

Main Attributes

29DSM-IV Axis 1 (1)54260.900.281.000.900.331.00
30DSM-IV Axis 2 (2)3720.62-0.040.870.630.210.87
31DSM-IV Axis 3 (3)3820.840.490.930.840.490.93
32DSM-IV Axis 4 (4:10)259140.650.520.780.650.520.78
33DSM-IV Axis 5 (11)3750.890.810.940.890.810.94
34Risk to self (12:15)148120.610.480.750.610.480.75
35Risk to Others (16:21)222160.05-0.010.110.05-0.010.11
36Admit-Discharge-Follow up (22)3730.760.370.930.760.370.93
37Investigations (23)3720.29-0.030.590.30-0.020.59
38Psychological Input (24)3720.16-0.170.530.18-0.120.54
39Community Support (25)3720.550.220.780.560.250.78
40Drug Type (26)7590.830.610.900.820.520.90
41Drug Action (27)7550.670.520.790.660.520.79
42Drug Indication (28)75110.590.350.740.590.350.76

Major Attributes

43DSM-IV (1:11)425490.860.810.900.860.810.90
44Risks (12:21)370280.140.080.190.140.090.19
45Non-Drug Intervention (22:25)14890.490.350.640.490.350.64
46Drug Intervention (26:28)225250.720.660.790.720.660.79

Overall

47Overall Result (1:28)11681110.600.570.630.600.570.63

The column 'Size' constitutes the number of total sample points used for assessments.

The column 'Group' stands for the number of total categories of results e.g. all diagnoses with Paranoid Schizophrenia will make one group.

CK: Cohen's Kappa values derived from 'n by n' table made from the Groups. The significant values (Lower CI more than 0) are printed in bold. The summary CK results in the rows from 29 to 47 are based on "Inverse Variance method" using CKs and variances from previous rows displayed in the brackets for the respective assessments.

95% Lower & Upper Confidence Intervals (LCI and UCI) from the rows 1 to 28 are derived from non-parametric Bias Corrected Accelerated (BCa) variance from 50,000 bootstrap samples with replacement. The summary LCIs and UCIs in the rows from 29 to 47 are based on "Inverse Variance method" using CKs and original variances from previous rows are displayed in the brackets for the respective assessments.

Weighted Kappa using "square error weights" were computed for ordinal observations where applicable.

The definitions for Drug Type, Drug Action and Drug Indication are same as in the Table 1.

Comparison of results of telepsychiatric assessments and face-to-face interviews A: Accurate outcome, I: Inaccurate Outcome, AR: Accuracy Ratio (Risk Ratio), LCI: Lower 95% Confidence Interval, UCI: Upper 95% Confidence Interval, RD: Risk Difference when the sample statistics compared with Accuracy (Risk) of 1 for the face-to-face interview. All approximated p values are more than zero. Numbers in brackets represent index number of primary attributes serialised in the previous rows. Drug Types- Atypical Antipsychotic, Mood Stabilizer, Tricyclic Antidepressant, Newer Antidepressant, Other Antidepressant, Benzodiazepine Group, Non-Benzodiazepine Anxolytic, Anti-Parkinsonian, and 'No Medication'. Drug related actions were grouped into 'New Prescription', 'Continuation of previously prescribed medication with no change', and 'Adjustment of previously prescribed medication'. Drug Indications were categorised for Positive Psychotic Symptoms, Manic symptoms, Bipolar Disorder, Depression, Anxiety and Associated Symptoms, Drugs for Side Effects (e.g. Anti-parkinsonian), Hypnotics, and Use of Drugs' sedative effect for anxiety control and sleep. Cohen's Kappa results of intermethod and interviewers assessments The column 'Size' constitutes the number of total sample points used for assessments. The column 'Group' stands for the number of total categories of results e.g. all diagnoses with Paranoid Schizophrenia will make one group. CK: Cohen's Kappa values derived from 'n by n' table made from the Groups. The significant values (Lower CI more than 0) are printed in bold. The summary CK results in the rows from 29 to 47 are based on "Inverse Variance method" using CKs and variances from previous rows displayed in the brackets for the respective assessments. 95% Lower & Upper Confidence Intervals (LCI and UCI) from the rows 1 to 28 are derived from non-parametric Bias Corrected Accelerated (BCa) variance from 50,000 bootstrap samples with replacement. The summary LCIs and UCIs in the rows from 29 to 47 are based on "Inverse Variance method" using CKs and original variances from previous rows are displayed in the brackets for the respective assessments. Weighted Kappa using "square error weights" were computed for ordinal observations where applicable. The definitions for Drug Type, Drug Action and Drug Indication are same as in the Table 1. The ARs (Table 1) with nil variance (rows 6, 8, 10) were excluded from comparison due to the constant nature of observation data. The results with upper 95% confidence interval of AR>1 (rows 2, 3, 7, 11, 19, 24) were not treated as statistically significant due to the fact that the accuracy ratio cannot exceed a maximum value of 1. Using these criteria, all the remaining observations of the 'primary attributes' are both valid and statistically significant between 0.65 and 0.91 excepting one of the items (row 12) in Axis 4 of the DSM. The pooled ARs for the main attributes (rows 31, 34 to 37, and 39 to 44) and major attributes (rows 45 to 48) are from 0.65 to 0.91 and from 0.76 to 0.84 respectively. The overall AR (row 49) for the combined assessments is 0.81. The criteria described in the previous paragraph were also applied for RDs (Table 1). Accordingly, all the valid observations excluding DSM-IV Axis 4 Q9, range between -0.35 and -0.09 and are statistically significant. There is an overall accuracy difference of -0.19, with 95% confidence interval between -0.21 and -0.17. This result supports a hypothesis that overall outcome of telepsychiatric assessment is about 19% inferior to face-to-face interview. Table 2 has observation data about agreements between the two methods of interview and between the interviewing psychiatrists. There is trend to classify CK values ≤ 0 as poor, those from 0.01 to 0.20 as slight, from 0.21 to 0.40 as fair, from 0.41 to 0.60 as moderate, from 0.61 to 0.80 as substantial and from 0.81 to 1 as perfect [30]. From a total of 27 valid primary attributes (rows 1 to 28), 16 have moderate to substantial statistically significant intermethod BCa Kappa values. Similarly, 10 of 14 main attributes (rows 29 to 42) have agreements at moderate to substantial level. The Kappa scores for intermethod agreement of DSM Axes (rows 29 to 33) varied from substantial (0.65) to perfect (0.90) excepting axis 2, where the result was not statistically significant. The agreement was perfect (0.86) for combined DSM categories (row 43). The overall Kappa score for risk (row 44) was only slight, though it was substantial in respect of assessment of 'risk to self' (row 34). Agreement levels for investigations and psychological input (rows 23 and 24) were non-significant while that for Community Support (row 25) was moderate (0.55). There was moderate agreement (0.49) on non-drug intervention as a whole (row 45). Various components of Drug Treatment (rows 26 to 28) had agreement levels between high moderate (0.59) to perfect (0.83) with substantial rating (0.72) as a whole (row 46). Overall agreement between the telepsychiatric assessments and face-to-face interviews for the completed psychiatric assessments reached a downward approximated value of 0.60, hence it was substantial. The interrater agreements between the two interviewing psychiatrists were very close and could not be differentiated from that of intermethod assessment. Their respective values do not differ more than 0.01, with a considerable degree of overlap between their confidence intervals. Accordingly, there is no significant difference between intermethod and interrater agreements and are equivalent to each other.

Discussion

This study aims to establish whether conclusions drawn from telepsychiatric assessments are in agreement with those from the standard method of face-to-face assessment for new referrals in an outpatient clinic. Most new and old referrals to the Community Mental Health Teams in the UK and NZ are discussed in a multidisciplinary team setting. Application of DSM diagnostic criteria in NZ has gradually evolved into standard clinical practice and is becoming popular in the UK. The application of Decision Tree for diagnosis on Axis-1 of DSM-IV can be easily adopted without significant additional resources and training. The methodology adopted in this study emulates the real clinic situation; hence its findings are both applicable and relevant to day-to-day clinical practice. The authors have taken all necessary precautions in dealing with anticipated results, some of which might be paradoxical or erroneous e.g. CK and AR exceeding a maximum permitted value of 1. Those results where AR>1 and with nil variance were excluded from further statistical interpretation. Though the number of studied cases was only 37, the numbers of sample points (as in 'Size' column of Table 1) for statistical calculation were large enough for meaningful interpretation. The issue of sample size in handling large number of diagnostic categories for CK was dealt with bootstrapping technique and non-parametric Bias Corrected Accelerated Bootstrap Confidence Interval. This approach enhances the statistical quality of the data analysis. A previous study [5] evaluating twelve telepsychiatric and face-to-face assessments on multiple scales found a mean weighted kappa coefficient of 0.85. The interrater reliability for diagnosis between two psychiatrists in three different experimental conditions on 63 patients has been found to vary between 0.69 and 0.85 [6]. A review of telepsychiatry services in Australia concluded that this technology could be reliably used for treatment recommendations and diagnostic assessments [7]. A Canadian study involving child psychiatrists found that in 96% of cases, the diagnosis and treatment recommendations made via video-conferencing were identical to those made in face-to-face interviews [10]. Another study [8] using telecommunication and audiovisual technology found interrater diagnostic agreement of 0.70. Despite methodological differences, the results from the present study are consistent with the findings quoted above in this paragraph. Interrater agreement among clinicians for video-taped face-to-face interviews has been noted to have a relatively lower CK value of 0.55 [9]. It is possible that the flexibility conferred by the ability to question the patient in real time in a face-to-face or video-conferencing interview is an advantage over videotape assessment and accounting for improved intermethod and interrater agreement. In a field trial of DSM-III, the interrater reliability for face-to-face interviews for the major disorders varied between kappa values of 0.28 to 0.92 [31] and an another study on ICD-10 also yielded a fair to good kappa values for the four-character diagnostic codes [32]. In comparison, the outcome of telepsychiatric assessments in the current study is at least similar to the interrater reliability of face-to-face interviews from these large field trials. None of the primary studies quoted above tested intermethod agreement for a complete set of clinical parameters. Their sample size and statistical methodology are also limiting factors for satisfactory conclusions. In contrast, the present study employs suitable statistical methods for comprehensive outpatient assessment for multi-axial DSM-IV diagnosis, risk assessment, investigations, and treatment. As compared to standard method of face-to-face interviews, telepsychiatric assessments in this study have a high accuracy ratio (AR 0.81) and a substantial intermethod agreement (CK 0.60). Although the kappa value of intermethod agreement for risk assessment is low, it would be premature to ascribe this to telepsychiatric assessment itself. A study on videotaped interviews of 30 patients attending emergency psychiatry service revealed interrater correlation coefficients of 0.32 and 0.44 for risks to self and others respectively [33]. Another study conducted in a comparable setting also found similar results and came with the observation that in some circumstances the level of disagreement was high enough to warrant concern [34]. In a prospective study for risk assessment on 161 inmates of a high risk forensic unit [35] the agreement level among psychiatrists for face-to-face interviews in absence of operational criteria was very poor (CK -0.006). The same study reported that this agreement can be greatly enhanced (CK 0.742) by application of operational criteria. The findings in the present study for risk assessment are comparable to these results. In addition, the lower base level of risk in routine outpatient clinics in comparison to emergency and forensic psychiatric units is likely to cause further decrement in the kappa level. There is a paucity of research-based knowledge concerning levels of agreement for risk factors. Most of the scales currently used for risk-assessments have yet to have their reliability, validity and predictability ascertained. To establish agreement levels in statistical terms for uncommon risk elements (suicides and homicides etc.) would require an enormous sample which may not be feasible. Lack of a valid and reliable tool for risk assessment may produce erroneous results while uncertainty over the time frame (short-term, immediate and long-term) for risk anticipation may lead to inconsistencies in reporting and recording. The sort-term serious risk will probably be dealt with in the emergency system rather than through routine outpatient referrals and the long-term risks are likely to have minimal influence on the decision making process while dealing with routine outpatient referrals. The ability to reach an accurate DSM-IV-Axis-1 diagnosis through telepsychiatric assessment is perfect (CK 0.90) and accurate (AR 0.91). Arriving at a reasonable diagnostic impression is a pre-requisite of the medical recommendation for assessment or treatment under the Mental Health Act and this objective can be very well achieved through telepsychiatry. Another prerequisite under the act is to evaluate potential risks with input and information from various sources. Identification of risk related concerns direct from the interviewee constitutes only one component of this whole process. The referring agency usually indicates and expresses its concerns about risk elements and additional information are generally obtained on telephone from other sources such as clinicians and family members. With this in mind, low concordance on risk assessment may not necessarily be a limiting factor in the use of telepsychiatry for the purpose of Mental Health Act assessment.

Conclusion

Telepsychiatry is a dependable mode of service delivery for diagnostic assessment and psychiatric intervention in routine new referrals. Its accuracy varies between 79% and 83% in comparison with face-to-face interview. There is also an overall substantial agreement between these two methods of psychiatric evaluation. Although there is potential for usage of telepsychiatry for the Mental Health Act assessment, this requires further research using more refined operational tools to enhance the low accuracy and agreement scores found in the present study. The accuracy of conclusions arrived at from telepsychiatric assessment is likely to improve in future with further advances in technology [36].

Clinical implications

1. Allows telepsychiatric services to be made available to a geographically distant and inaccessible population where it is difficult and expensive to recruit mental health professionals. 2. Enhances confidence in use of telepsychiatry as an alternative mode of service delivery. 3. Increases scope of international research and collaboration in the practice of clinical psychiatry in different parts of the world.

Limitations and solutions

1. Although the outcome of risk assessment was similar to other studies, the level of agreement for this parameter is significantly low. There is scope to overcome this deficit through usage of operational criteria [35]. On this subject, some of the scientifically unexplored topics such as tools for risk assessment, its reliability and predictive value require further research. 2. The study assumed that there is 100% concordance between clinical decisions amongst psychiatrists if they conduct face-to-face interviews. This is seldom the case. Further studies with an added component to detect overall interrater agreement for face-to-face assessment will help in eliminating the need for this hypothetical 100% concordance rate. 3. There is an inherent problem in determining the sample size for CK for an unknown number of categories that may be encountered during a prospective research. This requires application of alternative statistical approaches. The current study has attempted to address some of these concerns through usage of resampling method and bootstrap confidence intervals.

List of Abbreviations used

APA: American Psychiatric Association AR: Accuracy Ratio BCaCI: Bias Corrected Accelerated Bootstrap Confidence Interval CK: Cohen's Kappa DSMIV: Diagnostic and Statistical Manual of Mental Disorders – 4th edition GAF: Global Assessment of Functioning HBHB: Hawkes Bay Health Board, New Zealand ICD: International Classification of Diseases ISDN: Integrated Services Digital Network KB: Kilo bits per second LIPI: List of Psychiatric Intervention LCI: Lower 95% Confidence Interval NZ: New Zealand NCMHT: Napier Community Mental Health Team R: Open source statistical package similar to S-Plus [29] R1: Researcher 1 – Surendra Singh R2: Researcher 2 – Dinesh Arya RAS: Risk Assessment Schedule RD: Risk Ratio RR: Relative Risk S: Standard method of face to face psychiatric assessment S-Plus: The statistics package built upon the S programming language SAS: The Statistical Analysis System from SAS Institute SPSS: The Statistical Package for the Social Sciences SPS: Principal author S P Singh UCI: Upper 95% Confidence Interval UK: United Kingdom V: Video-conferencing

Competing interests

The author(s) declare that they have no competing interests.

Authors' contributions

Surendra P Singh Planning, design, resource arrangement, coordination, template design, clinical assessment, data collection, data entry, statistical consideration, data analysis, review, writing and submission. Dinesh Arya Consultation and suggestion; especially about risk assessment and elements of statistical consideration, clinical assessment and data collection. Trish Peters Randomisation, liaison, consenting and coordination with other researchers and participants. All authors have read and approved the submitted manuscript.

Appendix-I

Table 3
Table 3
RAS (Risk Assessment Schedule)Enter the RISK in terms of NIL, LOW, MEDIUM, HIGH & VERY HIGH Risk.
MENTAL STATE (RAS – 1)NLMHV

Behaviour
• Dangerous or threatening actionsNLMHV
• Verbal/non-verbal risksNLMHV
• Deliberate self harmNLMHV
• AggressionNLMHV

Affect
• Arousal, anger, hostility, irritability, suspiciousness, fearNLMHV
• Low mood or elevated moodNLMHV

Cognition
• Fantasies of deliberate self harm or harm to othersNLMHV
• Persecutory thoughts, delusionsNLMHV
• External controlNLMHV
• ConfusionNLMHV
• Preoccupation, obsession, jealousyNLMHV
• Control over-rideNLMHV
• Cultural beliefsNLMHV

Perceptions
• Command hallucinationsNLMHV
• MisidentificationNLMHV
• MatakiteNLMHV

ENVIRONMENTAL/CURRENT FACTORS (RAS – 2)NLMHV

Immediate stressors
• Substance use, intoxication or withdrawalNLMHV
• RelationshipsNLMHV
• Presence or absence of supportNLMHV
• Absence of treatment, non complianceNLMHV
• Persecution or threats from othersNLMHV
• Arrest or criminal chargesNLMHV
• Loss including death of a peerNLMHV
• Cultural transgressionNLMHV
• Financial stressNLMHV

Access
• To weapons, pills, victimsNLMHV

Situation
• Referral from Prison, Police, Secure UnitNLMHV

Individual's attitude
• Co-operationNLMHV
• Refusal to co-operate or fear of compulsory treatmentNLMHV

HISTORICAL INFORMATION (RAS-3)NLMHV

Illness and incidents
• Patterns of illness – Chronic active, Neurological disorder, H/O Head injuryNLMHV
• Psychiatric history – Serious mental illness, Multiple diagnoses, Treatment under MHANLMHV
• History of incidents (and context) – Repeated antisocial behaviorNLMHV
• Treatment and outcomes – Compliance and response of treatment given in pastNLMHV
• Features of past crises – Pathological intoxication, Episodic dyscontrol, Cruelty to animals, BlackoutsNLMHV
• Personal history – Criminal charges, Previous offenses, Forensic involvementNLMHV

Personality
• Usual coping style – Loner, Displaced rage reaction, self mutilism, Impulsivity, denial, Blaming othersNLMHV

Family background
• Demographics – Single, Male, Poor, Low educational and vocational successNLMHV
• Culture – Tolerance to antisocial behavior, Reluctance to disclose, Shame & GuiltNLMHV
• Dynamics – H/O Violence, Contact of violent gangs, Intrafamilial violenceNLMHV

OUTCOME OF RISK ASSESSMENT(Use knowledge & Information gathered from RAS-1, 2 & 3)

RISK TO SELFNLMHV

1. Safety (including suicidal acts, deliberate self harm)NLMHV
2. Health (incl. drug & alcohol abuse, physical & psychological harm)NLMHV
3. Self neglect and vulnerability (incl. exploitation, sexual abuse, violence from others)NLMHV
4. Quality of life (including dignity, social and financial status)NLMHV

RISK TO OTHERSNLMHV

1. Violence (including emotional, sexual and physical violence)
2. Intimidation/threatsNLMHV
3. Neglect/abuse of DependantsNLMHV
4. Stalking/harassmentNLMHV
5. Reckless behavior (including driving)Property damage (including arson)NLMHV
6. Public nuisanceNLMHV

Appendix-II

Table 4
Table 4

LIST OF PSYCHIATRIC INTERVENTION (LIPI)

Admission/Discharge/Followup (LIPI-1):
Admission
Discharge
Followup
Investigations (LIPI-2):
Haematological
Biochemical
Serum Level of Medication
ECG tests
EEG Examination
CT/NMRI/PET/Isotope/Ultrasound
IQ Assessment
Other biological test
Other Psychological Tests
Psychological Intervention (LIPI-3):
Investigation
Simple Explanations
Support & Advice
Self Help by Book
Self Help Group
Assertiveness Training
Anger Control
Domestic Skills Training
Budget Handling Training
Social Skill Training
Counselling
Marital Therapy
Divers ional Therapy
Relaxation Training
Paper bag Ventilation
Systemic Desensitisation
Biofeedback
Cognitive Behaviour Therapy
Psychotherapy
Family Therapy
Other
Community Support (LIPI-4):
Intervention
CPN visit to support patient
CPN visit to support family
Activity Therapy (Gym, sports etc)
Handling Bills and Finances
Helping in making Applications to other agencies
Outreach
Home Help
Meals on Wheels
Art & Work therapy (Art, wood, gardening)
Day Hospital for socialisation

LIPI-5: Information related to Drug Type, Action and Indication scored in different table

LIST OF PSYCHIATRIC INTERVENTION (LIPI) LIPI-5: Information related to Drug Type, Action and Indication scored in different table

Pre-publication history

The pre-publication history for this paper can be accessed here:
  24 in total

Review 1.  Telepsychiatry in South Australia.

Authors:  F Hawker; S Kavanagh; P Yellowlees; R S Kalucy
Journal:  J Telemed Telecare       Date:  1998       Impact factor: 6.184

2.  Diagnostic reliability of telepsychiatry in American Indian veterans.

Authors:  Jay H Shore; Daniel Savin; Heather Orton; Jan Beals; Spero M Manson
Journal:  Am J Psychiatry       Date:  2007-01       Impact factor: 18.112

3.  A randomized, controlled trial of child psychiatric assessments conducted using videoconferencing.

Authors:  R Elford; H White; R Bowering; A Ghandi; B Maddiggan; K St John; M House; J Harnett; R West; A Battcock
Journal:  J Telemed Telecare       Date:  2000       Impact factor: 6.184

4.  Interrater agreement among psychiatrist in psychiatric emergency assessments.

Authors:  B B Way; M H Allen; J L Mumpower; T R Stewart; S M Banks
Journal:  Am J Psychiatry       Date:  1998-10       Impact factor: 18.112

5.  Telemedicine in Psychiatry: making the dream reality.

Authors:  D Bear; G Jacobson; S Aaronson; A Hanson
Journal:  Am J Psychiatry       Date:  1997-06       Impact factor: 18.112

6.  Evaluation of a telepsychiatry pilot project.

Authors:  S Doze; J Simpson; D Hailey; P Jacobs
Journal:  J Telemed Telecare       Date:  1999       Impact factor: 6.184

7.  Telemedicine technology and clinical applications.

Authors:  D A Perednia; A Allen
Journal:  JAMA       Date:  1995-02-08       Impact factor: 56.272

8.  Applicability of telemedicine for assessing patients with schizophrenia: acceptance and reliability.

Authors:  C A Zarate; L Weinstock; P Cukor; C Morabito; L Leahy; C Burns; L Baer
Journal:  J Clin Psychiatry       Date:  1997-01       Impact factor: 4.384

9.  Progress toward achieving a common language in psychiatry, II: Results from the international field trials of the ICD-10 diagnostic criteria for research for mental and behavioral disorders.

Authors:  N Sartorius; T B Ustün; A Korten; J E Cooper; J van Drimmelen
Journal:  Am J Psychiatry       Date:  1995-10       Impact factor: 18.112

10.  Client satisfaction in a feasibility study comparing face-to-face interviews with telepsychiatry.

Authors:  J E Bishop; R L O'Reilly; K Maddox; L J Hutchinson
Journal:  J Telemed Telecare       Date:  2002       Impact factor: 6.184

View more
  11 in total

1.  Review of key telepsychiatry outcomes.

Authors:  Sam Hubley; Sarah B Lynch; Christopher Schneck; Marshall Thomas; Jay Shore
Journal:  World J Psychiatry       Date:  2016-06-22

Review 2.  Furthering the reliable and valid measurement of mental health screening, diagnoses, treatment and outcomes through health information technology.

Authors:  Jessica E Haberer; Tom Trabin; Michael Klinkman
Journal:  Gen Hosp Psychiatry       Date:  2013-04-28       Impact factor: 3.238

Review 3.  Usefulness of telepsychiatry: A critical evaluation of videoconferencing-based approaches.

Authors:  Subho Chakrabarti
Journal:  World J Psychiatry       Date:  2015-09-22

4.  Current Directions in Videoconferencing Tele-Mental Health Research.

Authors:  Lisa K Richardson; B Christopher Frueh; Anouk L Grubaugh; Leonard Egede; Jon D Elhai
Journal:  Clin Psychol (New York)       Date:  2009-09-01

5.  Telepsychiatry and the meaning of in-person contact: a preliminary ethical appraisal.

Authors:  Aimee van Wynsberghe; Chris Gastmans
Journal:  Med Health Care Philos       Date:  2009-07-21

Review 6.  Telepsychiatry in the 21(st) century: transforming healthcare with technology.

Authors:  Stacie Deslich; Bruce Stec; Shane Tomblin; Alberto Coustasse
Journal:  Perspect Health Inf Manag       Date:  2013-07-01

7.  Collaborative Care at a Distance: Student Therapists' Experiences of Learning and Delivering Relationally Focused Telemental Health.

Authors:  Paul Springer; Richard J Bischoff; Kara Kohel; Nathan C Taylor; Adam Farero
Journal:  J Marital Fam Ther       Date:  2020-04-11

8.  Telepsychiatry and Outpatient Department Services.

Authors:  Laxmi Naresh Vadlamani; Virinchi Sharma; Amala Emani; Mahesh R Gowda
Journal:  Indian J Psychol Med       Date:  2020-11-01

9.  Can the Rorschach be Administered Remotely? A Review of Options and a Pilot Study Using a Newly Developed R-PAS App.

Authors:  Francesca Ales; Gregory J Meyer; Joni L Mihura; Andrea Corgiat Loia; Sara Pasqualini; Alessandro Zennaro; Luciano Giromini
Journal:  Psychol Inj Law       Date:  2022-03-16

10.  Telepsychiatry: Promise, potential, and challenges.

Authors:  Savita Malhotra; Subho Chakrabarti; Ruchita Shah
Journal:  Indian J Psychiatry       Date:  2013-01       Impact factor: 1.759

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.