| Literature DB >> 28298265 |
Chris Gibbons1,2, Suzanne Richards3, Jose Maria Valderas4, John Campbell4.
Abstract
BACKGROUND: Machine learning techniques may be an effective and efficient way to classify open-text reports on doctor's activity for the purposes of quality assurance, safety, and continuing professional development.Entities:
Keywords: data mining; feedback; machine learning; surveys and questionnaires; work performance
Mesh:
Year: 2017 PMID: 28298265 PMCID: PMC5371715 DOI: 10.2196/jmir.6533
Source DB: PubMed Journal: J Med Internet Res ISSN: 1438-8871 Impact factor: 5.428
Number of comments, distribution of words, and statistical comparison for each of the 5 categories.
| Categories | Reports in category | Length of report, mean (SD) | ANOVAa |
| Innovator | 59 | 41.99 (30.84) | <.001 |
| Interpersonal | 432 | 23.87 (16.39) | .99 |
| Popular | 131 | 25.49 (16.74) | .97 |
| Professional | 701 | 24.46 (17.34) | .91 |
| Respected | 346 | 20.69 (19.13) | .03 |
| More than 1 category | 1189 | 21.63 (16.76) | .56 |
| No categories | 425 | 19.54 (13.62) | <.001 |
aANOVA: analysis of variance; conducted with post hoc Tukey tests.
Example quotes from each category
| Theme | Comment |
| Innovator | “It is clear from the advice he gives that he is aware of [the] current good practice, is highly motivated, very practical and very much a team player. His advice, when working with consultant colleagues was respected, and he recognized where practice/primary care limitations were and yet looked for opportunities for change and improvement.” |
| “She has an admirable level of commitment and enthusiasm for her patients and her work. She has been instrumental in promoting change and improvement in her department. She is a great asset to the department and the hospital.” | |
| Interpersonal | “She is a very good, committed colleague always keen to improve, very liked by her patients and highly valued by all who work with her.” |
| “Very approachable and professional.” | |
| Popular | “Excellent well liked and easy working colleague.” |
| “Very popular doctor. Works to high standards.” | |
| Professional | “I find this doctor to be very efficient, caring, honest and very professional.” |
| “I find that he very easy and helpful to work with, he always has time for patients and staff.” | |
| Respected | “A first class colleague.” |
| “Pleasant and valued colleague.” | |
| Not coded by qualitative rater (given label of 0) | “Supportive colleague, excellent time management skills.” |
| “I think I have a good working relationship with this doctor. I have been impressed with his openness to Psychological work with his patients and his support for my work. In my opinion he gives thorough consideration to his diagnosis.” |
Figure 1Flow diagram of the stages “training,” “validation,” and “application to new data.”.
An example term-document matrix for 3 texts.
| Texts | Terms | |||||||||||
| a | and | colleague | doctor | great | is | patients | respected | this | troublesome | well | with | |
| Text 1a | 1 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Text 2b | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 |
| Text 3c | 0 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 |
aText 1: “A great colleague.”
bText 2: “A troublesome colleague.”
cText 3: “This doctor is well respected, and great with patients.”
Summary of algorithm and ensemble performance in the main analysis.
| Modela | Metric | Innovator | Interpersonal | Popular | Professional | Respected | Average |
| Support vector machine | .73 | .69 | .84 | .79 | .73 | .76 | |
| Scaled linear discriminant analysis | .77 | .65 | .88 | .73 | .77 | .76 | |
| Boosting | .75 | .77 | .81 | .76 | .75 | .77 | |
| Bootstrap boosting | .87 | .85 | .83 | .80 | .82 | .83 | |
| Random forests | .67 | .59 | .87 | .78 | .74 | .75 | |
| Decision tree | .80 | .75 | .88 | .78 | .80 | .80 | |
| Generalized linear model | .89 | .82 | .88 | .81 | .89 | .85 | |
| Maximum entropy | .70 | .62 | .73 | .65 | .70 | .68 | |
| Final ensemble (3+ models with | Recall | .98 | .80 | .97 | .82 | .87 | .89 |
| 10-Fold validation mean (range) | .97 (.96-.98) | .80 (.74-.86) | .97 (.96-.98) | .79 (.75-.83) | .86 (.84-.89) | .88 |
aTraining set size=1000; validation=636.
Figure 2Algorithm performance with differing training sample sizes. Performance decreases as expected with smaller training corpora.
Figure 3Comparison of General Medical Council Colleague Questionnaire (GMC-CQ) scores between doctors who were placed in 1 of the 5 categories versus those who were not (positive comments only). Significance (P) values for the t tests are shown to indicate the relationship between the 2 groups.
Comparison of means between doctors classified into a category and those who were unclassified.
| Categories | Panel Aa | Panel Bb | |||||||||
| Mean score | Reports in | Mean score | Reports in | ||||||||
| dfc | df | ||||||||||
| Innovator | 0.00 | 48 | 0.00 | 55.74 | .99 | 0.01 | 59 | 1.14 | 35.69 | .26 | |
| Interpersonal | 1.97 | 435 | 1.98 | 857.97 | .04 | 0.07 | 432 | 2.97 | 346.63 | <.01 | |
| Popular | −0.05 | 107 | −0.88 | 176.42 | .38 | 0.13 | 131 | 1.32 | 149.05 | .19 | |
| Professional | −0.03 | 643 | 2.51 | 901.34 | .01 | 0.1 | 701 | 3.47 | 286.99 | <.001 | |
| Respected | 0.15 | 243 | 3.75 | 629.17 | <.001 | 0.44 | 346 | 5.58 | 300.13 | <.001 | |
| More than 1 category | 0.04 | 1081 | 0.77 | 173.81 | .001 | 0.12 | 1189 | 3.81 | 239.8 | <.001 | |
| No categories | −0.09 | 413 | N/Ad | N/A | N/A | −0.4 | 425 | N/A | N/A | N/A | |
aPanel A: analysis using machine ensemble classifications on entire corpus.
bPanel B: analysis using human rater classifications on entire corpus.
cdf: degrees of freedom.
dN/A: not applicable.