| Literature DB >> 32729844 |
Zackary Dunivin1, Lindsay Zadunayski2, Ujjwal Baskota3, Katie Siek1, Jennifer Mankoff4.
Abstract
BACKGROUND: Online physician reviews are an important source of information for prospective patients. In addition, they represent an untapped resource for studying the effects of gender on the doctor-patient relationship. Understanding gender differences in online reviews is important because it may impact the value of those reviews to patients. Documenting gender differences in patient experience may also help to improve the doctor-patient relationship. This is the first large-scale study of physician reviews to extensively investigate gender bias in online reviews or offer recommendations for improvements to online review systems to correct for gender bias and aid patients in selecting a physician.Entities:
Keywords: gender; physician-patient relationship; reviews; soft-skills
Year: 2020 PMID: 32729844 PMCID: PMC7426798 DOI: 10.2196/14455
Source DB: PubMed Journal: J Med Internet Res ISSN: 1438-8871 Impact factor: 5.428
Figure 1Left: Gender distribution in the complete data set (N=154,305). The unknown category represents clinics and names that were androgynous or unknown to the gender classifier. Middle: Distribution of physician ratings by physician gender (N=137,329). Right: Distribution of physician ratings by reviewer gender (N=129,985).
Themes that emerged in affinity diagramming and examples of the associated terms in the dictionaries used in the quantitative analysis.
| Theme | Sample terms | Kappaa | Count | Accuracyb | Precision | Recall | |||||||
|
| 0.61 | 113 | 0.77 | 0.74 | 0.90 | ||||||||
|
| Candor | Honest, explain, answer, direct | 0.95 | 41 | 0.84 | 0.57 | 0.76 | ||||||
|
| Trust | Support, safe, reassure, comfort | 0.98 | 27 | 0.42 | 0.41 | 0.63 | ||||||
|
| Investment | Respect, care, compassion, listen | 0.93 | 65 | 0.42 | 0.60 | 0.74 | ||||||
|
| Amicability | Warm, friendly, personable, funny | 0.94 | 31 | 0.82 | 0.46 | 0.84 | ||||||
|
| 0.53 | 25 | 0.87 | 0.44 | 0.16 | ||||||||
|
| Indifference | Cold, dismiss, ignored, abandoned | 0.74 | 16 | 0.87 | 0.50 | 0.16 | ||||||
|
| Disrespect | Rude, harass, condescending, arrogant | 0.40 | 25 | 0.92 | 0.50 | 0.44 | ||||||
| Process | Cost, nurse, staff, wait | 0.83 | 115 | 0.87 | 0.84 | 0.96 | |||||||
aKappa represents interrater agreement (on 20 reviews).
bAccuracy, precision, and recall, respectively, on a random sample (N=100) of 200 total reviews. A review is labeled as pertaining to a theme if at least one of the words in the theme in dictionary is presented in the review. Note the infrequency of negative soft skills (16 and 25 for indifference and disrespect, respectively), contributing to low precision and recall.
Prevalence of themes by physician gender.
| Theme | Doctors (female, n=36,847; male, n=74,189)a,b | ||||||||
|
| Female (negative; n=13,874), n (%)c | Female (positive; n=22,973), n (%)c | Male (negative; n=17,906), n (%)c | Male (positive; n=56,283), n (%)c | |||||
|
| 7403 (53.36) | 16,984 (73.93) | 7530 (42.05) | 37,993, (67.50) | |||||
|
| Candor | 2756 (19.86) | 3531 (15.37) | 2227 (12.44) | 8431 (14.98) | ||||
|
| Trust | 754 (5.43) | 2725 (11.86) | 745 (4.16) | 5566 (9.89) | ||||
|
| Investment | 4724 (34.05) | 12,266 (53.39) | 4830 (26.97) | 26,032 (46.25) | ||||
|
| Amicability | 1928 (13.90) | 7481 (32.56) | 1621 (9.05) | 16,047 (28.51) | ||||
|
| 4629 (33.36) | 488 (2.12) | 4505 (25.16) | 721 (1.28) | |||||
|
| Indifference | 868 (6.26) | 163 (0.71) | 836 (4.67) | 298 (0.53) | ||||
|
| Disrespect | 4112 (29.64) | 343 (1.49) | 3942 (22.01) | 442 (0.79) | ||||
| Process | 9330 (67.25) | 10,099 (43.96) | 9981 (55.74) | 23,837 (42.35) | |||||
aMany reviews contain multiple themes, so the overall rows (bold) have smaller numbers than the sum of themes would indicate. This table includes only those reviews for which a gender was assigned (n=111,036).
bThe physician rating is denoted as negative/positive.
cPercentages represent the proportion of reviews containing the theme for that particular gender/rating combination.
Prevalence of themes by reviewer gender.
| Theme | Reviewers (female, n=67,857; male, n=43,179)a,b | ||||||||
|
| Female (negative; n=18,780), n (%)c | Female (positive; n=49,077), n (%)c | Male (negative; n=13,000), n (%)c | Male (positive; n=30,179), n (%)c | |||||
|
| 9102 (48.47) | 35,334 (72.00) | 5831 (44.85) | 19,643 (65.09) | |||||
|
| Candor | 3104 (16.52) | 7815 (15.92) | 1879 (14.45) | 4147 (13.74) | ||||
|
| Trust | 906 (4.82) | 5636 (11.48) | 593 (4.56) | 2655 (8.80) | ||||
|
| Investment | 5834 (31.06) | 24,994 (50.93) | 3720 (28.62) | 13,304 (44.08) | ||||
|
| Amicability | 2231 (11.88) | 15,630 (31.85) | 1318 (10.14) | 7898 (26.17) | ||||
|
| 5868 (31.25) | 790 (1.61) | 3266 (25.12) | 419 (1.39) | |||||
|
| Indifference | 1101 (5.86) | 305 (0.62) | 603 (4.64) | 156 (0.52) | ||||
|
| Disrespect | 5178 (27.57) | 513 (1.05) | 2876 (22.12) | 272 (0.90) | ||||
| Process | 11,469 (61.07) | 21,728 (44.27) | 7842 (60.32) | 12,208 (40.45) | |||||
aMany reviews contain multiple themes, so the overall rows (bold) have smaller numbers than the sum of themes would indicate. This table includes only those reviews for which a gender was assigned (N=111,036).
bThe physician rating is denoted as negative/positive.
cPercentages represent the proportion of reviews containing the theme for that particular gender/rating combination.
Logistic regression on the presence of a theme in review (n=106,325).
| Modela | Intercept | DoctorFb | ReviewerFb | RatingNeg | DoctorF×RatingNeg | ReviewerF×RatingNeg | ReviewerF ×ReviewerF | |
|
| ||||||||
|
| Candor | −1.92c | 0.15c | 0.16c | −0.10c | —d | — | — |
|
| Trust | −2.40c | 0.19c | 0.27c | −1.01c | — | — | — |
|
| Investment | −0.40c | 0.27c | 0.21c | −0.88c | — | — | — |
|
| Amicability | −1.57c | 0.25c | 0.23c | −1.20c | — | — | — |
|
| Indifference | −5.26c | 0.29c | 0.18c | 2.19c | — | — | — |
|
| Disrespect | −5.30c | 0.35c | 0.24c | 3.39c | — | — | — |
|
| Candor | −1.88c | 0.01 | 0.16c | −0.31c | 0.47c | — | — |
|
| Trust | −2.39c | 0.18c | 0.27c | −1.03c | 0.04 | — | — |
|
| Investment | −0.40c | 0.26c | 0.21c | −0.89c | 0.01 | — | — |
|
| Amicability | −1.56c | 0.20c | 0.23c | −1.34c | 0.29c | — | — |
|
| Candor | −1.92c | 0.15c | 0.16c | −0.10c | — | −0.00 | — |
|
| Trust | −2.42c | 0.19c | 0.31c | −0.86c | — | −0.23c | — |
|
| Investment | −0.42c | 0.27c | 0.25c | −0.80c | — | −0.14c | — |
|
| Amicability | −1.58c | 0.25c | 0.25c | −1.12c | — | −0.13e | — |
|
| Disrespect | −5.34c | 0.46c | 0.31c | 3.39c | — | — | −0.16e |
aRows represent distinct logit models for each of the 7 themes. Each cell reports the log-likelihood that a variable is associated with the given theme. Sentences containing terms related to the process have been removed from the reviews.
bFemale=1, male=0.
cP<.001.
dMissing value indicates that no coefficient was estimated for the given endogenous variable.
eP<.01.
Logistic regression on rating negative (A: n=111,036; B: n=106,325). Models 1 and 2 differ in the inclusion of review content themes. The B variants show the effects of filtering sentences mentioning process from each review.
| Variable | Model 1A | Model 1B (no process) | Model 2A | Model 2B (no process) | |
| Intercept | −1.05 | −1.08 | −1.12 | −0.80 | <.001 |
| DoctorFa | 0.67 | 0.69 | 0.63 | 0.71 | <.001 |
| ReviewerFa | −0.16 | −0.14 | −0.14 | −0.11 | <.001 |
| DoctorF×ReviewerF | −0.03 | −0.05 | −0.02 | −0.03 | NSb |
| Candor | N/Ac | N/A | 0.06 | 0.05 | <.05 |
| Trust | N/A | N/A | −0.95 | −0.92 | <.001 |
| Investment | N/A | N/A | −0.89 | −0.90 | <.001 |
| Amicability | N/A | N/A | −1.49 | −1.18 | <.001 |
| Indifference | N/A | N/A | 2.35 | 2.28 | <.001 |
| Disrespect | N/A | N/A | 3.42 | 3.45 | <.001 |
| Process | N/A | N/A | 0.84 | N/A | <.001 |
aFemale=1, male=0.
bNS: not significant.
cNot applicable.
Logistic regression on the presence of process in review (N=111,036).
| Variable | Model 1 | Model 2 | |||
|
| DoctorFa×RatingNeg | ReviewerFa×RatingNeg | DoctorF×ReviewerF | Themes | |
| Intercept | −0.37b | −0.41b | −0.43b | −0.84b | |
| DoctorF | 0.06b | 0.20b | 0.19b | 0.12b | |
| ReviewerF | 0.11b | 0.11b | 0.14b | 0.04c | |
| RatingNeg | 0.54b | 0.70b | 0.78b | 0.83b | |
| DoctorF × RatingNeg | 0.42b | —d | — | — | |
| ReviewerF × RatingNeg | — | −0.02 | — | — | |
| DoctorF × ReviewerF | — | — | −0.13b | — | |
| Candor | — | — | — | 0.26b | |
| Trust | — | — | — | 0.34b | |
| Investment | — | — | — | 0.22b | |
| Amicability | — | — | — | 1.01b | |
| Indifference | — | — | — | 0.00 | |
| Disrespect | — | — | — | 0.70b | |
aFemale=1, male=0.
bP<.001.
cP<.01.
dMissing value indicate that no coefficient was estimated for the given endogenous variable.