| Literature DB >> 34946405 |
Afiq Izzudin A Rahim1, Mohd Ismail Ibrahim1, Sook-Ling Chua2, Kamarul Imran Musa1.
Abstract
While experts have recognised the significance and necessity of social media integration in healthcare, no systematic method has been devised in Malaysia or Southeast Asia to include social media input into the hospital quality improvement process. The goal of this work is to explain how to develop a machine learning system for classifying Facebook reviews of public hospitals in Malaysia by using service quality (SERVQUAL) dimensions and sentiment analysis. We developed a Machine Learning Quality Classifier (MLQC) based on the SERVQUAL model and a Machine Learning Sentiment Analyzer (MLSA) by manually annotated multiple batches of randomly chosen reviews. Logistic regression (LR), naive Bayes (NB), support vector machine (SVM), and other methods were used to train the classifiers. The performance of each classifier was tested using 5-fold cross validation. For topic classification, the average F1-score was between 0.687 and 0.757 for all models. In a 5-fold cross validation of each SERVQUAL dimension and in sentiment analysis, SVM consistently outperformed other methods. The study demonstrates how to use supervised learning to automatically identify SERVQUAL domains and sentiments from patient experiences on a hospital's Facebook page. Malaysian healthcare providers can gather and assess data on patient care via the use of these content analysis technology to improve hospital quality of care.Entities:
Keywords: Facebook; Malaysia; SERVQUAL; health informatics; machine learning; sentiment analysis; topic classification
Year: 2021 PMID: 34946405 PMCID: PMC8701188 DOI: 10.3390/healthcare9121679
Source DB: PubMed Journal: Healthcare (Basel) ISSN: 2227-9032
SERVQUAL Guideline.
| Domain | Description | Facebook Reviews Example |
|---|---|---|
| Tangible | General: The appearance of employees, equipment, and physical facilities of the hospital. | “Cleanliness of the Hospital is good” |
| Specific: | “Car parking is difficult and limited” | |
| “Satisfied with the facilities. Large room, feels like a hotel.” | ||
| “The hospital is well maintained, and their food is delicious.” | ||
| Reliability | General: Accurate, dependable, and consistent performance of the service. | “My appointment scheduled at 9 am but then it was postponed to 12.00 pm. Unbelievable.” |
| Specific: | “System needs to be improved especially discharge process. It took hours to settle it.” | |
| “Efficient and top-quality hospital services” | ||
| “Staff mistakenly collected medical record of other patient with similar name of mine” | ||
| Responsiveness | General: Willingness to provide prompt service to the patients. | “My specialist took his time to explain me about my disease and how he will treat it” |
| Specific: | “They answered all my questions during the admission.” | |
| “Arrived at emergency department due to road traffic accident and the medical team immediately respond to it.” | ||
| “I don’t feel any pain throughout the minor surgery on my arm, and it was done in a flash” | ||
| Assurance | General: the staff knowledge and courtesy, ability to inspire trust, confidence, and security. Also reflects on confidentiality and privacy of patients. | “The surgery was successful. Mr A is a competent and trusted surgeon.” |
| Specific: | “I feel comfortable and safe in this hospital. Just like at home” | |
| “The staff at the front desk was rude.” | ||
| “The doctors and staff nurses in this hospital are skillful and well-trained” | ||
| Empathy | General: Providing convenient services and giving attention or patience of the staffs to the patients’ needs. | “Nurses are very helpful.” |
| Specific: | “A staff came and offered to help my father climb stairs without we ask him. We appreciated his kindness.” | |
| “They are very concerned about patient’s condition and served it with their heart” | ||
| “The price is affordable compared to private hospital.” |
Figure 1Machine Learning Development Process.
Sentiment Analysis Guideline.
| Category | Description | Facebook Reviews Example |
|---|---|---|
| Positive | Expression of liking, approval, gratefulness | “I like this hospital. Doctors and nurses are pleasant and helpful.” |
| (Like, love, support, thankful etc.) | “Thank you for your service, Doctor and nurses.” | |
| Positive qualities of hospital services and facilities | “The wait time was brief. The pharmacy counter did an excellent job.” | |
| (Clean room, efficient, fast appointment, affordable etc.) | “The room is neat and tidy, and the food is delicious. I really like it.” | |
| Positive qualities of staff | “Staff are polite and kind.” | |
| (Polite, friendly, helpful, responsive etc.) | “Dr. B took her time explaining my health condition until I understood it. It was greatly appreciated.” | |
| Encourage or recommend others to use | “I recommend having your baby delivered at this hospital.” | |
| “I like their antenatal counselling and will recommend it to other couples. It is extremely beneficial to us.” | ||
| Positive/desirable effects of service | “I’d like to thank Mr A for performing bowel surgery on my father. He is now doing well.” | |
| (Successful treatment/procedures, good health outcome etc.) | “I found the physiotherapy session to be beneficial. I’m able to walk with less pain now.” | |
| Negative | Expression of disliking or disapproval | “I hate the security guard.” He was impolite to me!” |
| (Do not like, hate etc.) | “I’m not a fan of the food service here. The food has no taste.” | |
| Negative characteristic of hospital services or facilities | “The discharge procedure was extremely slow.” | |
| “There are a limited number of parking spaces available, and getting one is difficult.” | ||
| “We waited for 5 h at the out-patient clinic before seeing the doctor. This is intolerable.” | ||
| Negative qualities of staff | “Staff nurses were rude and stubborn. I requested assistance but received no response.” | |
| (Rude, not-friendly, not-helpful, slow responsive, incompetency etc.) | “The doctor criticised us for arriving at the emergency department at 3 a.m. for treatment. We were annoyed by his attitude.” | |
| Negative/undesirable effects | “My father fell in the toilet and was left alone for a few minutes. The hospital director must explain the incident to our family.” | |
| (Surgical or procedural complications, medicolegal, poor health outcome etc.) | “After being admitted to this hospital two days ago, my husband’s condition has deteriorated. No one, however, can explain the situation to us.” | |
| Neutral | Review that reports factual | “Serdang Hospital is one of the Klang Valley’s cardiac centres.” |
| information/no opinion. | “A Muslim-friendly hospital” | |
| Review as questions | “Do you have any spine surgeon in your hospital?” | |
| “How to get an appointment with your ear. Nose and throat (ENT) clinic?” | ||
| Too ambiguous/unclear/Greetings only | “Good morning.” | |
| “No comment.” | ||
| “Let’s wait and see first” |
Figure 2The number of records in training and test datasets for each SERQUAL domain.
Performance of ML models based on 5-fold cross validation.
| Multilabel Classifier | Model | Accuracy | Recall | Precision | F1-Score | Hamming Loss |
|---|---|---|---|---|---|---|
| Binary | NB | 0.147 | 0.761 | 0.701 | 0.730 | 0.315 |
| Relevance | SVM | 0.211 | 0.763 | 0.745 | 0.754 | 0.278 |
| LR | 0.193 | 0.775 | 0.732 | 0.753 | 0.285 | |
| Label Powerset | NB | 0.130 | 0.896 | 0.633 | 0.741 | 0.349 |
| SVM | 0.166 | 0.799 | 0.679 | 0.734 | 0.323 | |
| LR | 0.158 | 0.825 | 0.669 | 0.739 | 0.326 | |
| Classifier chain | NB | 0.149 | 0.756 | 0.705 | 0.730 | 0.313 |
| SVM | 0.215 | 0.761 | 0.753 | 0.757 | 0.273 | |
| LR | 0.191 | 0.770 | 0.727 | 0.748 | 0.290 | |
| RakEL | NB | 0.157 | 0.749 | 0.699 | 0.722 | 0.322 |
| SVM | 0.186 | 0.764 | 0.724 | 0.743 | 0.295 | |
| LR | 0.180 | 0.765 | 0.726 | 0.745 | 0.293 | |
| MLkNN | N/A | 0.140 | 0.737 | 0.697 | 0.715 | 0.327 |
| BRkNN | N/A | 0.157 | 0.648 | 0.732 | 0.687 | 0.330 |
Performance metrics for each SERVQUAL dimension of MLQC following 5-fold cross validation.
| Multi-Label | Base Classifier | Metrics | Tangible | Reliability | Responsive | Assurance | Empathy |
|---|---|---|---|---|---|---|---|
| Binary relevance | NB | Accuracy | 0.675 | 0.690 | 0.636 | 0.643 | 0.782 |
| Recall | 0.271 | 0.998 | 0.390 | 0.797 | 1.000 | ||
| Precision | 0.765 | 0.689 | 0.665 | 0.603 | 0.782 | ||
| F1-score | 0.399 | 0.815 | 0.485 | 0.681 | 0.878 | ||
| SVM | Accuracy | 0.716 | 0.736 | 0.640 | 0.730 | 0.786 | |
| Recall | 0.511 | 0.885 | 0.514 | 0.730 | 0.951 | ||
| Precision | 0.692 | 0.765 | 0.619 | 0.719 | 0.809 | ||
| F1-score | 0.587 | 0.820 | 0.558 | 0.721 | 0.874 | ||
| LR | Accuracy | 0.680 | 0.715 | 0.657 | 0.733 | 0.792 | |
| Recall | 0.369 | 0.970 | 0.464 | 0.764 | 0.999 | ||
| Precision | 0.678 | 0.716 | 0.675 | 0.711 | 0.791 | ||
| F1-score | 0.474 | 0.823 | 0.546 | 0.732 | 0.883 | ||
| Label powerset | NB | Accuracy | 0.661 | 0.692 | 0.554 | 0.566 | 0.782 |
| Recall | 0.497 | 0.998 | 0.876 | 0.941 | 0.999 | ||
| Precision | 0.612 | 0.690 | 0.506 | 0.529 | 0.783 | ||
| F1-score | 0.531 | 0.816 | 0.633 | 0.675 | 0.878 | ||
| SVM | Accuracy | 0.666 | 0.685 | 0.610 | 0.636 | 0.787 | |
| Recall | 0.471 | 0.884 | 0.688 | 0.816 | 0.948 | ||
| Precision | 0.618 | 0.720 | 0.553 | 0.590 | 0.812 | ||
| F1-score | 0.527 | 0.793 | 0.610 | 0.682 | 0.874 | ||
| LR | Accuracy | 0.642 | 0.702 | 0.614 | 0.612 | 0.802 | |
| Recall | 0.429 | 0.941 | 0.738 | 0.825 | 0.980 | ||
| Precision | 0.576 | 0.714 | 0.555 | 0.567 | 0.808 | ||
| F1-score | 0.487 | 0.812 | 0.629 | 0.670 | 0.886 | ||
| Classifier chain | NB | Accuracy | 0.675 | 0.690 | 0.635 | 0.652 | 0.782 |
| Recall | 0.271 | 0.997 | 0.371 | 0.786 | 1.000 | ||
| Precision | 0.765 | 0.689 | 0.675 | 0.619 | 0.782 | ||
| F1-score | 0.399 | 0.814 | 0.473 | 0.684 | 0.878 | ||
| SVM | Accuracy | 0.716 | 0.731 | 0.651 | 0.737 | 0.799 | |
| Recall | 0.511 | 0.873 | 0.538 | 0.730 | 0.938 | ||
| Precision | 0.692 | 0.766 | 0.630 | 0.727 | 0.829 | ||
| F1-score | 0.587 | 0.816 | 0.577 | 0.726 | 0.879 | ||
| LR | Accuracy | 0.680 | 0.716 | 0.644 | 0.716 | 0.794 | |
| Recall | 0.369 | 0.961 | 0.546 | 0.706 | 0.977 | ||
| Precision | 0.678 | 0.719 | 0.617 | 0.713 | 0.803 | ||
| F1-score | 0.474 | 0.822 | 0.576 | 0.704 | 0.881 | ||
| RakEL | NB | Accuracy | 0.639 | 0.692 | 0.628 | 0.648 | 0.782 |
| Recall | 0.173 | 0.995 | 0.506 | 0.714 | 1.000 | ||
| Precision | 0.689 | 0.691 | 0.651 | 0.630 | 0.782 | ||
| F1-score | 0.274 | 0.815 | 0.521 | 0.657 | 0.878 | ||
| SVM | Accuracy | 0.717 | 0.707 | 0.630 | 0.688 | 0.785 | |
| Recall | 0.494 | 0.900 | 0.522 | 0.719 | 0.952 | ||
| Precision | 0.708 | 0.733 | 0.598 | 0.666 | 0.807 | ||
| F1-score | 0.580 | 0.808 | 0.555 | 0.688 | 0.874 | ||
| LR | Accuracy | 0.675 | 0.718 | 0.650 | 0.693 | 0.799 | |
| Recall | 0.396 | 0.931 | 0.521 | 0.721 | 0.983 | ||
| Precision | 0.654 | 0.732 | 0.641 | 0.679 | 0.804 | ||
| F1-score | 0.491 | 0.819 | 0.563 | 0.693 | 0.884 | ||
| MLkNN | N/A | Accuracy | 0.648 | 0.688 | 0.629 | 0.641 | 0.761 |
| N/A | Recall | 0.493 | 0.829 | 0.530 | 0.683 | 0.936 | |
| N/A | Precision | 0.565 | 0.745 | 0.600 | 0.616 | 0.795 | |
| N/A | F1-score | 0.526 | 0.783 | 0.554 | 0.645 | 0.859 | |
| BRkNN | N/A | Accuracy | 0.640 | 0.690 | 0.641 | 0.631 | 0.750 |
| N/A | Recall | 0.292 | 0.860 | 0.376 | 0.529 | 0.878 | |
| N/A | Precision | 0.614 | 0.734 | 0.689 | 0.645 | 0.817 | |
| N/A | F1-score | 0.388 | 0.790 | 0.479 | 0.580 | 0.844 |
Figure 3Number of Records used in Sentiment Analysis (n = 1393).
Performance metrics of MLSA with 5-fold cross validation.
| Model | Accuracy | Recall | Precision | F1-Score |
|---|---|---|---|---|
| NB | 0.7810 | 0.9988 | 0.7769 | 0.8740 |
| SVM | 0.8743 | 0.9363 | 0.9028 | 0.9189 |
| LR | 0.8429 | 0.9917 | 0.8334 | 0.9057 |
Performance metrics of MLSA with hold out method.
| Model | Accuracy | Recall | Precision | F1-Score | |
|---|---|---|---|---|---|
| NB | Negative | 81% | 19% | 100% | 33% |
| Positive | 100% | 80% | 89% | ||
| SVM | Negative | 90% | 73% | 82% | 77% |
| Positive | 95% | 92% | 93% | ||
| LR | Negative | 87% | 49% | 92% | 64% |
| Positive | 99% | 86% | 92% |