| Literature DB >> 35565099 |
Mario Jojoa1, Begonya Garcia-Zapirain1, Marino J Gonzalez2, Bernardo Perez-Villa3, Elena Urizar4, Sara Ponce5, Maria Fernanda Tobar-Blandon6.
Abstract
The aim of this study is to analyze the effects of lockdown using natural language processing techniques, particularly sentiment analysis methods applied at large scale. Further, our work searches to analyze the impact of COVID-19 on the university community, jointly on staff and students, and with a multi-country perspective. The main findings of this work show that the most often related words were "family", "anxiety", "house", and "life". Besides this finding, we also have shown that staff have a slightly less negative perception of the consequences of COVID-19 in their daily life. We have used artificial intelligence models such as swivel embedding and a multilayer perceptron as classification algorithms. The performance that was reached in terms of accuracy metrics was 88.8% and 88.5% for students and staff, respectively. The main conclusion of our study is that higher education institutions and policymakers around the world may benefit from these findings while formulating policy recommendations and strategies to support students during this and any future pandemics.Entities:
Keywords: COVID-19; Swivel embedding; continents; habits; institutions; mental health; natural language processing; online learning; perception; satisfaction; socio-demographic factors; university student; word cloud
Mesh:
Year: 2022 PMID: 35565099 PMCID: PMC9104371 DOI: 10.3390/ijerph19095705
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 4.614
Students and staff responses sorted by country. Source: elaborated by the authors from survey data.
| Country | Students | Staff |
|---|---|---|
| Spain | 106 | 82 |
| Colombia | 119 | 58 |
| Subtotal | 225 | 140 |
| Total | 365 | |
Figure 1Histogram of student data distribution of countries Spain and Colombia. Source: elaborated by the authors from survey data.
Figure 2Histogram of the distribution of the data of the staff set for Spain. Source: elaborated by the authors from survey data.
Figure 3Histogram of the distribution of the data, per sentiment categories, of the students and staff datasets for two countries, Spain and Colombia. Source: elaborated by the authors from survey data.
Figure 4Proposed solution model based on natural language processing.
Figure 5Diagram of the sentiment analysis of the input texts.
Figure 6Outline of the proposal with MLP.
Figure 7SVM proposal outline.
Figure 8Flowchart of the decision block to determine the text sentiment.
Figure 9Frequency-based infographic retrieval diagram to obtain the cloud words.
Average decision thresholds for each proposed model and respective dataset.
| Model | Dataset | Positive Max | Mean Positive to Neutral Threshold | Mean Neutral to Negative Threshold | Negative Min |
|---|---|---|---|---|---|
| MLP | Students | 1 | 0.49 | 0.22 | −1 |
| MLP | Staff | 1 | 0.52 | 0.38 | −1 |
| SVM | Students | 1 | 0.41 | 0.24 | −1 |
| SVM | Staff | 1 | 0.42 | 0.26 | −1 |
Metrics obtained for the present work for the different datasets and different classifiers.
| Metrics | MLP | MLP | SVM | SVM |
|---|---|---|---|---|
| Weighted | 92.49% | 92.59% | 88.42% | 82.55% |
| Weighted | 88.89% | 88.57% | 83.11% | 72.86% |
| Weighted | 88.64% | 88.47% | 82.86% | 71.77% |
| Weighted | 88.74% | 88.29% | 82.88% | 71.75% |
| Accuracy | 88.88% | 88.57% | 83.11% | 72.85% |
Confusion matrix obtained for the set of students with the proposed MLP model.
| Class | Real | |||
|---|---|---|---|---|
| Negative | 165 | 8 | 2 | |
| Neutral | 1 | 22 | 6 | |
| Positive | 6 | 2 | 13 | |
| Predicted | Class | Negative | Neutral | Positive |
Relevant misclassifications of the classification model. Input positives labeled as negatives for the student set. In original Spanish language.
| Positives Classified as Negatives |
|---|
| Solidaridad |
| Con la pandemia he a prendido a valorar lo que tenemos |
Relevant misclassifications of the classification model. Input negatives labeled as positives for the whole student body. In original Spanish language.
| Negatives Classified as Positives |
|---|
| La mayoría de estudiantes universitarios sufren mucho |
| que den buen descuento de la matrícula |
| La respuesta de los organismos (Gobierno, Universidad…) fue insuficiente con poca información y bastante incertidumbre esto es lo que ha provocado la mayor fuente de estres, ansiedad y malestar. Además de una sensación de indefensión y vulnerabilidad |
| No se ha tenido en cuenta la desigualdad de recuerdos e imposibilidad de muchas familias sin internet a proseguir sus estudios. |
| SOY ESTUDIANTE DE UNIVERSIDAD. EL TRABAJO SE HA MULTIPLICADO POR 3 PORQUE LOS PROFESORES CONSIDERAN QUE “TENEMOS MÁS TIEMPO PORQUE ESTAMOS TODOS EN CASA”. LA ACUMULACIÓN DE TRABAJO ES BOCHORNOSA E INJUSTIFICABLE. |
| La pandemia ha colaborado en que la generación de los milenials tengan muchas dificultades para encontrar estabilidad laboral acorde con sus estudios. |
Relevant misclassifications of the classification model. Input positives categorized as negatives for the staff set. In original Spanish language.
| Positives Classified as Negatives |
|---|
| Los países tenemos una oportunidad de aprender a mejorar la conciliación familiar, la lucha contra la contaminación ambiental, el teletrabajo y el modo no presencial o mixto en educación, oportunidad para ir bajando el uso de la moneda en papel y bajar el fraude monetario, usar mano de obra nacional en puestos de trabajo de importación de trabajadores extranjeros, mejorar la tasa poblacional en los pueblos y disminuir el descenso de la población rural, aumentar las horas dedicadas al ejercicio. |
| He aprendido a gestionar mejor mi tiempo, mis recursos económicos y mis relaciones con los demás miembros de la familia. No estaba en activo durante el tiempo de confinamiento, lo que me ha ayudado a centrarme en la familia. Considero muy complicado conciliar vida laboral y familiar si tengo que trabajar con mis hijos en casa. |
| Soy personal de administración y servicios de mi universidad. Trabajo desde casa casi igual que antes, usando la línea familiar de ADSL. Trabajar en casa me permite disponer de luz solar, mientras que mi puesto de trabajo se encuentra iluminado artificialmente. |
| El confinanmiento me ha hecho reflexionar sobre lo rápido que iba la vida cotidiana: el trabajo, la vida social...que somos muy vulnerables, que apenas tenemos tiempo para dedicarlo a lo que nos gusta o a nuestros amigos... |
| Esta experiencia me ha hecho pensar que para algunos sectores, el teletrabajo es una opción fiable y beneficiosa para los trabajadores. Creo que tiene que seguir incluso cuando salgamos de esta pandemia. Dar la opción de trabajar desde casa se tiene que contemplar seriamente no solo como medida de preparación para posibles rebrotes sino también una herramienta para mejorar el bienestar de los trabajadores |
Relevant misclassifications of the classification model. Input negatives labeled as positives for the staff set. In original Spanish language.
| Negatives Classified as Positive |
|---|
| Reduce mucho la calidad de trabajo, es decir, el rendimiento y la concentración. |
Relevant misclassifications of the classification model (positives labeled as negatives) for the student set.
| Positives Classified as Negatives |
|---|
| Solidarity |
| With the pandemic, I have come to appreciate what we have |
Relevant misclassifications of the classification model (negatives labeled as positives) for the whole student body.
| Negatives Classified as Positives |
|---|
| Most college students suffer greatly |
| that give a good tuition discount |
| The response of the organizations (Government, University...) was insufficient, with little information and a great deal of uncertainty, which is what has caused the greatest source of stress, anxiety and discomfort. In addition to a feeling of helplessness and vulnerability. |
| The inequality of memories and the impossibility of many families without internet to continue their studies has not been taken into account. |
| I am a university student. the workload has been multiplied by 3 because the professors consider that ‘‘we have more time because we are all at home’’. the accumulation of work is embarrassing and unjustifiable. |
| The pandemic has contributed to the fact that the millennial generation is having a very difficult time finding job stability in line with their studies. |
Confusion matrix obtained for the staff ensemble with the proposed MLP model.
| Class | Real | |||
|---|---|---|---|---|
| Negative | 70 | 3 | 5 | |
| Neutral | 0 | 22 | 4 | |
| Positive | 1 | 3 | 32 | |
| Predicted | Class | Negative | Neutral | Positive |
Class
Real
Relevant misclassifications of the classification model (positives categorized as negatives) for the staff.
| Positives Classified as Negatives |
|---|
| Countries have the opportunity to learn how to improve family reconciliation, the fight against environmental pollution, teleworking and non-face-to-face or mixed modality in education, the opportunity to decrease the use of paper money and reduce monetary fraud, use national labor in jobs imported by foreign workers, improve the population rate in villages and reduce the decline in rural population, increase the hours dedicated to exercise, etc. |
| I have learned to better manage my time, my financial resources and my relationships with other family members. I was not working during the confinement time, which has helped me to focus on the family. I find it very difficult to reconcile work and family life if I have to work with my children at home. |
| I am an administration and services staff at my university. I work from home almost as before, using the family ADSL line. Working at home allows me to have sunlight, while my workstation is artificially illuminated. |
| The confinement has made me reflect on how fast everyday life was going: work, social life... that we are very vulnerable, that we hardly have time to dedicate to what we like or to our friends... |
| This experience has made me think that for some sectors, telework is a reliable and beneficial option for workers. I think it has to continue even when we come out of this pandemic. Giving the option to work from home has to be seriously contemplated not only as a preparedness measure for possible resurgences but also as a tool to improve the well-being of the workers. |
Relevant misclassifications of the classification model (negatives labeled as positives) for the staff set.
| Negatives Classified as Positives |
|---|
| It greatly reduces the quality of work, i.e., performance and concentration. |
Confusion matrix obtained for the set of students with the proposed SVM model.
| Class | Real | |||
|---|---|---|---|---|
| Negative | 160 | 11 | 3 | |
| Neutral | 8 | 18 | 9 | |
| Positive | 4 | 3 | 9 | |
| Predicted | Class | Negative | Neutral | Positive |
Confusion matrix obtained for the set of staff with the proposed SVM model.
| Class | Real | |||
|---|---|---|---|---|
| Negative | 66 | 5 | 11 | |
| Neutral | 3 | 15 | 9 | |
| Positive | 2 | 8 | 21 | |
| Predicted | Class | Negative | Neutral | Positive |
Figure 10Infographic of the analyzed text corresponding to the open question of the data collection instrument.