Literature DB >> 29599699

A literature review of empirical research on learning analytics in medical education.

Abstract

The number of publications in the field of medical education is still markedly low, despite recognition of the value of the discipline in the medical education literature, and exponential growth of publications in other fields. This necessitates raising awareness of the research methods and potential benefits of learning analytics (LA). The aim of this paper was to offer a methodological systemic review of empirical LA research in the field of medical education and a general overview of the common methods used in the field in general. Search was done in Medline database using the term "LA." Inclusion criteria included empirical original research articles investigating LA using qualitative, quantitative, or mixed methodologies. Articles were also required to be written in English, published in a scholarly peer-reviewed journal and have a dedicated section for methods and results. A Medline search resulted in only six articles fulfilling the inclusion criteria for this review. Most of the studies collected data about learners from learning management systems or online learning resources. Analysis used mostly quantitative methods including descriptive statistics, correlation tests, and regression models in two studies. Patterns of online behavior and usage of the digital resources as well as predicting achievement was the outcome most studies investigated. Research about LA in the field of medical education is still in infancy, with more questions than answers. The early studies are encouraging and showed that patterns of online learning can be easily revealed as well as predicting students' performance.

Entities: Disease Species

Keywords: Learning analytics; medical education; review

Year: 2018 PMID： 29599699 PMCID： PMC5870333

Source DB: PubMed Journal: Int J Health Sci (Qassim) ISSN： 1658-3639

Introduction

Learning analytics (LA) is a rapidly evolving research discipline that aims at “measurement, collection, analysis, and reporting of data about learners and their contexts, for purposes of understanding and optimizing learning and the environments in which it occurs.”[1] The field of LA has been formally recognized after it had the first widely known definition during the first International Conference on LA and Knowledge (LAK, 2011).[1] The process of LA can be conceptualized as a cycle as shown in Figure 1; the first stage involves capturing data produced by learners from various sources; mainly the virtual or learning management system (LMS), registration systems, assessment platforms, and library systems. The second stage involves data preparation, analysis, and interpretation. The final stage is to create meaningful interventions using the insights generated from data analysis such as proactive intervention, personalizing support, or adaptive content. The last step is to evaluate the intervention; the feedback is then used to improve the whole process.[1-3]

Figure 1

The learning analytics cycle

A General Overview of LA Research Methodology: Data Collection

Both quantitative and qualitative data collection methods are used in empirical LA research. Nonetheless, the majority of studies favor the quantitative approach. Regarding the quantitative approach, the most common data captured about learners are the data generated by LMS such as time on task, the number of logins, participation in assessments, assignments, chats, or forum discussions.[4-8] To increase the scope of data collected and improve the accuracy of prediction, demographic data stored in registration systems or student information systems were added, such as age, gender, socioeconomic status was added.[9-14] To account for the social dimension of online activity, some researchers added social network analysis of online interactions to the demographic and LMS data.[8,15] For more in-depth analysis of online forums beyond the simple counts of replies and posts, Romero et al.[16] used qualitative, quantitative and social networks analysis to predict student performance. While not exactly fitting the strict definition of LA, Templar et al.[14] extended the scope of data collected to include self-reported surveys about learning styles, learning motivation and engagement, along with data from registration systems and LMS, and so did Dawson by including motivation survey results.[15] Data collected using qualitative methods included open-ended questions about students’ opinions of the usefulness of LA visualizations and how it can be used to support their learning,[17] semi-structured interviews about students’ perception of LA and their understanding of the different metrics.[18] It also includes interviews with registered students to get an understanding of the way they use the LMS,[19] focus group interviews to evaluate the design of a tool,[20] and semi-structured focus group interviews with instructors about patterns of LA data usage.[21] The quantitative data collection methods capture the number of times a student used a certain learning module or the duration he used an online tool. However It falls short of answering particular questions, like why this learning module was used and why others were not, what is the motive behind usage and how was the user experience. Furthermore, implementing LA requires collecting data from stakeholders such as students, teachers, administrators, and technicians. This necessitates the collection of data about how they perceive the technology, what is their feedback on the implementation, and suggestions to move forward and improve the current setup. Whereas qualitative methods enable researchers and administrators alike to gain a deep understanding of the practice and the process of LA and answer the why question, they are difficult to verify objectively, need skillful interviewers and resource intensive regarding time and effort. These shortcomings can be a serious hurdle in a field like LA that is built around the premise of automated data processing and recommendations. This problem might be solvable with qualitative data mining techniques. There is wide variation about what each study considers LA data. However, the common approach is that the nature of instructional conditions dictates to a greater extent which data are collected. In that sense, it would be relevant for courses offering interactive discussions to include social network analysis, financial data in paid colleges and library access in courses requiring research duties.[13] Relatedly, qualitative methods may be used in studies evaluating the implantation of LA or proposing a new framework for assessing the feasibility of a tool.

Data analysis

Similar to data collection, there is a lot of diversity among studies in the approach they take to data analysis and the outcome they are trying to achieve. Early studies used simple bivariate correlations to predict students’ performance[4,5] and were able to find cues to how students perform. Mazza et al.[6] pioneered the idea of using visual dashboards that show a summary of students’ activity. He created a tool, “CourseVis” which was intended to help teachers discover students in need for attention. The evaluation included a focus group discussion with five teachers and a semi-structured interview with another three teachers who used the improved version. Visual dashboards have become widely used later as to give feedback and information to teachers and students.[9,11,12] Ramos et al.[7] used a stepwise multiple regression models to predict the final outcome of students. To further validate the findings, he performed cross-validation using the equation generated from the “screening sample” to predict another test sample “the calibration sample.” Different regression methods were then used, and became the most widely used method for analysis, however, to predict students who are at risk of failing rather than just the final grade.[8,11,13,14,16] Logistic regression (LR) models are among the most widely used methods for detecting a dichotomous outcome. LR has gained popularity in LA due to the advantages they have, like, assumptions are easy to fulfill, the availability of powerful software to perform the test and comprehensively analyze the performance of the technique.[22] Exploration of new models of analysis that might prove more effective is a common practice in emergent LA research. This includes Course signals proprietary algorithm (A predictive student success algorithm) to predict students in need for support,[9] Spoon et al.[23] used random forest algorithms to identify which group of students might need a supplementary module and Romero et al.[16] used a classifier algorithm arguing that student success or failure is a classification problem. Wolf et al.[10] from the Open University one of the world’s largest distance education institutions which have an interesting and probably well-developed work in analytics-used General Unary Hypothesis Automation (GUHA) for data analysis. The main hypothesis was that identifying failing students would be more accurate if they are compared to peers of similar condition or their previous levels of activity; their work resulted later in a more mature and probably more complex system called “OU Analyse” reported by Kuzilek.[11] OU Analyse uses Four predictive models: A Bayesian classifier for finding the most significant LMS resources, k-Nearest Neighbors (k-NN) and classification and regression tree for the analysis of demographic/static data and k-NN with LMS data to predict students who are at risk and identify which kind of support they might need. Quantitative methods analysis might enable researchers to show correlations, predictions, patterns, or trends of usages. Nevertheless, the addition of qualitative data analysis boosts the robustness of the research and highlights different sides of the problem, like “how” and “why.” Analysis of interviews about visualization enabled Kirsi et al.[17] to understand why certain visualizations are better than the others from students’ point of view. Students suggested that visualization needs to be clearer with more hints for easy interpretation. Haden et al.[20] used interviews to get feedback about the infographic LA tool they created; the interviews helped them design a better final version. Simone et al.[18] used structured interviews with the students to assess motivations of LA usage; they found that students used LA to set goals for participation, monitor behavior using themselves and others as a reference and comparison yardstick. Similarly, Mtebe et al.[19] used qualitative methods to understand the “how” question; he conducted interviews with students to find out how they used LA. It is apparent from the aforementioned studies that there is no unified approach to data analysis, each approach might have some advantages and disadvantages. However, there are general conclusions one can make. Comparing students to their peers in the same course has proven to be the most efficient way to build an accurate predictive model,[9,10,13] that is why many predictive models were applied on course by course basis.[4,5,8,14] This might be because each course is structurally different, uses distinct LMS feature and incur a different load on learners.[10,13] A mixed methods approach that gathers and integrates insights from various data sources and stakeholders would have a significant role in the evolution of the LA, the tools we use and the frameworks we work within.[24]

Validity and reliability

Most studies did not use usual validation methods to test the performance of the predicting models across multiple courses.[10,13] Nonetheless, 10 cross fold-validation was used in two studies.[10] 10 cross fold-validation is a split sample technique (divides dataset it into 10 equal parts, and uses nine parts as a training dataset and one as a test).[10,16] In summary, evidence is growing toward individualizing prediction models for each context and against using a one size fits all prediction model.[13]

Objectives of LAs research

Predicting students’ performance,[4,5] or students who are at-risk of underachieving or failing a course is the most common objective of studies exploring the potential of LA.[4,7-13,16] Examples of this include, Macfadyen et al.,[8] were able to identify to at-risk students 70.3% with accuracy using only LMS data; Open University GUHA predicted failure with 88% confidence using both LMS and demographics[10] Results from OU analyze (included LMS, demographics, and financials) indicated that identifying at-risk students were possible with an accuracy of 50% at the beginning of the semester; accuracy increased to over 90% by the end of a semester.[11] The predictors of students’ success varied greatly among studies; some studies found that hits or clicking patterns are the main predictors of success.[5,7,10] Others have emphasized the role of forum postings and social network analysis.[8,16] In one study, formative assessment seemed to be the most important predictor,[14] and in another, indicators of student motivation were reported to play a role.[15] The variation in predictors importance among courses was also obvious in studies investigating more than one course at the same time; these studies found predictors for each course are much more accurate than using generalized predictors for all courses.[10,13] This variability in predictor performance is not a sign of inconsistency; it is rather a sign of the importance of considering teaching methods, instructional design and the way each LMS feature is used. It also underscores the need to tailor methodologies to specific contexts.[13] While most studies have investigated the predictors of students’ success for research purposes, some have built “early warning systems” that present results to students and teachers, so that teachers can create the appropriate intervention.[9-12] Alerting the students along with custom interventions have proven to improve retention and students’ success rates.[9-11]

LA in medical education

The number of publications in the field of medical education is still markedly scarce,[25] Despite recognition of the value of the discipline in the medical education literature,[26,27] and exponential growth of publications in other fields.[28] The reasons LA in medical education is lagging behind other educational fields include implementation issues that require advanced technical and analytical skills, the ethical and privacy issues and the lack of awareness about the potentials and benefits LA can bring.[25,26,29] A review of LA in medical education does not yet exist; Thus, the aim of this paper is to offer a systemic methodological review of empirical LA research in medical education to shed light on an emerging discipline to medical educators. The hope is that more researchers may appreciate the merits of the field and contribute to the existing volume of research.

Methods

The search was done in Medline database through the PubMed search engine, using the search term “LA.” The inclusion criteria included empirical original research articles investigating LA using qualitative, quantitative, or mixed methodologies. Articles were also required to be written in English, published in a scholarly peer-reviewed journal and have a dedicated section for methods and results. The following types of articles were excluded. Theoretical discussion of a framework, a concept or a suggested model without empirically testing the concept. Articles that discussed ethical, privacy, policy, or data handling issues. Editorials, reviews, opinions, reports, or commentary articles. Abstracts of conference papers where no full text is published.

The research question for this overview is

What are the types of data commonly captured about learners? What are the types of analytical approaches taken? What are the objectives and outcome of research papers?

Results

A Medline search for the term “LA” performed on September 13, 2017, resulted in 26 abstracts, one article was excluded as it was repeatedly published, 25 unique abstracts remained. All articles were retrieved and carefully reviewed. Five articles were excluded as they were opinion articles, four articles discussed a possible framework or methodology but did not assess it empirically, seven articles were not relevant to the topic of the study, two articles discussed the ethics of LA, and one article was excluded as it assessed the attitude toward LA. Only six articles were found to be empirical research articles fulfilling the inclusion criteria for this review as shown in Table 1. These articles were labeled as S1 to S6. S1by Choi-Lundberg et al., 2016, Australia,[30] S2 by Manzanares et al., 2017, Spain[31] S3 by Braatvedt et al., 2017, New Zealand,[32] S4 by Pecaric et al., 2017, Canada,[33] S5 by Saqr et al., 2017, Saudi Arabia,[29] and S6 by Scott et al., 2017, Australia.[34] All of these articles were published in the past two years; one study S1 was published in 2016,[30] the other five studies S2, S3, S4, S5, and S6 were published on 2017.[29,31-34]

Table 1

Tabulation of all studies and their methodologies

Tabulation of all studies and their methodologies The objectives of LA research were quite diverse, S1 aim was to improve students’ preparedness for dissection sessions, S2 was to study the pattern of students’ usage of blended learning, S3 was to study the pattern and engagement of usage of anatomy e-cases, S4 was to improve feedback and diagnostic skills of radiology simulation, S5 was to predict underachievers, and S6 investigated the learning habits and patterns of final stage clinical students. Regarding data collected, most studies (S1, S2, S5, and S6) collected LMS log data of student access, clicks and time of usage of online learning resources. The other two studies were more limited in scope as they collected usage data of a single online (anatomy e-cases in S3, and radiograph case simulation in S4). Given that most studies aimed to study the patterns of usage of online resources, the analysis methods were basic in most studies, including descriptive statistics and time pattern analysis; this was the prevalent analysis method in S1, S2, S3, and S6. S5 and S5 included more advanced regression analysis and predictive techniques. Most of the studies included only quantitative methods except for S3 that included thematic qualitative analysis. The outcomes of these results were not far from the objectives they sought, S1 was able to show students’ engagement with the e-cases and that e-cases were associated with better dissection skills. S2 was able to show a correlation between the degree of blending in e-learning and students’ online behavior and that pattern of online learning can be used to predict performance. S3 reported the pattern and engagement with the online anatomy cases. Similarly, S6 reported the patterns of usage of online resources. S4 was able to use the insights generated by LA to improve feedback given to radiology trainees. Finally, S5 used LA to predict underachieving students at the end of the course and at the mid-course time.

Discussion

Methodologically, the progress in the field of LA can be categorized into three main areas, the scope of data collected, data analysis methods, and the objectives of the research. In data capture, there has been a lot of advance in the scope of data collected gathering data across multiple systems and domains to increase predictability, namely, LMS, registration data, library data, demographics, and self-reported surveys. Regarding data analysis, the field has achieved considerable developments and understanding of the different models that were proven successful and could be replicated in future research especially regression models and classifiers. Regarding objectives, several studies have achieved fairly acceptable predictive capabilities of under-achieving students between 80% and 90% by the end of the course and (50–60%) at a mid-course time. There are also some early warning systems available, and others are being built and improved. Frameworks for intervention are being developed and tested for efficacy across multiple institutions. The reviewed literature here highlights the idea that research in LA is still exploratory in nature. There are some areas that need a lot of work, a standard mechanism for data collection and reporting is needed, the mechanism needs to be LMS independent and uses a standard set of metrics that can be shared across multiple systems. An accepted framework for analytics for classification of outcomes is also required and a so that data and results across can be shared or compared across institutions. Unexplored areas include using artificial intelligence, the role of semantic data mining in adding to the predictive accuracy and data collection beyond computer logs. For Medical education, this review is a testimony that we are yet to scratch the surface. There are lots of venues and uncharted territories that have not been explored, despite the considerable potential that we might get. The list of unanswered questions is long, examples include, the factors that best predict students’ achievement in health-care education, the correlation between online and offline learning, the rule of simulation in students’ learning, the modules that have the most cost-efficient impact on students’ learning, the competencies that students always miss, the areas that students find most challenging and what areas that we need to redesign so that it helps students perform better.

Conclusions

This paper aimed to offer a systemic methodological review of empirical LA research in the field of medical education and a general overview of the common methods used in the field in general. A Medline search resulted in only six articles fulfilling the inclusion criteria for this review. Most of the studies collected data about learners for LMSs or online learning resources. The analysis used mostly quantitative methods including descriptive statistics, correlation tests, and regression in two studies. Patterns of online behavior and usage of the digital resources as well as predicting achievement was the outcome most studies investigated. Apparently, research about LA in the field of medical education is still in infancy, with more questions than answers. The early studies are encouraging and showed that patterns of online learning could be easily revealed as well as predicting students’ performance.

9 in total

1. Learning Analytics And Medical Education.

Authors: Mohammed Saqr
Journal: Int J Health Sci (Qassim) Date: 2015-10

2. How learning analytics can early predict under-achieving students in a blended medical education course.

Authors: Mohammed Saqr; Uno Fors; Matti Tedre
Journal: Med Teach Date: 2017-04-19 Impact factor: 3.650

3. Online dissection audio-visual resources for human anatomy: Undergraduate medical students' usage and learning outcomes.

Authors: Derek L Choi-Lundberg; William A Cuellar; Anne-Marie M Williams
Journal: Anat Sci Educ Date: 2016-03-24 Impact factor: 5.958

4. Medical student use of digital learning resources.

Authors: Karen Scott; Anne Morris; Ben Marais
Journal: Clin Teach Date: 2017-03-16

5. Clinical anatomy e-cases: a five-year follow-up of learning analytics.

Authors: Vivek Perumal; Russell Butson; Phil Blyth; Ben Daniel
Journal: N Z Med J Date: 2017-01-27

6. Contemporary and future eLearning trends in medical education.

Authors: Iain Doherty; Neel Sharma; Darren Harbutt
Journal: Med Teach Date: 2014-08-26 Impact factor: 3.650

7. Developing the role of big data and analytics in health professional education.

Authors: Rachel H Ellaway; Martin V Pusic; Robert M Galbraith; Terri Cameron
Journal: Med Teach Date: 2014-02-03 Impact factor: 3.650

8. A Big Data and Learning Analytics Approach to Process-Level Feedback in Cognitive Simulations.

Authors: Martin Pecaric; Kathy Boutis; Jason Beckstead; Martin Pusic
Journal: Acad Med Date: 2017-02 Impact factor: 6.893

9. How Do B-Learning and Learning Patterns Influence Learning Outcomes?

Authors: María Consuelo Sáiz Manzanares; Raúl Marticorena Sánchez; César Ignacio García Osorio; José F Díez-Pastor
Journal: Front Psychol Date: 2017-05-16