| Literature DB >> 35937141 |
Yigal Attali1, Andrew Runge1, Geoffrey T LaFlair1, Kevin Yancey1, Sarah Goodwin1, Yena Park1, Alina A von Davier1.
Abstract
Automatic item generation (AIG) has the potential to greatly expand the number of items for educational assessments, while simultaneously allowing for a more construct-driven approach to item development. However, the traditional item modeling approach in AIG is limited in scope to content areas that are relatively easy to model (such as math problems), and depends on highly skilled content experts to create each model. In this paper we describe the interactive reading task, a transformer-based deep language modeling approach for creating reading comprehension assessments. This approach allows a fully automated process for the creation of source passages together with a wide range of comprehension questions about the passages. The format of the questions allows automatic scoring of responses with high fidelity (e.g., selected response questions). We present the results of a large-scale pilot of the interactive reading task, with hundreds of passages and thousands of questions. These passages were administered as part of the practice test of the Duolingo English Test. Human review of the materials and psychometric analyses of test taker results demonstrate the feasibility of this approach for automatic creation of complex educational assessments.Entities:
Keywords: automatic item generation; language modeling; psychometrics; reading assessment; transformer models
Year: 2022 PMID: 35937141 PMCID: PMC9354894 DOI: 10.3389/frai.2022.903077
Source DB: PubMed Journal: Front Artif Intell ISSN: 2624-8212
Figure 1Median response time distributions (in seconds).
Figure 2Mean score (easiness) distributions.
Figure 3Item-total correlations with total practice test score.
Figure 4Partial correlations between item pairs.