Journal ArticleDOI
A comparison of reliability and precision of subscore reporting methods for a state English language proficiency assessment
Tanya Longabach,Vicki Peyton +1 more
TLDR
K-12 English language proficiency tests that assess multiple content domains (e.g., listening, speaking, reading, writing) often have subsections based on these content domains; scores assigned to...Abstract:
K–12 English language proficiency tests that assess multiple content domains (e.g., listening, speaking, reading, writing) often have subsections based on these content domains; scores assigned to ...read more
Citations
More filters
Journal ArticleDOI
Designing and validating a potential assessment inventory for assessing ELTs’ assessment literacy
TL;DR: In this article, a theoretical framework for the main four components of teacher assessment literacy, named validity, reliability, interpretability of the results, and efficiency, was developed through extensive review of the related literature and conducting interviews with PhD candidates of TEFL.
Dissertation
A comparison of subscore reporting methods for a state assessment of english language proficiency
TL;DR: This dissertation explored several methods of assigning subscores to the four domains of an English language proficiency test, including classical test theory (CTT)-based number correct, unidimensional item response theory (UIRT), augmented itemresponse theory (A-IRT), and multiddimensional item response Theory (MIRT),and compared the reliability and precision of these different methods across language domains and grade bands.
Journal ArticleDOI
Designing and validating a scale for evaluating the sources of unreliability of a high-stakes test
TL;DR: In this article , the authors went through a thorough literature review with the aim to identify the issues to be counted as sources of unreliability of a high-stakes test, i.e., the MA University Entrance Exam of English (UEEE) in Iran.
References
More filters
Book
Language Assessment: Principles and Classroom Practices
TL;DR: Preface 1 Testing, Assessing, and Teaching What Is a Test?
Journal ArticleDOI
mirt: A Multidimensional Item Response Theory Package for the R Environment
TL;DR: The mirt package was created for estimating multidimensional item response theory parameters for exploratory and confirmatory models by using maximum-likelihood meth- ods.
Journal ArticleDOI
A Generalized Partial Credit Model: Application of an EM Algorithm
TL;DR: The generalized partial credit model (GPCM) as discussed by the authors is a generalized PCM with a varying slope parameter, which is based on Andrich's (1978) rating scale formulation.
Book ChapterDOI
Multidimensional Item Response Theory
TL;DR: In this paper, the authors describe the commonly used multidimensional item response theory (MIRT) models and the important methods needed for their practical application, including ways to determine the number of dimensions required to adequately model data, procedures for estimating model parameters, ways to define the space for a MIRT model, and procedures for transforming calibrations from different samples to put them in the same space.