Research Library

Showing 11 - 20 of 27 Research Library Publications

Timing Considerations for Performance Assessments

Posted: June 25, 2020 | M. J. Margolis, M. von Davier, B. E. Clauser

Integrating Timing Considerations to Improve Testing Practices

This chapter addresses timing considerations in the context of other types of performance assessments and reports on a previously unpublished experiment examining timing with respect to performance on computer-based case simulations that are used in physician licensure.

Category:Assessment-Oriented Research, General Measurement

A History of Test Speededness: Tracing the Evolution of Theory and Practice

Posted: June 25, 2020 | D. Jurich

Integrating Timing Considerations to Improve Testing Practices

This chapter presents a historical overview of the testing literature that exemplifies the theoretical and operational evolution of test speededness.

Category:Assessment-Oriented Research, General Measurement, Reliability/Validity

Using multisite instrumental variables to estimate treatment effects and treatment effect heterogeneity

Posted: April 29, 2020 | C.R. Runyon

UT Electronic Theses and Dissertations

Using Monte Carlo simulation, the current study examines the performance of three IV estimators and two conventional estimators in recovering the CATE and CATE heterogeneity under simulation conditions that resemble multisite trials of well-known educational programs.

Category:Assessment-Oriented Research, General Measurement

Adding Objectivity to Standard Setting: Evaluating Consequence Using the Conscious and Subconscious Weight Methods

Posted: February 26, 2020 | B.C. Leventhal, I. Grabovsky

Educational Measurement: Issues and Practice, 39: 30-36

This article proposes the conscious weight method and subconscious weight method to bring more objectivity to the standard setting process. To do this, these methods quantify the relative harm of the negative consequences of false positive and false negative misclassification.

Category:Assessment-Oriented Research, General Measurement

Handbook of Diagnostic Classification Models

Posted: August 31, 2019 | M. von Davier, YS. Lee

Springer International Publishing; 2019

This handbook provides an overview of major developments around diagnostic classification models (DCMs) with regard to modeling, estimation, model checking, scoring, and applications. It brings together not only the current state of the art, but also the theoretical background and models developed for diagnostic classification.

Category:Assessment-Oriented Research, General Measurement, Scoring

The Choice of Response Probability in Bookmark Standard Setting: An Experimental Study

Posted: January 16, 2019 | P. Baldwin, M.J. Margolis, B.E. Clauser, J. Mee, M. Winward

Educational Measurement: Issues and Practice, 39: 37-44

This article presents the results of an experiment in which content experts were randomly assigned to one of two response probability conditions: .67 and .80. If the standard-setting judgments collected with the bookmark procedure are internally consistent, both conditions should produce highly similar cut scores.

Category:Assessment-Oriented Research, General Measurement

Commentary: On the Importance of the Speed-Ability Trade-Off When Dealing with Not Reached Items

Posted: October 30, 2018 | S. Pohl, M. von Davier

Front. Psychol. 9:1988

In their 2018 article, (T&B) discuss how to deal with not reached items due to low working speed in ability tests (Tijmstra and Bolsinova, 2018). An important contribution of the paper is focusing on the question of how to define the targeted ability measure. This note aims to add further aspects to this discussion and to propose alternative approaches.

Category:Assessment-Oriented Research, Reliability/Validity, General Measurement

The Optimal Number of Options for Multiple-Choice Questions on High-Stakes Tests: Application of a Revised Index for Detecting Nonfunctional Distractors

Posted: October 25, 2018 | M.R. Raymond, C. Stevens, S.D. Bucak

Adv in Health Sci Educ 24, 141–150 (2019)

Research suggests that the three-option format is optimal for multiple choice questions (MCQs). This conclusion is supported by numerous studies showing that most distractors (i.e., incorrect answers) are selected by so few examinees that they are essentially nonfunctional. However, nearly all studies have defined a distractor as nonfunctional if it is selected by fewer than 5% of examinees.

Category:Assessment-Oriented Research, General Measurement

Evaluation of a New Method for Providing Full Review Opportunities in Computerized Adaptive Testing — Computerized Adaptive Testing with Salt

Posted: October 1, 2018 | Z. Cui, C. Liu, Y. He, H. Chen

Journal of Educational Measurement: Volume 55, Issue 4, Pages 582-594

This article proposes and evaluates a new method that implements computerized adaptive testing (CAT) without any restriction on item review. In particular, it evaluates the new method in terms of the accuracy on ability estimates and the robustness against test‐manipulation strategies. This study shows that the newly proposed method is promising in a win‐win situation: examinees have full freedom to review and change answers, and the impacts of test‐manipulation strategies are undermined.

Category:Assessment-Oriented Research, General Measurement, Applications of Technology

ALS Specific Quality of Life Short Form (ALSSQOL-SF): A Brief, Reliable and Valid Version of the ALSSQOL-R

Posted: July 20, 2018 | S. H. Felgoise, R. A. Feinberg, H. B. Stephens, P. Barkhaus, K. Boylan, J. Caress, Z. Simmons

Muscle Nerve, 58: 646-654

The Amyotrophic Lateral Sclerosis (ALS)‐Specific Quality of Life instrument and its revised version (ALSSQOL and ALSSQOL‐R) have strong psychometric properties, and have demonstrated research and clinical utility. This study aimed to develop a short form (ALSSQOL‐SF) suitable for limited clinic time and patient stamina.

Category:Assessment-Oriented Research, General Measurement, Health Professions

RESEARCH LIBRARY