On the vulnerability of automated scoring to construct-irrelevant response strategies (CIRS): An illustration

Isaac I. Bejar; Michael Flor; Yoko Futagi; Chaintanya Ramineni

doi:10.1016/j.asw.2014.06.001

Assessing Writing Oct 2014

On the vulnerability of automated scoring to construct-irrelevant response strategies (CIRS): An illustration

Isaac I. Bejar Educational Testing Service ; Michael Flor Educational Testing Service ; Yoko Futagi Educational Testing Service ; Chaintanya Ramineni Educational Testing Service

Journal: Assessing Writing
Published: 2014-10-01
DOI: 10.1016/j.asw.2014.06.001
CompPile: Search in CompPile ↗
Open Access: Closed
Export: BibTeX RIS

Citation Context

Cited by in this index (2)

Huang et al. (2021)

Using automated feedback to develop writing proficiency

Computers and Composition
Wilson et al. (2017)

Automated formative writing assessment using a levels of language framework

Assessing Writing

References (23)

American Educational Research Association (1999)

Standards for educational and psychological testing
Attali (2006)

Automated essay scoring with e-rater® V.2

Journal of Technology, Learning, and Assessment
Beigman Klebanov (2013)

Word association profiles and their use for automated scoring of essays

Proceedings of the 51st annual meeting of the association for computational linguistics (Sofia, Bulgaria)
Ben-Simon (2007)

Toward more substantively meaningful automated essay scoring

Journal of Technology, Learning, and Assessment
Bennett (1998)

Validity and automated scoring: It's not only the scoring

Educational Measurement: Issues and Practice ↗

Show all 23 →

Bejar (2011)

A validity-based approach to quality control and assurance of automated scoring

Assessment in Education ↗
Breland (1996)

Word frequency and word difficulty: A comparison of counts in four corpora

Psychological Science ↗
Breland (1994)

The College Board vocabulary study (No. College Board Report No. 94-4; Educational Testing Service Research Report No. 94-26)
Elliot (2005)

On a scale: A social history of writing assessment in America
Embretson (1983)

Construct validity: Construct representation versus nomothetic span

Psychological Bulletin ↗
Fellbaum (1998)

WordNet: An electronic lexical database
Godshalk (1966)

The measurement of writing ability
Hillocks (2002)

The testing trap
Huot (1996)

Toward a new theory of writing assessment

College Composition and Communication ↗
Kane (2006)

Validation

Educational measurement
Koretz (2006)

Testing for accountability in K-12

Educational measurement
Page (1966)

The imminence of grading essays by computer

Phi Delta Kappan
Powers (2002)

Comparing the validity of automated and human scoring of essays

Journal of Educational Computing Research ↗
Powers (2002)

Stumping e-rater: Challenging the validity of automated essay scoring

Computers in Human Behavior ↗
Powers (1994)

Direct assessment, direct validation? An example from the assessment of writing

Educational Assessment ↗
Quinlan (2009)

Evaluating the construct-coverage of e-rater® (RR 09-01)
Yu (2010)

Lexical diversity in writing and speaking task performances

Applied Linguistics ↗
Zeno (1995)

The educator's word frequency guide

CrossRef global citation count: 19 View in citation network → Build reading path →