On the vulnerability of automated scoring to construct-irrelevant response strategies (CIRS): An illustration

Isaac I. Bejar Educational Testing Service ; Michael Flor Educational Testing Service ; Yoko Futagi Educational Testing Service ; Chaintanya Ramineni Educational Testing Service
Journal
Assessing Writing
Published
2014-10-01
DOI
10.1016/j.asw.2014.06.001
CompPile
Search in CompPile ↗
Open Access
Closed
Export

Citation Context

Cited by in this index (2)

  1. Computers and Composition
  2. Assessing Writing

References (23)

  1. Standards for educational and psychological testing
  2. Automated essay scoring with e-rater® V.2
    Journal of Technology, Learning, and Assessment
  3. Word association profiles and their use for automated scoring of essays
    Proceedings of the 51st annual meeting of the association for computational linguistics (Sofia, Bulgaria)
  4. Toward more substantively meaningful automated essay scoring
    Journal of Technology, Learning, and Assessment
  5. Validity and automated scoring: It's not only the scoring
    Educational Measurement: Issues and Practice  
Show all 23 →
  1. A validity-based approach to quality control and assurance of automated scoring
    Assessment in Education  
  2. Word frequency and word difficulty: A comparison of counts in four corpora
    Psychological Science  
  3. The College Board vocabulary study (No. College Board Report No. 94-4; Educational Testing Service Research Report No. 94-26)
  4. On a scale: A social history of writing assessment in America
  5. Construct validity: Construct representation versus nomothetic span
    Psychological Bulletin  
  6. WordNet: An electronic lexical database
  7. The measurement of writing ability
  8. The testing trap
  9. Toward a new theory of writing assessment
    College Composition and Communication  
  10. Validation
    Educational measurement
  11. Testing for accountability in K-12
    Educational measurement
  12. The imminence of grading essays by computer
    Phi Delta Kappan
  13. Comparing the validity of automated and human scoring of essays
    Journal of Educational Computing Research  
  14. Stumping e-rater: Challenging the validity of automated essay scoring
    Computers in Human Behavior  
  15. Direct assessment, direct validation? An example from the assessment of writing
    Educational Assessment  
  16. Evaluating the construct-coverage of e-rater® (RR 09-01)
  17. Lexical diversity in writing and speaking task performances
    Applied Linguistics  
  18. The educator's word frequency guide