How invariant and accurate are domain ratings in writing assessment?

Stefanie A. Wind University of Georgia ; George Engelhard University of Georgia
Journal
Assessing Writing
Published
2013-10-01
DOI
10.1016/j.asw.2013.09.002
CompPile
Search in CompPile ↗
Open Access
Closed
Topics
Export

Citation Context

References (47) · 1 in this index

  1. An index of person separation in latent trait theory, the traditional KR.20 indices and t…
    Education Research and Perspectives
  2. Think-aloud protocols in research on essay rating: An empirical study of their veridicali…
    Language Testing  
  3. Recurrent issues and recent advances in scoring performance assessments
    Applied Psychological, Measurement  
  4. The growing (but still limited) importance of evidence in education policy and practice
    Journal of Educational Change  
  5. A model of rater behavior in essay grading based on signal detection theory
    Journal of Educational Measurement  
Show all 47 →
  1. Many-facet Rasch measurement
    Reference supplement to the manual for relating language examinations to the Common European Framework of Reference for Languages: Learning, teaching, assessment (Section H)
  2. Introduction to many-facet Rasch measurement: Analyzing and evaluating rater-mediated assessments
  3. The element of chance in competitive examinations
    Journal of the Royal Statistical Society
  4. TOEFL iBT test scores
  5. Individual feedback to enhance rater training: Does it work?
    Language Assessment Quarterly  
  6. On a scale: A social history of writing assessment in America
  7. Monitoring raters in performance assessments
    Large-scale Assessment Programs for All Students: Development, Implementation, and Analysis
  8. Invariant measurement: Using Rasch Models in the Social, Behavioral and Health Sciences
  9. Georgia grade 8 writing assessment interpretive guide
  10. Psychometric methods
  11. Assessing Writing
  12. The promises and challenges of implementing evidence-centered design in large-scale assessment
    Applied Measurement in Education  
  13. Assessing performance: Designing, scoring, and validating performance tasks
  14. A critique of Rasch residual fit statistics
    Journal of Applied Measurement
  15. Investigating the effectiveness of individualized feedback to rating behavior: A longitud…
    Language Testing  
  16. Performance rating
    Psychological Bulletin  
  17. Many-facet Rasch measurement
  18. Facets: Rasch Measurement Computer Program
  19. Facets Rasch measurement computer program, version 3.67, 1
  20. Assessment criteria in a large-scale writing test: What do they really mean to the raters?
    Language Testing  
  21. Rater characteristics and rater bias: Implications for training
    Language Testing  
  22. Validity
    Educational measurement
  23. Making sense of data from complex assessments
    Applied Measurement in Education  
  24. TIMSS 2011 international results in mathematics
  25. TIMSS 2011 international results in reading
  26. Performance appraisal: An organizational perspective
  27. Rater cognition research: Some possible directions for the future
    Educational Measurement: Issues and Practice  
  28. Detecting and measuring rater effects using many-facet Rasch measurement: Part I
    Journal of Applied Measurement
  29. Detecting and measuring rater effects using many-facet Rasch measurement: Part II
    Journal of Applied Measurement
  30. A model of background influences on holistic raters
    Validating holistic scoring for writing assessment: Theoretical and empirical foundations
  31. Probabilistic models for some intelligence and attainment tests Copenhagen: Danish Institute for Educational Research
  32. Rating the ratings: Assessing the psychometric quality of rating data
    Psychological Bulletin  
  33. Examining replication effects in Rasch fit statistics
    Objective measurement: Theory into practice (Vol. 5)
  34. Meaning and measurement of performance rating accuracy: Some methodological and theoretic…
    Journal of Applied Psychology  
  35. Race to the top assessment program executive summary
  36. Using FACETS to model rater training effects
    Language Testing  
  37. Examining rating quality in writing assessment: Rater agreement, error, and accuracy
    Journal of Applied Measurement
  38. Item and rater analysis of constructed response items via the multi-faceted Rasch model
    Journal of Applied Measurement
  39. A bootstrap approach to evaluating person and item fit to the Rasch model
    Journal of Applied Measurement
  40. Application of latent trait models to identifying substantively interesting raters
    Educational Measurement: Issues and Practice  
  41. Rating scale analysis: Rasch measurement
  42. Best test design: Rasch measurement