Beyond literacy and competency – The effects of raters’ perceived uncertainty on assessment of writing

Mari Honko (University of Jyväskylä); Reeta Neittaanmäki (University of Jyväskylä); Scott Jarvis (Northern Arizona University); Ari Huhta (University of Jyväskylä)

Abstract

This study investigated how common raters’ experiences of uncertainty in high-stakes testing are before, during, and after the rating of writing performances, what these feelings of uncertainty are, and what reasons might underlie such feelings. We also examined if uncertainty was related to raters’ rating experience or to the quality of their ratings. The data were gathered from the writing raters (n = 23) in the Finnish National Certificates of Proficiency, a standardized Finnish high-stakes language examination. The data comprise 12,118 ratings as well as raters’ survey responses and notes during rating sessions. The responses were analyzed by using thematic content analysis and the ratings by descriptive statistics and Many-Facets Rasch analyses. The results show that uncertainty is variable and individual, and that even highly experienced raters can feel unsure about (some of) their ratings. However, uncertainty was not related to rating quality (consistency or severity/leniency). Nor did uncertainty diminish with growing experience. Uncertainty during actual ratings was typically associated with the characteristics of the rated performances but also with other, more general and rater-related or situational factors. Other reasons external to the rating session were also identified for uncertainty, such as those related to the raters themselves. An analysis of the double-rated performances shows that although similar performance-related reasons seemed to cause uncertainty for different raters, their uncertainty was largely associated with different test-takers’ performances. While uncertainty can be seen as a natural part of holistic ratings in high-stakes tests, the study shows that even if uncertainty is not associated with the quality of ratings, we should constantly seek ways to address uncertainty in language testing, for example by developing rating scales and rater training. This may make raters’ work easier and less burdensome.

Journal
Assessing Writing
Published
2023-07-01
DOI
10.1016/j.asw.2023.100768
Open Access
Hybrid