-
Brennan (1995)
Generalizability analyses of work key listening and writing tests
Educational and Psychological Measurement
↗
-
Brown (1991)
Do English and ESL faculties rate writing samples differently?
-
Caracelli (1993)
Data analysis strategies for mixed-method evaluation designs
Educational Evaluation and Policy Analysis
↗
-
Connor (1993)
The interpretation of tasks by writers and readers in holistically rated direct assessmen…
Reading in the composition classroom
-
Creswell (2007)
Designing and conducting mixed methods research
-
Cronbach (1990)
Essentials of psychological testing
-
Cumming (1990)
Expertise in evaluating second language compositions
-
Cumming (2002)
Assessing Writing
-
Cumming, A., Kantor, R., Powers, D., Santos, T., & Taylor, C. (2000). TOEFL 2000 writing framework: A working…
-
DeRemer (1998)
Assessing Writing
-
Engelhard (1994)
Examining rater errors in the assessment of written composition with a many-faceted Rasch model
Journal of Educational Measurement
↗
-
Ericsson (1984)
Protocol analysis: Verbal reports as data
-
Fulcher (2003)
Testing second language speaking
-
Guilford (1954)
Psychometric methods
-
Hamp-Lyons (1994)
Examining expert judgments of task difficulty on essay tests
Journal of Second Language Writing
↗
-
Henning (1996)
Accounting for nonsystematic error in performance ratings
-
Huot (1993)
The influence of holistic scoring procedures on reading and rating student essays
Validating holistic scoring for writing assessment
-
Johnson (2009)
The influence of rater language background on writing performance assessment
-
Kim (2009)
An investigation into native and non-native teachers’ judgments of oral English performan…
-
Kobayashi (1992)
Native and nonnative reactions to ESL compositions
-
Kondo-Brown (2002)
A FACETS analysis of rater bias in measuring Japanese second language writing performance
-
Linacre, J. M. (2005). A user's guide to FACETS: Rasch measurement computer program. Version 3.57. Chicago, IL.
-
Lumley (2005)
Assessing second language writing: The rater's perspective
-
Lynch (1998)
Using G-theory and many-facet Rasch measurement in the development of performance assessm…
-
McNamara (1996)
Measuring second language performance
-
Mendelsohn (1987)
Professors’ ratings of language use and rhetorical organizations in ESL compositions
-
Milanovic (1996)
A study of the decision-making behavior of composition markers
Studies in language testing, Vol. 3: Performance testing, cognition and assessment
-
Myford, C., & Wolfe, E. (2000). Monitoring sources of variability within the Test of Spoken English assessmen…
-
Myford (2004)
Detecting and measuring Edward effects using many-facet Rasch measurement: Part 1
Introduction to Rasch measurement
-
Orr (2002)
The FCE speaking test: Using rater reports to help interpret test scores
-
Saal (1980)
Rating the ratings: Assessing the psychometric quality of rating data
-
Sakyi (2000)
Validation of holistic scoring for ESL writing assessment: How raters evaluate ESL compositions
Fairness and validation in language assessment
-
Santos (1988)
Professors’ reactions to the academic writing of non-native speaking students
TESOL Quarterly
-
Shohamy (1992)
The effect of raters’ background and training on the reliability of direct writing tests
The Modern Language Journal
↗
-
Smith (2000)
Rater judgments in the direct assessment of competency-based second language writing ability
Studies in immigrant English language assessment
-
Spool (1978)
Training programs for observers of behaviors: A review
-
Stansfield (1988)
A long-term research agenda for the Test of Written English
-
Stock (1987)
Taking on testing: Teachers as testers researchers
English Education
-
Sweedler-Brown (1993)
ESL essay evaluation: The influences of sentence-level and rhetorical features
Journal of Second Language Writing
↗
-
Van Weeren (1987)
Testing pronunciation: An application of generablizability theory
-
Vann (1990)
Error gravity: Faculty response to errors in the written discourse of nonnative speakers …
Assessing second language writing in academic contexts
-
Vaughan (1991)
Holistic assessment: What goes on in the raters’ minds?
Assessing second language writing in academic contexts
-
Weigle (1994)
Effects of training on raters of ESL compositions
-
Weigle (1998)
Using FACETS to model rater training effects
-
Weir (2005)
Language testing and validation: An evidence-based approach
-
Wiseman, C. (2005). A validation study comparing an analytic scoring rubric and a holistic scoring rubric in …