Pinakes — Rhetoric & Composition

Assessing Writing

1016 articles

–

October 2025

Oct 2025

Improving writing feedback quality and self-efficacy of pre-service teachers in Gen-AI contexts: An experimental mixed-method design ↗

Siyu Zhu; Qingyang Li; Yuan Yao; Jialin Li; Xinhua Zhu

doi:10.1016/j.asw.2025.100960
Oct 2025

Lexical richness in young English learners’ writing: A focus on opinion and listen-write task types ↗

Hakyung Sung; Mikyung Kim Wolf; Michael Suhan; Kristopher Kyle

doi:10.1016/j.asw.2025.100975
Oct 2025 OA PDF

Linguistic predictors of L2 writing performance: Variations across genres ↗

Weiwei Yang; Sara T. Cushing; Guoxing Yu

genre theory multilingual writers

doi:10.1016/j.asw.2025.100985
Oct 2025

Assessing writing practices in higher education: Characterizing self-reported practices and identifying their determinants ↗

Dyanne Escorcia; Kiara Campo; Gabriela Navarro; Christine Ros

assessment

doi:10.1016/j.asw.2025.100976
Oct 2025

Impact of task repetition schedules and emotions on L2 writing performance profiles using latent transition analysis ↗

Mahmoud Abdi Tabari; Hansol Lee

multilingual writers

doi:10.1016/j.asw.2025.100974
Oct 2025 OA PDF

Can generative AI figure out figurative language? The influence of idioms on essay scoring by ChatGPT, Gemini, and Deepseek ↗

Enis Oğuz

artificial intelligence

doi:10.1016/j.asw.2025.100981
Oct 2025

Understanding the critical thinking experiences of L2 student writers engaged in linguistically supported peer feedback giving ↗

Xiaodong Zhang

argument

doi:10.1016/j.asw.2025.100977
Oct 2025

Response time for English learners on large-scale writing assessments ↗

Catherine Welch; Stephen Dunbar; Jeongmin Ji; Annette Vernon; Junhee Park

assessment

doi:10.1016/j.asw.2025.100979
Oct 2025

Comparing GPT-based approaches in automated writing evaluation ↗

Yingying Liu; Xiaofei Lu; Huilei Qi

assessment artificial intelligence

doi:10.1016/j.asw.2025.100961
Oct 2025

Assessing L2 writing formality using syntactic complexity indices: A fuzzy evaluation approach ↗

Zhiyun Huang; Guangyao Chen; Zhanhao Jiang

assessment multilingual writers grammar and mechanics

doi:10.1016/j.asw.2025.100973
Oct 2025

Growth mindset and writing engagement: The roles of motivation regulation and engagement with teacher’s written corrective feedback ↗

Mahdieh Darvari; S. Yahya Hejazi; Majid Sadoughi

teacher development

doi:10.1016/j.asw.2025.100980
Oct 2025

Criterion validity evidence and alternate form reliability of curriculum-based measures of written expression for eighth grade students ↗

John Elwood Romig; Amanda A. Olsen; Elizabeth Medina; Anna Tulloh

doi:10.1016/j.asw.2025.100958
Oct 2025 OA PDF

Judgment accuracy in primary school EFL writing assessment: Do text characteristics matter? ↗

Ruth Trüb; Jens Möller; Julian Lohmann; Thorben Jansen; Stefan D. Keller

Abstract

Assessing the writing competence of pupils learning English as a foreign language (EFL) at primary school is challenging. This study aimed at examining a largely unexplored topic, namely the role of text characteristics in writing assessment, and analysed judgment accuracy differentiated by nine aspects of text quality (communicative effect, level of detail, coherence, cohesion, complexity of syntax and grammar, correctness of syntax and grammar, vocabulary, orthography and punctuation). Two hundred pre-service teachers assessed four randomly assigned texts from learners in grade six. Their assessment was compared to the existing ratings of two experts from a previous study. We found a relative judgment accuracy between r = .34 and .60 for the nine assessment criteria, with vocabulary being assessed significantly more accurately than almost all other criteria. Orthography, complexity and correctness of syntax and grammar and punctuation were rated with significantly more accuracy than cohesion, level of detail, communicative effect and coherence. The pre-service teachers assessed most criteria more strictly and with higher variability than the experts. The results suggest that teacher education should offer pre-service teachers concrete opportunities to practise writing assessment, implement activities to strengthen the assessment of content- and structure-related criteria, and help them adjust their assessment rigour. • Judgment accuracy in the assessment of primary school EFL learners’ texts. • Relative judgment accuracy between r = .34 and .60 for the different criteria. • Significant differences in relative judgment accuracy between assessment criteria. • Linguistic text qualities are assessed with more accuracy than content- and structure-related aspects. • Pre-service teachers are more rigorous and heterogeneous in rating than experts.

teacher development assessment multilingual writers grammar and mechanics

doi:10.1016/j.asw.2025.100957
Oct 2025 OA PDF

Exploring the scoring validity of holistic and dimension-based Comparative Judgements of young learners’ EFL writing ↗

Rebecca Sickinger; John Pill; Tineke Brunfaut

Abstract

Comparative Judgement (CJ) is a pairwise comparison evaluation method, typically conducted online. Multiple judges each compare the quality of a series of paired performances and, from their decisions, a rank order is constructed and scores calculated. Research across different educational contexts supports CJ’s reliability for evaluating written performances, permitting more precise scoring of scripts and for dimension-focused evaluation. However, scant insights are available about the basis of judges’ evaluations. This issue is important because argument-based approaches to validation (common in the field of language testing and adopted in this study) require evidence to support claims about how scores are appropriate for test purpose. Therefore, we investigate the scoring validity of CJ, both when used holistically (the standard application of CJ) and when evaluating scripts by individual criteria (termed dimensions in the research context). Twenty-seven judges evaluated 300 scripts addressing two writing task types in a national English as a Foreign Language examination for young learners in Austria. Judges reported via questionnaires what they had focused on while judging. Subsequently, eight judges provided think-aloud data while evaluating 157 scripts, offering further insight into the writing features they considered and their decision-making during CJ. Findings showed that while most judges adapted a decision-making process similar to traditional rating methods, some adapted their method to accommodate the nature of CJ evaluation. Furthermore, results indicated that the judges considered construct-relevant criteria when using CJ, both holistically and by dimension, thus offering support to an argument for the appropriateness of using CJ in this context. • Comparative Judgement can offer an alternative to analytic rating of EFL writing. • Judges with teaching or rating experience largely focus on relevant text features. • Some judges adopt a decision-making process that appears well suited to CJ. • Dimension-based CJ has the potential to provide richer feedback than holistic CJ.

teacher development assessment multilingual writers

doi:10.1016/j.asw.2025.100986
Oct 2025 OA PDF

Editorial Board ↗

editorial matter

doi:10.1016/s1075-2935(25)00091-1
Oct 2025

Using ChatGPT to score essays and short-form constructed responses ↗

Mark D. Shermis

artificial intelligence

doi:10.1016/j.asw.2025.100988
Oct 2025 OA PDF

Which gender provides more specific peer feedback? Gender and assessment training’s effects on peer feedback specificity and intrapersonal factors ↗

José Carlos G. Ocampo; Ernesto Panadero; David Zamorano; Iván Sánchez-Iglesias

Abstract

This study investigated the effects of assessor gender (male vs. female), fictitious assessee gender (male vs. female), and assessment training (with vs. without) on peer feedback specificity (i.e. localisation and focus) and intrapersonal factors (i.e. trust in the self as an assessor and discomfort). This study involved 240 undergraduate psychology students (nMen=120, nWomen=120), with half receiving assessment training and the other half receiving the task instructions. Participants were divided into eight subgroups based on training condition and their self-reported gender to provide peer feedback to three writing samples (poor, average, excellent quality) by fictitious male or female peer assessees in Eduflow. A total of 3017 peer feedback segments were analysed, revealing that trained or untrained male and female assessors were comparable in most peer feedback specificity categories when assessing fictitious male or female assessees. Nonetheless, we also found that female assessors excelled in certain categories of peer feedback specificity, while male assessors also demonstrated competencies in other categories. Results also showed that assessors who received assessment training provided localised peer feedback in all the writing samples. Finally, gender and training did not affect participants’ trust in their abilities and (dis)comfort when providing peer feedback.

assessment gender and writing affect and writing

doi:10.1016/j.asw.2025.100987
Oct 2025

Integrating move analysis and sentence reconstruction in automated writing evaluation for L2 academic writers ↗

Bo-Ren Mau; Hui-Hsien Feng

assessment artificial intelligence

doi:10.1016/j.asw.2025.100984
Oct 2025

The development of syntactic complexity in integrated writing: A focus on fine-grained measures ↗

Seyyed Ehsan Golparvar; J. Elliott Casal; Hamideh Abolhasani

grammar and mechanics

doi:10.1016/j.asw.2025.100983
Oct 2025

Exploring the cross-lingual influence of linguistic complexity in second language writing assessment ↗

Sara Geremia; Thomas Gaillat; Nicolas Ballier; Andrew J. Simpkin

assessment multilingual writers

doi:10.1016/j.asw.2025.100951
Oct 2025

Challenges and opportunities of automated essay scoring for low-proficient L2 English writers ↗

Vanessa De Wilde; Orphée De Clercq

doi:10.1016/j.asw.2025.100982
Oct 2025

GenAI and human assessments of L2 Chinese writing: Interrater reliability and rater bias ↗

Yuan Lu; Xiaoying Liles; Xi Ma

doi:10.1016/j.asw.2025.100989
Oct 2025

Predictive validity evidence for a no-stakes, untimed, machine-scored diagnostic writing assessment ↗

Elie ChingYen Yu; Oxana Rosca; Heidi L. Andrade; Angela M. Lui; Jason Bryer

assessment

doi:10.1016/j.asw.2025.100978

July 2025

Jul 2025

The relationship between executive functions, source use, and integrated writing performance ↗

Xian Liao; Pengfei Zhao; Zicheng Li

doi:10.1016/j.asw.2025.100936
Jul 2025 OA PDF

Making things happen: A study of grammatical metaphors in L2 writing scripts ↗

Nicholas Glasson; Andrew Kitney

Abstract

The notion of grammatical metaphor (GM) (Halliday, 1985) is essentially where a writer can shift an action or quality into being a ‘thing’. As in most senses of metaphor, the goal is to “represent something as something else” (McGrath & Liardét, 2023, p.33). This study investigated the use of grammatical metaphor (GM) in Linguaskill writing exam responses across CEFR proficiency levels (below-B1 to C1 or above). It analysed the presence of a pre-existing GM list (see McGrath & Liardét, 2023) to explore GM frequency in L2 responses, the correlative relationship with proficiency scores and qualitatively explored candidate responses in terms of how GMs were used. Results show a moderate positive correlation between proficiency and GM use, with a dominance of process-to-thing shifts (e.g., transform→transformation) and emergence of GM use from lower to higher proficiency levels. This underscores GM's significance in crafting academically valued meanings in L2 contexts, suggesting its potential for informing instructional and assessment practices. • Metaphorisation in Writing is a useful metric for L2 writing assessment. • Evidence suggests GM frequency correlates with increased performance. • Learners progress from emergent arguments to presenting ideas more concisely. • The majority of GM shifts were to ‘things’. • The study provides further weight to arguments for meaning-based complexity.

assessment multilingual writers

doi:10.1016/j.asw.2025.100939
Jul 2025

Promoting cognitive engagement with peer feedback through peer review training: The case of Chinese tertiary-level EFL learners ↗

Jia He; Jun Xia; Chun-mei Zhang; Jian-nan Liu

doi:10.1016/j.asw.2025.100947
Jul 2025

Trinka: Facilitating academic writing through an intelligent writing evaluation system ↗

Jessie S. Barrot

assessment

doi:10.1016/j.asw.2025.100953
Jul 2025

Editorial Board ↗

editorial matter

doi:10.1016/s1075-2935(25)00054-6
Jul 2025

Toward the fair and valid use of curriculum-based measurement for students with intensive writing needs and linguistically diverse backgrounds ↗

Seohyeon Choi; Kristen L. McMaster; Nana Kim

doi:10.1016/j.asw.2025.100948
Jul 2025

Using ChatGPT to facilitate vocabulary learning in continuation writing assessment tasks ↗

Fengkai Liu; Xiaofei Lu; Tan Jin

assessment artificial intelligence

doi:10.1016/j.asw.2025.100952
Jul 2025

Comparative judgment in L2 writing assessment: Reliability and validity across crowdsourced, community-driven, and trained rater groups of judges ↗

Peter Thwaites; Pauline Jadoulle; Magali Paquot

assessment multilingual writers

doi:10.1016/j.asw.2025.100937
Jul 2025

Editorial Volume 65 ↗

Martin East; David Slomp

doi:10.1016/j.asw.2025.100963
Jul 2025

Editorial introduction, Assessing writing Tools & Tech Forum 2025 ↗

Kelly Hartwell; Laura Aull

assessment

doi:10.1016/j.asw.2025.100956
Jul 2025 OA PDF

A large-scale corpus for assessing source-based writing quality: ASAP 2.0 ↗

Scott A. Crossley; Perpetual Baffour; L. Burleigh; Jules King

Abstract

This paper introduces ASAP 2.0, a dataset of ∼25,000 source-based argumentative essays from U.S. secondary students. The corpus addresses the shortcomings of the original ASAP corpus by including demographic data, consistent scoring rubrics, and source texts. ASAP 2.0 aims to support the development of unbiased, sophisticated Automatic Essay Scoring (AES) systems that can foster improved educational practices by providing summative to students. The corpus is designed for broad accessibility with the hope of facilitating research into writing quality and AES system biases. • We introduce the ASAP 2.0 corpus. • The corpus contains over 25,000 source-based essays. • Each essay is scored for overall writing quality. • The corpus can be used to computationally and quantitatively model source-based writing quality.

doi:10.1016/j.asw.2025.100954
Jul 2025

The impact of self-revision, machine translation, and ChatGPT on L2 writing: Raters’ assessments, linguistic complexity, and error correction ↗

Minjoo Kim; Yuah V. Chon

revision artificial intelligence multilingual writers grammar and mechanics

doi:10.1016/j.asw.2025.100950
Jul 2025

Potentials and pitfalls of Google Gemini in writing: Implications for educators ↗

Hieu Manh Do

doi:10.1016/j.asw.2025.100955
Jul 2025

Unveiling the precursors of negative emotions in second language writing through control-value theory: An explanatory sequential design approach ↗

Haijing Zhang; Fangwei Huang

multilingual writers

doi:10.1016/j.asw.2025.100949
Jul 2025

A mixed-methods approach to English-L1 teachers’ implementation of written feedback in EFL classrooms ↗

Xiaolong Cheng; Jinfen Xu

revision

doi:10.1016/j.asw.2025.100935

April 2025

Apr 2025

Modeling the interplay between teacher support, anxiety and grit in predicting feedback-seeking behavior in L2 writing ↗

Ya Zhang; Zhanhao Jiang

teacher development multilingual writers

doi:10.1016/j.asw.2025.100920
Apr 2025 OA PDF

Towards a better understanding of integrated writing performance: The influence of literacy strategy use and independent language skills ↗

Xinhua Zhu; Yiwen Sun; Yaping Liu; Wandong Xu; Choo Mui Cheong

Abstract

This study explores the influence mechanism of literacy strategy use and independent language skills (e.g., reading and writing) on integrated writing (IW) performance. 322 Secondary Four students from four schools in Hong Kong completed single-text reading, multiple-text reading, independent writing, and IW tasks, along with questionnaires investigating their reading strategy use and IW strategy use. Path analyses revealed that multiple-text reading and independent writing had comparable significant impacts on IW, mediating the influence of single-text comprehension. In addition, reading strategy use impacted IW indirectly through independent literacy skills and IW strategy use, while IW strategies exerted a direct influence on IW. Our findings underscore the critical role of language skills in mediating the influence of reading strategies on IW performance among young first language (L1) learners. The implications for research and practice, are discussed, emphasizing the complexity of the IW construct and the need for balanced language skills and strategy instruction to enhance IW task performance. • A noble exploration of concurrent effects of strategies and independent skills on IW. • Multiple-text reading and independent writing directly influence IW performance. • Independent skills mediate the impact of reading strategies on IW performance. • Reading strategy indirectly affect IW through independent skills and IW strategy. • Balanced language skills and strategy instruction are crucial for IW performance.

literacy studies affect and writing

doi:10.1016/j.asw.2025.100922
Apr 2025

The influence of working memory and proficiency on phraseological growth: A longitudinal study of adjective-noun combinations in Chinese EFL learners’ argumentative writing ↗

Lujie Zheng; Sheena Kaur; Azlin Zaiti Zainal

argument empirical research

doi:10.1016/j.asw.2025.100915
Apr 2025

Predicting inappropriate source use from scores of language use, source comprehension, and organizational features: A study using generalized linear models ↗

Kwangmin Lee; Ray J.T. Liao; I.-Chun Vera Hsiao; Junhee Park; Yafei Ye

doi:10.1016/j.asw.2025.100934
Apr 2025

Assessing academic language in tenth grade essays using natural language processing ↗

Andrew Potter; Mitchell Shortt; Maria Goldshtein; Rod D. Roscoe

doi:10.1016/j.asw.2025.100921
Apr 2025 OA PDF

Designing a rating scale for an integrated reading-writing test: A needs-oriented approach ↗

Aynur Ismayilli Karakoҫ; Peter Gu; Rachael Ruegg

Abstract

To meet the current trends in higher education, there is accountability on EAP programmes to prepare and assess students’ access to higher education. Thus, multimodal tasks including integrated writing (IW) assessments have seen a resurgence because they arguably closely mirror academic writing. However, test practicality constraints and variability in the use and format of these assessments mean rating scales often fall short in substantiating the central claims of IW assessment. We developed an integrated reading-writing scale taking into account reading-writing requirements and empirical research on IW tests designed to assess readiness for first-year humanities and social science courses. We approached test development as part of the ongoing validation efforts, detailing the considerations involved in the scale development process. We argue that alignment with academic writing requirements should guide the development of IW tests, thereby acknowledging and comprehending nuances of academic writing. The paper demonstrates considerations and decisions in scale design as the validation process from the start, which is a reminder that assessment is not just a quantitative exercise but a multifaceted process. • The design of a rating scale for first-year undergraduate academic writing is detailed. • Emphasis is placed on the role of reading in integrated writing scales. • Academic argumentation, rather than solely source-use mechanics, is considered. • Implications for construct operationalisation in academic evaluations are offered.

first-year composition argument assessment empirical research multimodality literacy studies

doi:10.1016/j.asw.2025.100918
Apr 2025 OA PDF

Validation of the individual and collective self-efficacy scale for teaching writing in post-secondary faculty ↗

Kim M. Mitchell; Johnson Li; Rasheda Rabbani

Abstract

Faculty actions in the classroom are known to impact student writing self-efficacy and academic achievement. The purpose of this paper was to validate Locke and Johnston’s Individual and Collective Self-Efficacy for Teaching Writing Scales, a tool originally validated in high school teachers, in a new population of post-secondary faculty. Exploratory and confirmatory factor analysis methods were used in two studies with independent samples of multidisciplinary faculty (N = 281) for the exploratory factor analysis (Study 1) and nursing discipline specific faculty (N = 187) for the confirmatory factor analysis (Study 2). Three factors were identified in the questionnaire which maintained the essence of the theoretical structure proposed by Locke and Johnston. Factor 1 was named Context and Process Competencies, Factor 2 Textural Competencies, and Factor 3 Motivational Competencies. This factor structure was confirmed with acceptable goodness of fit in the confirmatory factor analysis Study 2. Learning to be a teacher of writing is a developmental process and this measurement tool has important validation information that speaks to its usefulness in understanding that process. • Instructional practices are known to impact student achievement levels. • Faculty individual self-efficacy for teaching writing is three factors. • Faculty undergo a slow enculturation practice to teaching writing. • This scale can be used to assess impact of teacher agency on student outcomes.

writing pedagogy teacher development

doi:10.1016/j.asw.2025.100923
Apr 2025

Editorial ↗

David Slomp; Martin East

doi:10.1016/j.asw.2025.100938
Apr 2025

Editorial Board ↗

editorial matter

doi:10.1016/s1075-2935(25)00030-3
Apr 2025

How L2 student writers engage with automated feedback: A longitudinal perspective ↗

Li Xiaosa; Ke Ping

artificial intelligence

doi:10.1016/j.asw.2025.100919
Apr 2025

Does student assessment literacy matter between motivational constructs and engagement in L2 writing? A survey of Chinese EFL undergraduates ↗

Jian Xu; Yao Zheng

assessment multilingual writers literacy studies

doi:10.1016/j.asw.2025.100916

January 2025

Jan 2025 OA PDF

Editorial Board ↗

editorial matter

doi:10.1016/s1075-2935(25)00015-7