Open Access
Exploring the Feasibility of Using Writing Process Features to Assess Text Production Skills
Author(s) -
Deane Paul,
Zhang Mo
Publication year - 2015
Publication title -
ets research report series
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.235
H-Index - 5
ISSN - 2330-8516
DOI - 10.1002/ets2.12071
Subject(s) - keystroke logging , computer science , feature (linguistics) , process (computing) , set (abstract data type) , correlation , variance (accounting) , artificial intelligence , stability (learning theory) , natural language processing , writing process , machine learning , psychology , mathematics education , mathematics , linguistics , philosophy , geometry , accounting , business , programming language , operating system
In this report, we examine the feasibility of characterizing writing performance using process features derived from a keystroke log. Using data derived from a set of CBAL ™ writing assessments, we examine the following research questions: (a) How stable are the keystroke timing and process features across testing occasions? (b) How consistent are the patterns of feature–human correlation across genres and topics? (c) How accurately can we predict human ratings on writing fundamentals using a combination of the keystroke timing and process features, and what are the contributions of each feature to the reliable variance in the human ratings? (d) If we train a predictive model on one prompt, how well do its predictions generalize to the other prompts of the same or different genre? The results of the study indicate that keystroke log features vary considerably in stability across testing occasions and display somewhat different patterns of feature–human correlation across genres and topics. However, using the most stable features, we can obtain moderate to strong prediction of human essay scores, and those models generalize reasonably well across prompts though more strongly within than across writing genres.