This paper reports on the development of an evidence-based rating scale for peer-peer L2 communicative interaction. The scale was based on experienced judges' comments on videoed student samples filmed during operational paired-candidate tests of beginner-level Spanish. Six trained and experienced raters generated criteria for communicative interaction, which were incorporated into a tool for developing sample-based rating scales: the Empirically-based, Binary-choice, Boundary-definition (EBB) method (Turner and Upshur, 1996), adapted for this context. The findings reported in this article examine the features of paired-candidate interaction that raters used to define the boundaries between performance levels. Three main criteria emerged as the boundaries used to define levels of interaction: non-verbal interpersonal communication, interactive listening and interactional management. These new notions are evidence of how peer-peer interaction can be rated and also advance our understanding of the significant features of interaction in this rating context.
This is the version of record published in Tasks and Criteria in Performance Assessment, edited by Annie Brown and Kathryn Hill. The original work can be found at: http://dx.doi.org/10.3726/978-3-653-05394-4 Users of this material shall give appropriate credit, provide a link to the license, and indicate if changes were made. They may do so in any reasonable manner, but not in any way that suggests the licensor endorses the licensee or his/her use. The material may not be used for commercial purposes. If a user remixes, transforms, or builds upon the material, he/she may not distribute the modified material.