Skip Navigation

ELT Journal 1995 49(1):3-12; doi:10.1093/elt/49.1.3
© 1995 by Oxford University Press
This Article
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Upshur, J. A.
Right arrow Articles by Turner, C. E.
Right arrow Search for Related Content
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?


Articles

Constructing rating scales for second language tests

John A. Upshur and Carolyn E. Turner

Professor of Applied Linguistics at the TESL Centre at Concordia University in Montreal, Canada. He has been involved in second language education and assessment since 1960 in Canada, the United States, and other countries.
Assistant Professor and Certificate Program Director in the Department of Education in Second Languages at McGill University in Montreal. She has been involved in second language education for twenty years in Canada and the United States. Her main interest is evaluation.

Second language testing increasingly uses rated tasks in place of objectively scored items. Rating language performance is more demanding than scoring discrete-point tests. Lower reliability and validity for ratings are to be expected, especially in instructional settings where differences among students are small, and training for the use of rating scales is infrequent. The need for rating scales has not always led to effective and efficient scales, however.

Standard rating scales require the matching of examinee performance to a verbal description. An alternative type of scale consists of an empirically derived, ordered set of binary questions relating to boundaries between levels on the performance being evaluated. Rating depends on a series of decisions. Two scales of this type were developed to assess the grammatical accuracy and communicative effectiveness of a story-retell task. All scale categories were used, thus demonstrating that category range was appropriate to the learners. Reasons are offered to explain why this type of scale may ameliorate reliability and validity problems associated with standard rating scales.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Language TestingHome page
A. Brown
Book Review: Testing second language speaking
Language Testing, April 1, 2005; 22(2): 235 - 241.
[PDF]


Home page
Language TestingHome page
D. E. Powers, M. A. Schedl, S. W. Leung, and F. A. Butler
Validating the revised Test of Spoken English against a criterion of communicative success
Language Testing, October 1, 1999; 16(4): 399 - 425.
[Abstract] [PDF]


Home page
Language TestingHome page
J. A. Upshur and C. E. Turner
Systematic effects in the rating of second-language speaking ability: test method and learner discourse
Language Testing, January 1, 1999; 16(1): 82 - 111.
[Abstract] [PDF]


Home page
Language TestingHome page
B. North and G. Schneider
Scaling descriptors for language proficiency scales
Language Testing, April 1, 1998; 15(2): 217 - 262.
[Abstract] [PDF]



Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.