Applying Content Similarity Metrics to Corpus Data: Differences between Native and Non-Native Speaker Responses to a TOEFL® Integrated Writing Prompt. Research Report. ETS RR-08-51







ETS Research Report Series, Oct 2008

For many purposes, it is useful to collect a corpus of texts all produced in response to the same stimulus, whether to measure performance (as on a test) or to test hypotheses about population differences. This paper examines several methods for measuring similarities in phrasing and content and demonstrates that these methods can be used to identify population differences between native and non-native speakers of English in a writing task.

Descriptors: Test Content, Computational Linguistics, Native Speakers, Writing Tests, English (Second Language), Second Language Learning, Prompting, Cues, Comparative Analysis, Language Tests, Grammar, Language Usage, Phrase Structure, Scores, Correlation, Multiple Regression Analysis, Statistical Analysis

Educational Testing Service, Rosedale Road, MS19-R, Princeton, NJ 08541. Tel: 609-921-9000; Fax: 609-734-5410; e-mail: RDweb[at]ets.org; Web site: https://www.ets.org/research/policy_research_reports/ets





Author: Deane, Paul; Gurevich, Olga

Source: https://eric.ed.gov/?q=a&ft=on&ff1=dtySince_1992&pg=1826&id=EJ1111357






