The Buckeye GTA Corpus contains 9,664 L1 and L2 sentence productions by 89 talkers (27 American English, 19 Hindi, 23 Mandarin, & 20 Korean). A total of 5,696 sentences were read in English, with each talker contributing 64 sentences. Hindi, Mandarin, and Korean talkers also read 64 sentences each in their native languages, contributing a total of 3,968 sentences. Potential uses of the corpus are illustrated by research projects on classroom communication and acoustic phonetic patterns. These projects demonstrate how investigations in different disciplines can make use of the same corpus and provide converging data on second language phonological acquisition.
corpus studies, pronunciation, intelligibility, phonetics, SLA, L2 speech production and perception