Improving word prediction for augmentative communication by using idiolects and sociolects
until further notice
SourceDutch Journal of Applied Linguistics, 3, 2, (2014), pp. 136-153
Article / Letter to editor
Display more detailsDisplay less details
Humanities Lab (t/m 2018)
Communicatie- en informatiewetenschappen
Dutch Journal of Applied Linguistics
SubjectADNEXT (Adaptive Information Extraction over Time); Language & Speech Technology; Language in Society; Nederlab
Word prediction, or predictive editing, has a long history as a tool for augmentative and assistive communication. Improvements in the state-of-the-art can still be achieved, for instance by training personalized statistical language models. We developed the word prediction system Soothsayer. The main innovation of Soothsayer is that it not only uses idiolects, the language of one individual person, as training data, but also sociolects, the language of the social circle around that person. We use Twitter for data collection and experimentation. The idiolect models are based on individual Twitter feeds, the sociolect models are based on the tweets of a particular person and the tweets of the people he often communicates with. The sociolect approach achieved the best results. For a number of users, more than 50% of the keystrokes could have been saved if they had used Soothsayer.
Upload full text
Use your RU credentials (u/z-number and password) to log in with SURFconext to upload a file for processing by the repository team.