Improving word prediction for augmentative communication by using idiolects and sociolects
Fulltext:
134732.pdf
Embargo:
until further notice
Size:
193.9Kb
Format:
PDF
Description:
Publisher’s version
Publication year
2014Source
Dutch Journal of Applied Linguistics, 3, 2, (2014), pp. 136-153ISSN
Publication type
Article / Letter to editor
Display more detailsDisplay less details
Organization
Humanities Lab (t/m 2018)
Communicatie- en informatiewetenschappen
Journal title
Dutch Journal of Applied Linguistics
Volume
vol. 3
Issue
iss. 2
Languages used
English (eng)
Page start
p. 136
Page end
p. 153
Subject
ADNEXT (Adaptive Information Extraction over Time); Language & Speech Technology; Language in Society; NederlabAbstract
Word prediction, or predictive editing, has a long history as a tool for augmentative and assistive communication. Improvements in the state-of-the-art can still be achieved, for instance by training personalized statistical language models. We developed the word prediction system Soothsayer. The main innovation of Soothsayer is that it not only uses idiolects, the language of one individual person, as training data, but also sociolects, the language of the social circle around that person. We use Twitter for data collection and experimentation. The idiolect models are based on individual Twitter feeds, the sociolect models are based on the tweets of a particular person and the tweets of the people he often communicates with. The sociolect approach achieved the best results. For a number of users, more than 50% of the keystrokes could have been saved if they had used Soothsayer.
This item appears in the following Collection(s)
- Academic publications [245263]
- Electronic publications [132514]
- Faculty of Arts [29870]
Upload full text
Use your RU credentials (u/z-number and password) to log in with SURFconext to upload a file for processing by the repository team.