Relevancer: Finding and labeling relevant information in tweet collections
Publication year
2016Publisher
Berlijn : Springer International publishing
ISBN
9783319478739
In
Lecture Notes in Computer Science, (2016)Spiro, E.; Ahn, Y.Y. (ed.), Social Informatics. SocInfo 2016. Part II, pp. 210-224ISSN
Annotation
8th International Conference on Social Informatics (SocInfo 2016), 11 november 2016
Publication type
Article in monograph or in proceedings

Display more detailsDisplay less details
Editor(s)
Spiro, E.
Ahn, Y.Y.
Organization
Communicatie- en informatiewetenschappen
Taalwetenschap
Journal title
Lecture Notes in Computer Science
Languages used
English (eng)
Book title
Spiro, E.; Ahn, Y.Y. (ed.), Social Informatics. SocInfo 2016. Part II
Page start
p. 210
Page end
p. 224
Subject
Language & Speech Technology; Language in Society; Nederlab; Project in ADNEXT (Commit)Abstract
We introduce a tool that supports knowledge workers who want to gain insights from a tweet collection, but due to time constraints cannot go over all tweets. Our system first pre-processes, de-duplicates, and clusters the tweets. The detected clusters are presented to the expert as so-called information threads. Subsequently, based on the information thread labels provided by the expert, a classifier is trained that can be used to classify additional tweets. As a case study, the tool is evaluated on a tweet collection based on the key terms ‘genocide’ and ‘Rohingya’. The average precision and recall of the classifier on six classes is 0.83 and 0.82 respectively. At this level of performance, experts can use the tool to manage tweet collections efficiently without missing much information.
This item appears in the following Collection(s)
- Academic publications [234419]
- Electronic publications [117392]
- Faculty of Arts [28942]
- Open Access publications [84338]
Upload full text
Use your RU credentials (u/z-number and password) to log in with SURFconext to upload a file for processing by the repository team.