Memory-based named entity recognition in tweets
In
Ceur Workshop Proceedings, (2013)Cano, A.; Rowe, M.; Stankovic, M. (ed.), #MSM2013 Workshop Concept Extraction Challenge Proceedings, pp. 40-43ISSN
Related links
Annotation
Concept Extraction Challenge at Making Sense of Microposts 2013
Publication type
Article in monograph or in proceedings

Display more detailsDisplay less details
Editor(s)
Cano, A.
Rowe, M.
Stankovic, M.
Dadzie, A.-S.
Organization
Communicatie- en informatiewetenschappen
Former Organization
Bedrijfscommunicatie
Journal title
Ceur Workshop Proceedings
Languages used
English (eng)
Book title
Cano, A.; Rowe, M.; Stankovic, M. (ed.), #MSM2013 Workshop Concept Extraction Challenge Proceedings
Page start
p. 40
Page end
p. 43
Subject
CEUR Workshop proceedings; ADNEXT (Adaptive Information Extraction over Time); Language & Speech Technology; Language in Society; NederlabAbstract
We present a memory-based named entity recognition system that participated in the MSM-2013 Concept Extraction Challenge. The system expands the training set of annotated tweets with part-of-speech tags and seedlist information, and then generates a sequential memory-based tagger comprised of separate modules for known and unknown words. Two taggers are trained: one on the original capitalized data, and one on a lowercased version of the training data. The intersection of named entities in the predictions of the two taggers is kept as the final output.
This item appears in the following Collection(s)
- Academic publications [232016]
- Electronic publications [115283]
- Faculty of Arts [28856]
- Open Access publications [82629]
Upload full text
Use your RU credentials (u/z-number and password) to log in with SURFconext to upload a file for processing by the repository team.