A tool for efficient and accurate segmentation of speech data. Announcing POnSS
Number of pages
SourceBehavior Research Methods, 53, 2, (2021), pp. 744-756
Article / Letter to editor
Display more detailsDisplay less details
SW OZ DCC PL
PI Group Neurobiology of Language
Behavior Research Methods
SubjectLanguage & Communication; Psycholinguistics; Speech Production and Comprehension; Language in Interaction
Despite advances in automatic speech recognition (ASR), human input is still essential for producing research-grade segmentations of speech data. Conventional approaches to manual segmentation are very labor-intensive. We introduce POnSS, a browser-based system that is specialized for the task of segmenting the onsets and offsets of words, which combines aspects of ASR with limited human input. In developing POnSS, we identified several sub-tasks of segmentation, and implemented each of these as separate interfaces for the annotators to interact with to streamline their task as much as possible. We evaluated segmentations made with POnSS against a baseline of segmentations of the same data made conventionally in Praat. We observed that POnSS achieved comparable reliability to segmentation using Praat, but required 23% less annotator time investment. Because of its greater efficiency without sacrificing reliability, POnSS represents a distinct methodological advance for the segmentation of speech data.
NWO (Grant code:info:eu-repo/grantAgreement/NWO/Gravitation/024.001.006)
Upload full text
Use your RU credentials (u/z-number and password) to log in with SURFconext to upload a file for processing by the repository team.