Towards Automatic Identification of Discourse Markers in Dialogs: the Case of 'Like'

Zufferey, Sandrine; Popescu-Belis, Andrei (2004). Towards Automatic Identification of Discourse Markers in Dialogs: the Case of 'Like'. In: Proceedings of SIGDIAL'04 (5th SIGdial Workshop on Discourse and Dialogue). Cambridge, Massachusetts. 30.04.-01.05.2004.

W04-2313.pdf - Published Version
Restricted to registered users only
Available under License Publisher holds Copyright.


This article discusses the detection of discourse markers (DMs) in dialog transcriptions, by human annotators and by automated means. After a theoretical discussion of the definition of DMs and their relevance to natural language processing, we focus on the role of 'like' as a DM. Results from experiments with human annotators show that detection of DMs is a difficult but reliable task, which requires prosodic information from soundtracks. Then, several types of features are defined for automatic disambiguation of 'like': collocations, part-of-speech tags and duration-based features. Decision-tree learning shows that for 'like', nearly 70% precision can be reached, with nearly 100% recall, mainly using collocation filters. Similar results hold for 'well', with about 91% precision at 100% recall.
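
To make the idea of a collocation filter and the precision/recall trade-off concrete, here is a minimal Python sketch. It is not the authors' implementation: the word lists `NON_DM_PREVIOUS` and `NON_DM_NEXT`, the function names, and the five toy utterances are all hypothetical, chosen only for illustration, and the decision-tree learning step mentioned in the abstract is left out. The filter discards an occurrence of 'like' only when a neighbouring word strongly suggests a verb or preposition reading, which is one plausible way to keep recall high at the cost of some precision.

```python
# Hypothetical collocation lists: illustrative guesses, NOT the filters from the paper.
NON_DM_PREVIOUS = {"would", "'d", "look", "looks", "feels", "sounds", "something"}
NON_DM_NEXT = {"to", "that", "this"}


def is_dm_candidate(tokens, i):
    """Return True unless a neighbouring word rules out a discourse-marker reading."""
    prev_tok = tokens[i - 1].lower() if i > 0 else ""
    next_tok = tokens[i + 1].lower() if i + 1 < len(tokens) else ""
    return prev_tok not in NON_DM_PREVIOUS and next_tok not in NON_DM_NEXT


def precision_recall(predicted, gold):
    """Compute precision and recall for boolean predictions against gold labels."""
    tp = sum(1 for p, g in zip(predicted, gold) if p and g)
    fp = sum(1 for p, g in zip(predicted, gold) if p and not g)
    fn = sum(1 for p, g in zip(predicted, gold) if not p and g)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


# Invented toy data: (utterance, index of 'like', is it a DM?).
EXAMPLES = [
    ("it was like really hard", 2, True),
    ("i would like to go", 2, False),
    ("it looks like rain", 2, False),
    ("they like pizza", 1, False),   # plain verb that the filter fails to exclude
    ("so like we left early", 1, True),
]

predicted = [is_dm_candidate(text.split(), idx) for text, idx, _ in EXAMPLES]
gold = [label for _, _, label in EXAMPLES]
precision, recall = precision_recall(predicted, gold)
print(f"precision={precision:.2f} recall={recall:.2f}")
```

On this invented data the filter keeps every true DM but also lets one verb use through, so recall is perfect while precision is lower. The numbers have no bearing on the figures reported in the abstract.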

Item Type: Conference or Workshop Item (Paper)


06 Faculty of Humanities > Department of Linguistics and Literary Studies > Institute of French Language and Literature

UniBE Contributor: Zufferey, Sandrine


800 Literature, rhetoric & criticism > 840 French & related literatures
400 Language > 440 French & related languages




Sandrine Zufferey

Date Deposited: 25 Apr 2016 12:01

Last Modified: 05 Dec 2022 14:53



