Automatic identification of discourse markers in dialogues: An in-depth study of like and well

Zufferey, Sandrine; Popescu-Belis, Andrei (2011). Automatic identification of discourse markers in dialogues: An in-depth study of like and well. Computer Speech & Language, 25(3), pp. 499-518. Elsevier 10.1016/j.csl.2010.12.001

[img] Text
1-s2.0-S088523081000080X-main.pdf - Published Version
Restricted to registered users only
Available under License Publisher holds Copyright.

Download (335kB)

The lexical items like and well can serve as discourse markers (DMs), but can also play numerous other roles, such as verb or adverb. Identifying the occurrences that function as DMs is an important step for language understanding by computers. In this study, automatic classifiers using lexical, prosodic/positional and sociolinguistic features are trained over transcribed dialogues, manually annotated with DM information. The resulting classifiers improve state-of-the-art performance of DM identification, at about 90% recall and 79% precision for like (84.5% accuracy, κ = 0.69), and 99% recall and 98% precision for well (97.5% accuracy, κ = 0.88). Automatic feature analysis shows that lexical collocations are the most reliable indicators, followed by prosodic/positional features, while sociolinguistic features are marginally useful for the identification of DM like and not useful for well. The differentiated processing of each type of DM improves classification accuracy, suggesting that these types should be treated individually.

Item Type:

Journal Article (Original Article)

Division/Institute:

06 Faculty of Humanities > Department of Linguistics and Literary Studies > Institute of French Language and Literature

UniBE Contributor:

Zufferey, Sandrine

Subjects:

800 Literature, rhetoric & criticism > 840 French & related literatures
400 Language > 440 French & related languages

ISSN:

0885-2308

Publisher:

Elsevier

Language:

English

Submitter:

Sandrine Zufferey

Date Deposited:

25 Apr 2016 10:02

Last Modified:

05 Dec 2022 14:53

Publisher DOI:

10.1016/j.csl.2010.12.001

BORIS DOI:

10.7892/boris.78536

URI:

https://boris.unibe.ch/id/eprint/78536

Actions (login required)

Edit item Edit item
Provide Feedback