The English Dialects App: The creation of a crowdsourced dialect corpus

Leemann, Adrian; Kolly, Marie-José; Britain, David (2018). The English Dialects App: The creation of a crowdsourced dialect corpus. Ampersand, 5, pp. 1-17. Elsevier 10.1016/j.amper.2017.11.001

1-s2.0-S2215039017300589-main.pdf - Published Version
Available under License Creative Commons: Attribution-Noncommercial-No Derivative Works (CC-BY-NC-ND).

In this paper, we present the English Dialects App (EDA) and the English Dialects App Corpus (EDAC). EDA is a free iOS and Android app, launched in January 2016, that features a dialect quiz and dialect recordings. For the quiz, users indicate which variants of 26 words they use, and the application guesses their local dialect; for the recordings, users can record a short text. The result is EDAC, which includes metadata on mobility, ethnicity, age, educational level, and gender. More than 47,000 users from across the UK have indicated dialect variants for these 26 words, and more than 3,500 users have provided audio recordings. Unavoidably, EDAC does not accurately reflect the distributions of age, ethnicity, qualification levels, and other parameters found in the UK population, given that smartphone-based research reaches a specific stratum of the population. Yet there are also clear benefits to the sampling strategy used; both benefits and pitfalls are discussed in this article. Future analyses will provide the most comprehensive understanding of English regional dialect variation since the work of the traditional dialectologists. We showcase two such analyses in this article. EDAC should, we demonstrate, be of interest to researchers not only in dialectology but also in forensic phonetics.

Item Type:

Journal Article (Original Article)

Division/Institute:

06 Faculty of Humanities > Department of Linguistics and Literary Studies > Institute of English Languages and Literatures
06 Faculty of Humanities > Department of Linguistics and Literary Studies > Institute of English Languages and Literatures > Modern English Linguistics

UniBE Contributor:

Britain, David

Subjects:

400 Language
400 Language > 410 Linguistics
400 Language > 420 English & Old English languages

ISSN:

2215-0390

Publisher:

Elsevier

Language:

English

Submitter:

Leona Josefine Irmgard Goop

Date Deposited:

24 Jun 2019 14:02

Last Modified:

23 Oct 2019 19:00

Publisher DOI:

10.1016/j.amper.2017.11.001

BORIS DOI:

10.7892/boris.123639

URI:

https://boris.unibe.ch/id/eprint/123639
