(Commercial) Automatic Speech Recognition as a Tool in Sociolinguistic Research

Loading...
Thumbnail Image
Penn collection
University of Pennsylvania Working Papers in Linguistics
Degree type
Discipline
Subject
Funder
Grant number
License
Copyright date
Distributor
Related resources
Contributor
Abstract

As speech datasets used in sociolinguistic research increase in size, laborious and time-intensive manual orthographic transcription is a challenge, limiting the amount of (transcribed) data which can be analysed. In this paper, I discuss the use of (commercial) automatic speech recognition (ASR) as a tool in sociolinguistic research in the context of a case study: the Lothian Diary Project. I describe the kinds of errors produced by two commercial ASR systems for British English within the broader context of algorithmic bias in ASR, and suggest some best practices when working with ASR in sociolinguistic work.

Advisor
Date Range for Data Collection (Start Date)
Date Range for Data Collection (End Date)
Digital Object Identifier
Series name and number
Publication date
2022-09-19
Volume number
Issue number
Publisher
Publisher DOI
Journal Issue
Journal Issue
Comments
Recommended citation
Collection