For the Record: Which Digital Media Can be Used for Sociophonetic Analysis?
Penn collection
Degree type
Discipline
Subject
Funder
Grant number
License
Copyright date
Distributor
Related resources
Contributor
Abstract
Sociolinguists now have more options for collecting speech data than ever before. Informants who might otherwise be inaccessible to the analyst could record themselves using smartphones and personal computers; researchers might also consider YouTube or recordings uploaded to other internet sites as sources of data. However, digital compression techniques simplify some acoustic content and discard others (Bulgin, De Decker & Nycz 2010) while lower-quality microphones may distort the quality of the signal (Van Son, R.J.J.H. 2005). Therefore, before such data can be used for dialect research, it is crucial to determine if these media affect the reliability of acoustic analyses (Gonzalez, Cervera and Llau 2003; Gonzalez and Cervera 2001). As an initial test, we looked the effect of these devices on representations of the vowel space. Male and female speakers were recorded reading a word list containing 10 English monophthongs in h_d context using a Roland Edirol R-09 (WAV format) recorder, an Apple iPhone (lossless Apple m4a), a Macbook Pro running Praat 5.1 (WAV) and a Mino Flip video camera (AVI converted to AIFF). The Mino Flip file was then uploaded to Youtube and subsequently downloaded (MP3) for analysis. Speakers read each word 3 times while seated in a quiet room with the recorders placed on a table in front of them. Measurements of F1 through F4 were taken at the temporal midpoint of each vowel using Praat 5.1. Differences between recording formats were tested in R using a Repeated Measures ANOVA with separate runs for each formant (F1-F4). Preliminary results indicate that the Mino and Mino-derived YouTube formats differ substantially from the lossless Edirol recording. F1 values for most vowels were raised in Mino and Youtube measurements. F2 was also affected, such that front vowels were artificially raised while back vowels were lowered. Thus the vowel space is effectively altered with lowering along the F1 dimension and a widening of the space along the F2 dimension. These effects seem to be exaggerated for the female speaker. Based on these results, Macbook Pro and iPhone may be suitable recording options for studying the vowel spaces of speakers. Mino and its Youtube derivative show a number of significant deviations from lossless recordings indicating that audio from these devices should not be used for this type of analysis until corrective measures are identified.