A dataset of 1,210 vowel tokens produced by 11 speakers from the state of Idaho in the United States. Each speaker contributes ten tokens per canonical monophthong (11 speakers × 11 vowels × 10 tokens), randomly selected from a larger dataset. No token precedes a sonorant or follows a coronal consonant. For each token, F1, F2, F3, and F4 were extracted at the vowel midpoint using a Praat script. The speakers consented to their data being used in this way.

idahoans

Format

A data frame with 1,210 rows and 7 variables.

speaker

a unique identifier per speaker

sex

biological sex of the speaker

vowel

vowel label, in ARPABET, a transcription system that represents every General American English vowel with a two-letter code

F1, F2, F3, F4

vowel formant measurements, in Hz
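
For readers who prefer IPA, the ARPABET labels can be recoded with a named lookup vector. The sketch below is illustrative only: `arpabet_to_ipa` and the short `vowel` vector are hypothetical stand-ins, and which of these codes actually occur in `idahoans` is an assumption that should be checked with `unique(idahoans$vowel)`.

```r
# ARPABET codes for General American monophthongs and their usual IPA symbols.
# Assumption: the codes used in `idahoans` are a subset of these names.
arpabet_to_ipa <- c(
  IY = "i", IH = "ɪ", EY = "eɪ", EH = "ɛ", AE = "æ", AA = "ɑ",
  AO = "ɔ", OW = "oʊ", UH = "ʊ", UW = "u", AH = "ʌ"
)

# Hypothetical vowel column standing in for idahoans$vowel.
vowel <- c("IY", "AE", "UW", "AA")

# Index the lookup by name; codes missing from the lookup would come back as NA.
ipa <- unname(arpabet_to_ipa[vowel])
print(ipa)
```

With the real data attached, the same lookup could be applied as `arpabet_to_ipa[as.character(idahoans$vowel)]`.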

Details

This dataset is useful for testing and demonstrating vowel normalization functions.
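
As one illustration of that use, the sketch below applies Lobanov normalization (a z-score of each formant within speaker) using base R only. It is a minimal example, not the method of any particular package: `toy` is a made-up stand-in that borrows the column names `speaker`, `vowel`, `F1`, and `F2` from `idahoans`, its values are invented, and `lobanov` is a hypothetical helper.

```r
# Toy stand-in with the same column names as `idahoans`.
# The values are invented for illustration and are NOT taken from the dataset.
toy <- data.frame(
  speaker = rep(c("s01", "s02"), each = 4),
  vowel   = rep(c("IY", "AE", "AA", "UW"), times = 2),
  F1      = c(300, 700, 750, 320, 350, 850, 900, 380),
  F2      = c(2300, 1800, 1200, 1000, 2700, 2100, 1400, 1200)
)

# Hypothetical helper: center on the mean and scale by the standard deviation.
lobanov <- function(x) (x - mean(x)) / sd(x)

# ave() applies the helper separately within each speaker and
# returns a vector aligned with the original rows.
toy$F1_z <- ave(toy$F1, toy$speaker, FUN = lobanov)
toy$F2_z <- ave(toy$F2, toy$speaker, FUN = lobanov)

print(toy)
```

With the real data attached, replacing `toy` with `idahoans` should give each speaker's normalized formants a mean of 0 and a standard deviation of 1.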