The Nationwide Speech Project

A new corpus of American English dialects

Cynthia G. Clopper, David Pisoni

Research output: Contribution to journal › Article

34 Citations (Scopus)

Abstract

Perceptual and acoustic research on dialect variation in the United States requires an appropriate corpus of spoken language materials. Existing speech corpora that include dialect variation are limited by poor recording quality, small numbers of talkers, and/or small samples of speech from each talker. The Nationwide Speech Project corpus was designed to contain a large amount of speech produced by male and female talkers representing the primary regional varieties of American English. Five male and five female talkers from each of six dialect regions in the United States were recorded reading words, sentences, passages, and in interviews with an experimenter, using high quality digital recording equipment in a sound-attenuated booth. The resulting corpus contains nearly an hour of speech from each of the 60 talkers that can be used in future research on the perception and production of dialect variation.

Original language: English
Pages (from-to): 633-644
Number of pages: 12
Journal: Speech Communication
Volume: 48
Issue number: 6
DOI: 10.1016/j.specom.2005.09.010
State: Published - Jun 2006

Keywords

  • American English
  • Dialect variation
  • Speech corpus

ASJC Scopus subject areas

  • Signal Processing
  • Electrical and Electronic Engineering
  • Experimental and Cognitive Psychology
  • Linguistics and Language

Cite this

The Nationwide Speech Project : A new corpus of American English dialects. / Clopper, Cynthia G.; Pisoni, David.

In: Speech Communication, Vol. 48, No. 6, 06.2006, p. 633-644.

@article{f4f9a0e844df48bebea7c4a39fd4302a,
title = "The Nationwide Speech Project: A new corpus of American English dialects",
abstract = "Perceptual and acoustic research on dialect variation in the United States requires an appropriate corpus of spoken language materials. Existing speech corpora that include dialect variation are limited by poor recording quality, small numbers of talkers, and/or small samples of speech from each talker. The Nationwide Speech Project corpus was designed to contain a large amount of speech produced by male and female talkers representing the primary regional varieties of American English. Five male and five female talkers from each of six dialect regions in the United States were recorded reading words, sentences, passages, and in interviews with an experimenter, using high quality digital recording equipment in a sound-attenuated booth. The resulting corpus contains nearly an hour of speech from each of the 60 talkers that can be used in future research on the perception and production of dialect variation.",
keywords = "American English, Dialect variation, Speech corpus",
author = "Clopper, {Cynthia G.} and Pisoni, David",
year = "2006",
month = "6",
doi = "10.1016/j.specom.2005.09.010",
language = "English",
volume = "48",
pages = "633--644",
journal = "Speech Communication",
issn = "0167-6393",
publisher = "Elsevier",
number = "6",

}

TY - JOUR

T1 - The Nationwide Speech Project

T2 - A new corpus of American English dialects

AU - Clopper, Cynthia G.

AU - Pisoni, David

PY - 2006/6

Y1 - 2006/6

AB - Perceptual and acoustic research on dialect variation in the United States requires an appropriate corpus of spoken language materials. Existing speech corpora that include dialect variation are limited by poor recording quality, small numbers of talkers, and/or small samples of speech from each talker. The Nationwide Speech Project corpus was designed to contain a large amount of speech produced by male and female talkers representing the primary regional varieties of American English. Five male and five female talkers from each of six dialect regions in the United States were recorded reading words, sentences, passages, and in interviews with an experimenter, using high quality digital recording equipment in a sound-attenuated booth. The resulting corpus contains nearly an hour of speech from each of the 60 talkers that can be used in future research on the perception and production of dialect variation.

KW - American English

KW - Dialect variation

KW - Speech corpus

UR - http://www.scopus.com/inward/record.url?scp=33646253193&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=33646253193&partnerID=8YFLogxK

U2 - 10.1016/j.specom.2005.09.010

DO - 10.1016/j.specom.2005.09.010

M3 - Article

VL - 48

SP - 633

EP - 644

JO - Speech Communication

JF - Speech Communication

SN - 0167-6393

IS - 6

ER -