The Nationwide Speech Project: A new corpus of American English dialects

Cynthia G. Clopper, David B. Pisoni

Research output: Contribution to journal > Article

41 Scopus citations


Perceptual and acoustic research on dialect variation in the United States requires an appropriate corpus of spoken language materials. Existing speech corpora that include dialect variation are limited by poor recording quality, small numbers of talkers, and/or small samples of speech from each talker. The Nationwide Speech Project corpus was designed to contain a large amount of speech produced by male and female talkers representing the primary regional varieties of American English. Five male and five female talkers from each of six dialect regions in the United States were recorded reading words, sentences, and passages, and in interviews with an experimenter, using high-quality digital recording equipment in a sound-attenuated booth. The resulting corpus contains nearly an hour of speech from each of the 60 talkers that can be used in future research on the perception and production of dialect variation.

Original language: English (US)
Pages (from-to): 633-644
Number of pages: 12
Journal: Speech Communication
Issue number: 6
State: Published - Jun 1 2006


Keywords

  • American English
  • Dialect variation
  • Speech corpus

ASJC Scopus subject areas

  • Signal Processing
  • Electrical and Electronic Engineering
  • Experimental and Cognitive Psychology
  • Linguistics and Language
