The Speech Perception Laboratory

Institute for Advanced Study of the Communication Processes (IASCP)

University of Florida

In the process of conducting studies, several speech corpora have been generated that can be made available to other researchers upon request. These include:

1.      The Hindi Stop Corpus (HSC)

Language:

Hindi

Speech Material Type:

Isolated real and nonsense words

Number of Talkers:

Six

Total Number of Stimuli:

3600

General Description:

A database of consonant-vowel (CV) sequences for use in cross-language speech perception, second language training studies, or for acoustic analysis. The sequences include 10 tokens each of all Hindi stop consonants (four voicing categories by five places of articulation) in three vowel contexts (i, a, u). All stimuli have been leveled and verified by native listeners in a forced-choice orthographic classification task.

Status

Complete

 

2.      The Voice Stress Analysis Database (VSAD)

Language:

English

Speech Material Type:

Passages (5-6 sentences; Rainbow Passage); audio and video recordings

Number of Talkers:

48

Total Number of Stimuli:

336

General Description:

A database of speakers who produce truthful and deceptive passages under low and high stress conditions. Stress was measured continuously throughout recording via pulse and GSR. Included also are simulated stress samples and low stress foils.

Status

Complete

 

3.      The Malayalam Consonant Corpus (MCC)

Language:

Malayalam

Speech Material Type:

Isolated real and nonsense words

Number of Talkers:

Five

Total Number of Stimuli:

6,000

General Description:

A database of consonant and vowel sequences (CV, VCV, VCCV) for use in cross-language speech perception, second language training studies, or for acoustic analysis. The sequences include 10 -15 tokens each of all Malayalam consonants in five vowel contexts (i, e, a, o, u). All stimuli were verified by native listeners in a forced-choice orthographic classification task.

Status

Complete

 

4.      The University of Florida Vocal Aging Database (UF-VAD)

Language:

English

Speech Material Type:

Rainbow Passage, Grandfather Passage, sustained vowels, diadodes, SPIN sentences

Number of Talkers:

150

Total Number of Stimuli:

750

General Description:

A diverse range of speech materials from 25 male and 25 female speakers each of three chronological groups (old, middle-aged, young). Perceived age judgments will also be collected for this database

Status

Complete

 

5.      Setswana Consonant Corpus (SCC)

Language:

Setswana

Speech Material Type:

Isolated real and nonsense words

Number of Talkers:

Three

Total Number of Stimuli:

2,082

General Description:

A database of consonant and vowel sequences (CV, VCV). The sequences include two tokens each of Setswana consonants in five vowel contexts (i, e, a, o, u). All stimuli will be leveled and verified by native listeners in a forced-choice orthographic classification task.

Status

Complete

 

 

 



 Information | Research | Vitae | Courses | Links | CSD

 

 

Last Updated 04-01-09