German Polyphone Database (SpeechDat(M)) DB1

http://catalog.elra.info/product_info.php?products_id=59

ID:

ELRA-S0018

The database consists of read speech. A prompt sheet with a unique identification number was distributed to the potential callers.
The speech data was recorded over digital lines (ISDN), resulting in 8-bit A-law format at an 8 kHz sampling rate. The data collection comprises 1000 speakers, with particular care taken to balance gender. Callers were to be between 16 and 65 years old (no controlled distribution).
Callers could call from any kind of acoustic and network environment: home, business, mobile phone, phone booth, wired or cordless phone, etc. (No controlled distribution).
The regional distribution was expected to follow this scheme: 32 speakers from each of the 16 German states. Speakers from Austria, Switzerland and other countries were not controlled. The utterances to be collected were specified in advance and consisted of several speech sequences: sentences from different sources (local newspapers, existing corpora, law articles, etc.) to ensure good phonetic coverage, application words from a defined list of command words, digits (isolated digits, connected digits, and natural numbers), currency amounts, quantities, credit card numbers, spelled words (mainly names), time of day (spontaneous) and time phrases (prompted, word style), city of call/birth, etc.
A pronunciation lexicon with a phonemic transcription in SAMPA is also included.
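
The regional scheme above can be checked with a little arithmetic. The sketch below is illustrative only: it treats 32 as a per-state minimum, which is an assumption (the description does not say whether 32 is a minimum or an exact figure), and the state names and the flat list of per-speaker state labels are hypothetical, not the corpus's actual metadata layout.

```python
from collections import Counter

# Figures taken from the description above.
STATES = 16
QUOTA_PER_STATE = 32
TOTAL_SPEAKERS = 1000


def quota_total(states: int = STATES, per_state: int = QUOTA_PER_STATE) -> int:
    """Number of speakers accounted for by the per-state scheme."""
    return states * per_state


def states_below_quota(speaker_states, expected_states, quota=QUOTA_PER_STATE):
    """Return {state: count} for every expected state with fewer than `quota` speakers.

    `speaker_states` is a hypothetical iterable with one state label per speaker.
    States with no speakers at all are reported with a count of 0.
    """
    counts = Counter(speaker_states)
    return {s: counts[s] for s in expected_states if counts[s] < quota}


if __name__ == "__main__":
    covered = quota_total()
    print(f"scheme covers {covered} of {TOTAL_SPEAKERS} speakers")
    print(f"not allocated by the scheme: {TOTAL_SPEAKERS - covered}")
```

Read this way, the scheme fixes the origin of 512 of the 1000 speakers; the description does not state how the remaining speakers are distributed.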

The German SpeechDat(M) database contains the recordings of 1,000 German speakers from the 16 German states, who were recorded over the fixed telephone network. Particular care was taken to balance the gender (male, female) and the age (between 16 and 65) of the speakers.

The database consists of read speech. A prompt sheet with a unique identification number was distributed to the potential callers. The speech files are stored as sequences of 8-bit, 8 kHz A-law samples.
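
As a rough illustration of what "8-bit, 8 kHz A-law samples" means in practice, the following sketch expands A-law bytes to 16-bit linear PCM using the standard G.711 expansion. It assumes the speech files are headerless raw sample streams, which the description suggests but does not state explicitly; the function names are hypothetical and not part of any SpeechDat tooling.

```python
def alaw_to_linear(byte: int) -> int:
    """Expand one G.711 A-law byte (0..255) to a signed 16-bit linear sample."""
    a = byte ^ 0x55                 # undo the even-bit inversion applied by the encoder
    t = (a & 0x0F) << 4             # 4-bit mantissa
    seg = (a & 0x70) >> 4           # 3-bit segment (exponent)
    if seg == 0:
        t += 8
    elif seg == 1:
        t += 0x108
    else:
        t = (t + 0x108) << (seg - 1)
    return t if a & 0x80 else -t    # sign bit set means a positive sample


def decode_alaw(data: bytes) -> list[int]:
    """Decode a raw A-law byte string into a list of linear samples."""
    return [alaw_to_linear(b) for b in data]


def duration_seconds(data: bytes, sample_rate: int = 8000) -> float:
    """One byte per sample, so duration is the byte count over the sampling rate."""
    return len(data) / sample_rate


if __name__ == "__main__":
    raw = bytes([0xD5, 0x55, 0xAA, 0x2A])
    print(decode_alaw(raw))
    print(duration_seconds(bytes(8000)))
```

Because each sample occupies exactly one byte, 8000 bytes of audio correspond to one second of speech at the stated sampling rate.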

Callers could call from any kind of acoustic and network environment: home, business, mobile phone, phone booth, wired or cordless phone, etc. (No controlled distribution).

The database was validated by SPEX (the Netherlands) to assess its compliance with the SpeechDat format and content specifications.

Each speaker uttered the following items:

* several speech sequences, including sentences from different sources (local newspapers, existing corpora, law articles, etc.) to ensure good phonetic coverage,
* application words from a defined list of command words,
* digits (isolated digits, connected digits, and natural numbers),
* currency amounts,
* quantities,
* credit card numbers,
* spelled words (mainly names),
* time of day (spontaneous) and time phrase (prompted, word style),
* city of call/birth, etc.

A pronunciation lexicon with a phonemic transcription in SAMPA is also provided.

Distribution

Availability: Available

Availability Start date: 01/09/1996

Licences

ELRA VAR

Conditions: Commercial Use

Distribution

Availability: Available

Availability Start date: 01/09/1996

Licences

ELRA END USER

Conditions: Non Commercial Use

Contact Person

Mapelli Valérie

audio

Monolingual audio corpus

Languages

German

Linguality

Linguality type: Monolingual

Size

no size available

Resource Creation

Funding Project

SpeechDat(M)

Funding Type: EU funds

Metadata

Created: 12/05/2005

Version

Version: 1.0

Last Updated: 28/08/2007
