Farsdat (Farsi Speech Database)

177 Last view: 2026-07-03

Farsdat (Farsi Speech Database)

http://catalog.elra.info/product_info.php?products_id=18

ID:

ELRA-S0112

The Persian Speech Database Farsdat comprises the recordings of 300 Iranian speakers, who differ from each other with regards to age, sex, education level, and dialect (10 dialect regions of Iran were represented: Tehrani, Torki, Esfahani, Jonubi, Shomali, Khorassani, Baluchi, Kordi, Lori, and Yazdi). Each speaker uttered 20 sentences in two sessions, and 100 of these speakers uttered 110 isolated words. 6000 utterances were segmented and labelled phonetically and phonemically manually, including 386 phonetically balanced sentences, using IPA characters. The acoustic signal has been stored with a Wave file standard, so that it can be used by any other application software. The used sampling frequency reaches 22.5 KHz, and the signal-to-noise ratio 34 dB. The ambiguities in segmentation have been solved by reference to the corresponding spectrograms extracted from DSP sona-Graph KAY 5500.

View resource description in all available languages

La base de données Farsdat comprend les enregistrements de 300 locuteurs iraniens, hommes et femmes, d'âge et de niveau d'éducation différents, et représentant 10 dialectes régionaux d'Iran (régions de Tehrani, Torki, Esfahani, Jonubi, Shomali, Khorassani, Baluchi, Kordi, Lori, and Yazdi). Chaque locuteur a prononcé 20 phrases, en deux sessions. 100 de ces locuteurs ont prononcé chacun 110 mots isolés. En tout, 6000 éléments énoncés ont été segmentés et annotés aux niveaux phonémique et phonétique à l'aide des symboles de l'API. Les fichiers son ont été enregistrés au format wave (.wav), et peuvent ainsi tourner avec de nombreuses applications. La fréquence d'échantillonnage est de 22,5 KHz, et le rapport signal-bruit de 34 dB. Les ambiguïtés au niveau de la segmentation ont été levées grâce aux spectrogrammes issus du DSP sona-Graph KAY 5500.

You don’t have the permission to edit this resource.

DistributionAvailability

Available - Restricted Use

Start date: 10/07/2001

Licence

ELRA END USER

Restrictions: Academic - Non Commercial Use

For Non Members of ELRA

User Nature: Commercial

ELRA VAR

Restrictions: Commercial Use

For Members of ELRA

User Nature: Commercial

ELRA END USER

Restrictions: Academic - Non Commercial Use

For Members of ELRA

User Nature: Commercial

ELRA VAR

Restrictions: Commercial Use

For Members of ELRA

User Nature: Academic

ELRA END USER

Restrictions: Academic - Non Commercial Use

For Members of ELRA

User Nature: Academic

ELRA VAR

Restrictions: Commercial Use

For Non Members of ELRA

User Nature: Commercial

ELRA VAR

Restrictions: Commercial Use

For Non Members of ELRA

User Nature: Academic

ELRA END USER

Restrictions: Academic - Non Commercial Use

For Non Members of ELRA

User Nature: Academic

Contact Person

Mapelli Valérie

audio

Monolingual audio corpusLanguages

Persian

Linguality

Linguality type: Monolingual

Size

no size available

Metadata

Created: 12/05/2005

Version

Version: 1.0

Last Updated: 22/02/2007

People who looked at this resource also viewed the following: