Monolingual text corpus Languages
English
Language Script: Latn
Linguality Linguality type: Monolingual
Size Character encoding
UTF - 8
(131,542 Words)
Domains Modalities
Recordings of interviews with Polish learners of English.
Spoken Language
Annotation Speech Annotation - Orthographic Transcription Annotated elements: Mispronunciations
StandOff: True
Segmentation level: Utterance, Word
Format: text/xml
Standard practices conformance: TEI_P5
Annotation Mode: Manual (All personal information has been anonymised.)
Start date: 01/07/2011
End date: 01/11/2013
Size:
131,542 Words
Speech Annotation - Phonetic Transcription Annotated elements: Mispronunciations
StandOff: True
Segmentation level: Word
Format: text/xml
Standard practices conformance: TEI_P5
Annotation Mode: Manual (All personal information has been anonymised.)
Start date: 01/07/2011
End date: 01/11/2013
Size:
131,542 Words
Speech Annotation - Sound To Text Alignment Annotated elements: Mispronunciations
StandOff: True
Segmentation level: Utterance, Word
Format: text/xml
Standard practices conformance: TEI_P5, Other
Annotation Mode: Manual (All personal information has been anonymised.)
Start date: 01/07/2011
End date: 01/11/2013
Size:
131,542 Words
Geographic coverage
Poland
(131,542 Words)
Creation Creation mode details: Recordings of interviews with Polish learners of English.
Creation mode: Manual
Creation Tools Monolingual audio corpus Languages
English
(131,542 Words)
Linguality Linguality type: Monolingual
Size Audio duration
15 Hours
Domains Modalities
Recordings of interviews with Polish learners of English.
(131,542 Words)
Spoken Language
(131,542 Words)
Classification
(131,542 Words)
Register: informal
Audio genre: Speech
Speech genre: Conversation
Content Speech items: Free Speech
Non-speech items: Noise
Noise Level: Medium
Setting Naturality: Assisted
Conversational type: Multilogue
Interactivity: Overlapping
Audio Formats audio/wav
(131,542 Words)
Compression: False
Recording quality: Medium
Quantization: 16
Number of tracks: 1
Sampling rate: 44100
Signal encoding: LinearPCM
Geographic coverage
Poland
(131,542 Words)
Recording Recorders
Recording environment: Other
Recording device type: Hard Disk
Capture Capturing device type details: Conversations were captured using an audio console set with external microphones or a voice recorder.
Capturing device type: Microphone
Capturing environment: Complex
Person SourceSet Origin of persons: Native
Sex of persons: Mixed
Number of persons: 119
Age range end: 45
Age range start: 8
Geographic distribution of persons: Łódź region.
Creation Creation mode details: Recordings of interviews with Polish learners of English.
Creation mode: Manual
Creation Tools