Hungarian speech
database for computer-using environment in offices
Klára Vicsi, András Kocsor, Csaba Teleki, László Tóth
Speech databases were recorded in different offices,
laboratories, and homes. Recordings in all scenes were prepared
by using two parallel synchronized recording systems. One of the
recording systems is the so-called reference system, where a close
talking microphone (Monacor EMC 100) and a good quality sound card
(Hercules Muse Pocket USB 5.1) and a laptop (Gericom Webshox) were
used. The second recording system was the so-called varied system,
where different microphones, sound cards and PCs were applied.
Description of the database:
The recording
form was 16 bits and 16 kHz
ˇ
332 speakers;
ˇ
Twelve sentences and twelve
words per speaker from a phonetically balanced text, composed in
accordance with special Hungarian phonetic expectances;
ˇ
Large variations of different
microphones, sound cards and PCs were used;
ˇ
Computer-using environment
in offices, homes and laboratories;
ˇ
The whole material of database
is annotated and one third (100 speakers) is segmented and labeled
by hand.