Yoshinori Sagisaka, Kazuya Takeda, Shigeru Katagiri and Hisao Kuwahara
Japanese Speech Database with Fine Acoustic-Phonetic Transcriptions
Abstract: A large-scale Japanese speech database at ATR (JSDB-ATR) is
introduced. The speech data are transcribed in multiple ways using
acoustic-phonetic symbols, both to serve various data access requests and to support
fine acoustic-phonetic analysis. For the multiple transcriptions, three types of categories
are considered: linguistic and phonemic categories, acoustic event categories and
some allophonic variation categories. These transcriptions were carried out
manually by trained labelers using digital sound spectrograms and several
acoustic parameters that reflect speech characteristics. To date, about 8500
words uttered by each of eight professional announcers have been collected,
and half of them have been given acoustic-phonetic transcriptions.