TR-I-0003=TR-A-0004

TR-I-0003=TR-A-0004 :1987.5

Yoshinori Sagisaka, Kazuya Takeda, Shigeru Katagiri and Hisao Kuwahara

Japanese Speech Database with Fine Acoustic-Phonetic Transcriptions

Abstract: A large-scale Japanese speech database at ATR (JSDB-ATR) is introduced. The speech data are transcribed in multiple ways using acoustic-phonetic symbols, both to support various data access requests and to make fine acoustic-phonetic analysis convenient. For the multiple transcriptions, three types of categories are considered: linguistic and phonemic categories, acoustic event categories, and some allophonic variation categories. The transcriptions were carried out manually by trained labelers using digital sound spectrograms and several acoustic parameters that reflect speech characteristics. To date, about 8500 words, each uttered by eight professional announcers, have been collected, and half of them have been acoustically-phonetically transcribed.
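
As an illustration only, the idea of transcribing one utterance in several parallel layers can be sketched as follows. The abstract does not describe the actual JSDB-ATR file format, so the layer names, the label symbols, the time values and the helper `labels_overlapping` are all hypothetical and chosen just to show how a multi-layer transcription might answer a data access request.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Label:
    """One labeled segment: a time span and a symbol (hypothetical layout)."""
    start_ms: float
    end_ms: float
    symbol: str


# Hypothetical three-layer transcription of a single short utterance.
# Symbols and timings are invented for illustration, not taken from JSDB-ATR.
transcription = {
    "phonemic": [Label(0, 80, "k"), Label(80, 200, "a")],
    "acoustic_event": [
        Label(0, 55, "closure"),
        Label(55, 80, "burst"),
        Label(80, 200, "vowel"),
    ],
    "allophonic": [Label(80, 200, "voiced")],
}


def labels_overlapping(layers, layer, start_ms, end_ms):
    """Return the labels of one layer that overlap the span [start_ms, end_ms)."""
    return [
        lab for lab in layers[layer]
        if lab.start_ms < end_ms and lab.end_ms > start_ms
    ]


# Example request: which acoustic events make up the phoneme "k"?
k = transcription["phonemic"][0]
events = labels_overlapping(transcription, "acoustic_event", k.start_ms, k.end_ms)
print([e.symbol for e in events])
```

Keeping each category in its own time-aligned layer means a query against one layer (here, a phoneme) can be resolved against any other layer without changing the stored labels.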