TR-H-0053

TR-H-0053 :1994.2.15

相川清明,山田玲子

英語/r/-/l/非類似度計測における各種のスペクトル表現の比較

Abstract:日本人被験者による英語/r/-/l/知覚実験における正答率は発話者に依存する。本報告はこの発話者依存性を/r/-/l/対の音響的非類似度で説明する試みについて述べる。/r/-/l/対の音響的非類似度を動的ケプストラムを用いて計測すると知覚実験による正答率ときわめて相関の高い発話者依存性パターンを示すことがわかった。基本パラメータとしてケプストラム、重み付きケプストラム、メルケプストラムの3種類を用い、これらのパラメータ自身、及び、これより導かれる動的ケプストラム、デルタケプストラム、組合せにして9種類のスペクトル表現の非類似度計測における有効性を比較する。知覚実験としては訓練を伴う場合と伴わない場合の2通りについて行った。部分的音響的非類似性分析により、日本人は/r/,/l/ 直後250ms程度の区間の調音結合により変形した後続音素を利用して/r/-/l/を聞き分けている可能性があることが示唆された。

The talker dependency of the correct response rate on the English/r/-/l/ perception by Japanese listeners has been reported. This paper reports that the acoustical dissimilarity of an/r/-/l/ minimal pair measured by a dynamic cepstrum shows a talker dependency pattern closely correlated with the talker dependency pattern of the correct response rate. Nine spectral representations are compared in measuring the /r/-/l/ acoustical dissimilarity. The nine spectral representations include three base parameters, a cepstrum, a weighted cepstrum, and a mel-cepstrum, and dynamic cepstra and delta cepstra. derived from the three base parameters. The experimental results imply that Japanese listeners tend to identify /r/ or /l/ using the succeeding vowels affected by coarticulations.