TR-A-0075

TR-A-0075 :1990.2.21

Tatsuya Hirahara

HMM Speech Recognition Using DFT and Auditory Spectrograms (Part 2)

Abstract: In our previous report (Patterson and Hirahara, 1989, ATR Technical Report TR-A-0063), we showed some results for HMM /b,d,g/ phoneme recognition using DFT and SAS, auditory spectrograms. As it was a very preliminary report, many tests remained to be done:

1. recognition tests with added independent pink noise.

2. recognition tests with different S/N ratios.

3. recognition tests with a larger phoneme set.

4. recognition tests training HMM on both clean and noisy tokens.

5. recognition tests with different reference vector sizes.

6. optimization of the auditory model parameters.

In this report, we focus on the first three problems. The fourth and fifth concern the HMM phoneme recognition system itself. As these topics are interesting from a practical viewpoint, we leave them to the speech recognition researchers. The sixth is a fundamental problem. However, all experiments would have to be repeated whenever the auditory model is retuned, so we decided not to address it at the moment.
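
To illustrate the first two tests listed above, the following is a minimal sketch of how independent pink noise might be added to a speech token at a chosen S/N ratio. It is not the procedure used in this report: the function names, the Voss-style pink-noise approximation, the stand-in sine-wave token, and the S/N values are all assumptions made for illustration.

```python
import math
import random


def mean_power(x):
    """Average power (mean of squared samples) of a sequence."""
    return sum(v * v for v in x) / len(x)


def pink_noise(n, rng, n_rows=16):
    """Approximate 1/f noise with a Voss-style sum of white sources.

    Row k is redrawn every 2**k samples, so slower rows contribute
    more low-frequency energy. This is an assumed generator, not the
    one used in the report.
    """
    rows = [0.0] * n_rows
    out = []
    for i in range(n):
        for k in range(n_rows):
            if i % (1 << k) == 0:
                rows[k] = rng.uniform(-1.0, 1.0)
        out.append(sum(rows))
    mean = sum(out) / n
    return [v - mean for v in out]  # remove the DC offset


def scale_noise_to_snr(signal, noise, snr_db):
    """Scale noise so that 10*log10(P_signal / P_noise) equals snr_db."""
    target_noise_power = mean_power(signal) / (10.0 ** (snr_db / 10.0))
    gain = math.sqrt(target_noise_power / mean_power(noise))
    return [gain * v for v in noise]


def mix_at_snr(signal, noise, snr_db):
    """Return signal plus noise scaled to the requested S/N ratio."""
    scaled = scale_noise_to_snr(signal, noise, snr_db)
    return [s + v for s, v in zip(signal, scaled)]


if __name__ == "__main__":
    rng = random.Random(0)
    # Hypothetical stand-in for a speech token: a 440 Hz tone at 16 kHz.
    token = [math.sin(2 * math.pi * 440 * t / 16000) for t in range(2000)]
    noise = pink_noise(len(token), rng)
    for snr_db in (20.0, 10.0, 0.0):  # assumed S/N values
        noisy = mix_at_snr(token, noise, snr_db)
        print(snr_db, len(noisy))
```

Because the noise is scaled against the power of each token, the S/N ratio is defined per token. An experiment could equally fix the noise level across tokens, and that choice would affect how results at different S/N ratios compare.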