A large-scale spontaneous speech database covering Japan
- Towards a speech recognition system beyond speaker differences -
Tomoko Matsui, Department 1, ATR Interpreting Telecommunications Research Laboratories
Speech recognition systems recognize some voices easily and others poorly, and
every speech input system suffers from this speaker variability. Which voices are
easy or difficult to recognize, and which factors cause the difference? This paper
introduces a large Japanese speech database collected at ATR Interpreting
Telecommunications Research Laboratories to investigate these questions and to
realize a speech recognition system that works without any selection of speakers.