VOICE FONTS: Creating Speech Synthesis Databases

Voice Fonts: switchable databases for speech synthesis

ATR Interpreting Telecommunications Research Laboratories, Department 2: Ken'ichi Fujisawa, Wei Zhang

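This report describes how to build and label speech databases for speech synthesis. Just as one chooses a "font" in a word processor, speaker characteristics and speaking styles can now be swapped easily.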

We describe the collection and annotation of speech data for use in the CHATR synthesiser. By annotating the prosody and speaking-style characteristics of each corpus of recorded speech, we produce interchangeable "voices" for the synthesiser that can be likened to the interchangeable fonts offered by a word processor.
