Christian BOITET & Mutsuko TOMOKIYO
Towards ambiguity labelling for the study
of interactive disambiguation methods
Abstract: This report has been prepared in the context of the MIDDIM project (ATR-CNRS). It introduces the
concept of "ambiguity labelling" and proposes a precise, text-processor-oriented format for labelling
"pieces" such as dialogues and texts. Several notions concerning ambiguities are made precise, and
many examples are given. The ambiguities labelled are meant to be those that state-of-the-art
speech analyzers are believed unable to solve, and which would therefore have to be solved interactively
to produce the correct analysis. The proposed labelling has been specified with a view to storing the
labelled pieces in a database, in order to estimate the frequency of various types of ambiguities, the
importance of solving them in the envisaged contexts, the scope of disambiguation decisions, and the
knowledge needed for disambiguation. A complete example is given. Finally, an equivalent
database-oriented format is sketched.