TR-IT-0112 :1995.4.27

Christian BOITET & Mutsuko TOMOKIYO

Towards ambiguity labelling for the study of interactive disambiguation methods

Abstract:This report has been prepared in the context of the MIDDIM project (ATR-CNRS). It introduces the concept of "ambiguity labelling", and proposes a precise text processor oriented format for labelling "pieces" such as dialogues and texts. Several notions concerning ambiguities are made precise, and many examples are given. The ambiguities labelled are meant to be those which state-of-the-art speech analyzers are believed not to be able to solve, and which would have to be solved interactively to produce the correct analysis. The proposed labelling has been specified with a view to store the labelled pieces in a data base, in order to estimate the frequency of various types of ambiguities, the importance to solve them in the envisaged contexts, the scope of disambiguation decisions, and the knowledge needed for disambiguation. A complete example is given. Finally, an equivalent data base oriented format is sketched.