A DISCOURSE CODING SCHEME FOR CONVERSATIONAL SPANISH

1998 
This paper describes a three-level discourse coding scheme that we have devised for manual tagging of the CallHome Spanish (CHS) and CallFriend Spanish (CFS) databases used in the CLARITY project. The goal of CLARITY is to explore the use of discourse structure in understanding conversational speech. The project combines empirical methods for dialogue processing with state-of-the-art LVCSR (using the JANUS recognizer). The three levels of the coding scheme are (1) a speech act level consisting of a tag set extended from DAMSL and Switchboard; (2) a dialogue game level defined by initiative and intention; and (3) an activity level defined within topic units. The manually tagged dialogues are used to train automatic classifiers. We present preliminary results for statement categorization, and give an in-progress report of automatic speech act classification and topic boundary identification.