Interactive Pattern Recognition applied to Natural Language Processing
2010
This thesis is about Pattern Recognition. In the last decades, huge
efforts have been made to develop automatic systems able to rival
human capabilities in this field. Although these systems achieve high
productivity rates, they are not precise enough in most
situations. Humans, on the contrary, are very accurate but
comparatively quite slower. This poses an interesting question: the
possibility of benefiting from both worlds by constructing
cooperative systems.
This thesis presents diverse contributions to this kind of
collaborative approach. The point is to improve the Pattern
Recognition systems by properly introducing a human operator into the
system. We call this Interactive Pattern Recognition (IPR).
Firstly, a general proposal for IPR will be stated. The aim is to
develop a framework to easily derive new applications in this
area. Some interesting IPR issues are also introduced. Multi-modality
or adaptive learning are examples of extensions that can naturally fit
into IPR.
In the second place, we will focus on a specific application. A novel
method to obtain high quality speech transcriptions (CAST, Computer
Assisted Speech Transcription). We will start by proposing a CAST
formalization and, next, we will cope with different implementation
alternatives. Practical issues, as the system response time, will be
also taken into account, in order to allow for a practical
implementation of CAST. Word graphs and probabilistic error
correcting parsing are tools that will be used to reach an
alternative formulation that allows for the use of CAST in a real
scenario.
Afterwards, a special application within the general IPR
framework will be discussed. This is intended to test the IPR
capabilities in an extreme environment, where no input pattern
is available and the system only has access to the user actions to
produce a hypothesis. Specifically, we will focus here on providing
assistance in the problem of text generation. The use of adaptive
learning in this scenario will be emphasized. Besides, two
derived applications will be also considered. Notably, the use of
text prediction for information retrieval systems.
In addition, we will pose an interesting question about IPR
systems. The inclusion of multi-modality as a natural part of IPR. The
design of a speech input interface for Computer Assisted Translation
(CAT) will be addressed. To this end, we will describe several
interaction scenarios, which facilitate the speech recognition process
by taking advantage of the CAT environment.
Finally, a set of prototypes that include the main features of the
work here developed will be presented. The main motivation is to
provide real examples about the feasibility of implementing the
techniques here described.
Keywords:
- Correction
- Source
- Cite
- Save
- Machine Reading By IdeaReader
49
References
3
Citations
NaN
KQI