Optimising figure of merit for phonetic spoken term detection

Roy Wallace, Robert J. Vogt, Brendan Baker, Sridha Sridharan

2010

This paper introduces a novel technique to directly optimise the Figure of Merit (FOM) for phonetic spoken term detection (STD). The FOM is a popular measure of STD accuracy, making it an ideal candidate for use as an objective function. A simple linear model is introduced to transform the phone log-posterior probabilities output by a phone classifier into enhanced log-posterior features that are more suitable for the STD task. The FOM is then optimised directly by training the parameters of this model with a non-linear gradient descent algorithm. A substantial relative FOM improvement of 11% is achieved on held-out evaluation data, demonstrating the generalisability of the approach.
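
As a rough illustration of the idea, the sketch below applies a linear transform to one frame of phone log-posteriors and performs a single parameter update. It is a minimal sketch under stated assumptions, not the authors' implementation: the function names, the square-matrix form of the model, the identity initialisation, and the example values are all hypothetical, and a plain gradient ascent step with a caller-supplied gradient stands in for the non-linear gradient descent algorithm and FOM-based objective that the abstract mentions but does not specify.

```python
# Hypothetical sketch: names, shapes and values are illustrative only
# and are not taken from the paper.

def identity(n):
    """Return an n x n identity matrix (a transform that changes nothing)."""
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]


def enhance(W, log_posteriors):
    """Apply the linear model W to one frame of phone log-posteriors."""
    return [sum(w * x for w, x in zip(row, log_posteriors)) for row in W]


def ascent_step(W, grad, step_size):
    """One plain gradient ascent update of W.

    `grad` is assumed to hold d(objective)/dW, computed elsewhere.
    This is a stand-in for the paper's optimiser, not a copy of it.
    """
    return [[w + step_size * g for w, g in zip(w_row, g_row)]
            for w_row, g_row in zip(W, grad)]


frame = [-0.2, -2.3, -3.0]   # made-up log-posteriors for three phones
W = identity(3)              # assumed starting point: features unchanged
enhanced = enhance(W, frame)
```

Starting from the identity means the enhanced features initially equal the classifier's own output, so any change in the objective during training can be attributed to the learned transform.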

Keywords:

Speech processing
Linear model
Gradient descent
Phone
Artificial intelligence
Figure of merit
Pattern recognition
Computer science
