Fusion of audio and video information for multi modal person authentication

Benoît Duc,Elizabeth Saers Bigün,Josef Bigun,Gilbert Maître,Stefan Fischer

Fusion of audio and video information for multi modal person authentication

1997

Benoît Duc
Elizabeth Saers Bigün
Josef Bigun
Gilbert Maître
Stefan Fischer

Abstract We present an algorithm functioning as a supervisor module in a multi-expert decision making machine. It uses the Bayes theory in order to estimate the biases of individual expert opinions. The biases are used to calibrate and conciliate expert opinions to a single decision. This supervision technique is applied to the real case of a person authentication technique using two modalities, face and speech. The visual part involves the matching of a coarse grid containing Gabor phase information from face images. The acoustic part is performed by a text-dependent speaker verification system based on Hidden Markov Models. Experimental results show that the proposed fusion method improves the quality of individual expert decisions by reaching success rates of 99.5%.

Keywords:

Artificial intelligence
Authentication
Computer vision
Pattern recognition
Fusion
Bayes' theorem
Grid
Machine learning
Hidden Markov model
Computer science
Bayesian statistics
Supervisor
Modalities
Modal

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations