Action recognition in video using a spatial-temporal graph-based feature representation

Iveel Jargalsaikhan,Suzanne Little,Remi Trichet,Noel E. OConnor

Action recognition in video using a spatial-temporal graph-based feature representation

2015

We propose a video graph based human action recognition framework. Given an input video sequence, we extract spatio-temporal local features and construct a video graph to incorporate appearance and motion constraints to reflect the spatio-temporal dependencies among features. them. In particular, we extend a popular dbscan density-based clustering algorithm to form an intuitive video graph. During training, we estimate a linear SVM classifier using the standard Bag-of-words method. During classification, we apply Graph-Cut optimization to find the most frequent action label in the constructed graph and assign this label to the test video sequence. The proposed approach achieves state-of-the-art performance with standard human action recognition benchmarks, namely KTH and UCF-sports datasets and competitive results for the Hollywood (HOHA) dataset.

Keywords:

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations