What were we all looking at? Identifying objects of collective visual attention

Zhong Ma,Stephen Vickers,Howell O. Istance,Stephen Ackland,Xinbo Zhao,Wenhu Wang

What were we all looking at? Identifying objects of collective visual attention

2016

We aim to identify the salient objects in an image by applying a model of visual attention. We automate the process by predicting those objects in an image that are most likely to be the focus of someone's visual attention. Concretely, we first generate fixation maps from the eye tracking data, which express the ground truth of people's visual attention for each training image. Then, we extract the high-level features based on the bag-of-visual-words image representation as input attributes along with the fixation maps to train a support vector regression model. With this model, we can predict a new query image's saliency. Our experiments show that the model is capable of providing a good estimate for human visual attention in test images sets with one salient object and multiple salient objects. In this way, we seek to reduce the redundant information within the scene, and thus provide a more accurate depiction of the scene.

Keywords:

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations