Pseudo Ground Truth Segmentation Mask to Improve Video Prediction Quality

Mu Chien Hsu,Jui Chun Shyur,Hiroshi Watanabe

Pseudo Ground Truth Segmentation Mask to Improve Video Prediction Quality

2020

Mu Chien Hsu
Jui Chun Shyur
Hiroshi Watanabe

Video prediction to foresee future events is an extremely difficult job since it involves spatial feature extraction and temporal sequential analysis. We identified that the semantic information is actually crucial to prediction, and proposed using “pseudo ground truth” segmentation masks which are generated automatically in real time and add them to the input layers as extra information to predict future frames. Experiments conducted on our self-defined network demonstrated drastically higher quality predictions are achieved when compared with other state-of-the-art direct video prediction models.

Keywords:

Semantics
video prediction
Segmentation
semantic information
Artificial intelligence
ground truth segmentation
Computer science
Pattern recognition
Ground truth
Feature extraction

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations