Continual Learning Without Knowing Task Identities: Do Simple Models Work?

2021 
Due to the catastrophic forgetting phenomenon of deep neural networks (DNNs), models trained in standard ways tend to forget what they have learned from previous tasks, especially when a new task differs substantially from the previous ones. To overcome this issue, various continual learning techniques have been developed in recent years; however, they often suffer from substantially increased model complexity and training time. In this paper, we consider whether properly tailored simple models can perform well for continual learning. By proposing a relatively simple method based on Bayesian neural networks and model selection, we can in many cases outperform several state-of-the-art techniques in terms of accuracy, model size, and running time, especially when each mini-batch of data is known to come from a single task of unknown identity. This leads to interesting observations suggesting that different continual learning techniques may be beneficial for different types of data and task diversity.
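
The abstract describes task-agnostic inference at the mini-batch level: a separate model is kept per task, and the identity of an incoming batch is inferred by model selection. The sketch below illustrates that general idea only; it is not the paper's implementation. It substitutes simple diagonal-Gaussian class-conditional models for the Bayesian neural networks, and all class and function names (GaussianTaskModel, TaskAgnosticEnsemble) are illustrative assumptions.

```python
# Minimal sketch (not the paper's method): task-agnostic continual learning
# via per-task model selection on a mini-batch. A diagonal-Gaussian
# class-conditional model stands in for a Bayesian neural network.
import numpy as np


class GaussianTaskModel:
    """One generative model per task: a diagonal Gaussian per class."""

    def __init__(self):
        self.classes = {}  # class label -> (mean, var)

    def fit(self, X, y):
        for c in np.unique(y):
            Xc = X[y == c]
            self.classes[c] = (Xc.mean(axis=0), Xc.var(axis=0) + 1e-6)

    def log_likelihood(self, X):
        # Log-likelihood of the batch under this task's model,
        # marginalising over classes with a uniform class prior.
        per_class = []
        for mean, var in self.classes.values():
            ll = -0.5 * (np.log(2 * np.pi * var) + (X - mean) ** 2 / var)
            per_class.append(ll.sum(axis=1))
        per_class = np.stack(per_class, axis=1)  # (batch, n_classes)
        prior = np.log(per_class.shape[1])
        return np.logaddexp.reduce(per_class - prior, axis=1).sum()

    def predict(self, X):
        labels = list(self.classes)
        scores = []
        for c in labels:
            mean, var = self.classes[c]
            ll = -0.5 * (np.log(2 * np.pi * var) + (X - mean) ** 2 / var)
            scores.append(ll.sum(axis=1))
        return np.array(labels)[np.argmax(np.stack(scores, axis=1), axis=1)]


class TaskAgnosticEnsemble:
    """Continual learner: one model per seen task; old models are never
    overwritten, so earlier tasks are not forgotten."""

    def __init__(self):
        self.task_models = []

    def learn_task(self, X, y):
        model = GaussianTaskModel()
        model.fit(X, y)
        self.task_models.append(model)

    def predict(self, X_batch):
        # Model selection: the whole mini-batch is assumed to share one
        # (unknown) task identity, matching the setting in the abstract.
        lls = [m.log_likelihood(X_batch) for m in self.task_models]
        return self.task_models[int(np.argmax(lls))].predict(X_batch)
```

Because the batch is assumed to come from a single task, the per-sample likelihoods accumulate and the task choice becomes more reliable as the batch grows; with batch size 1 this reduces to ordinary task-free inference.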