MT-DLA: An Efficient Multi-Task Deep Learning Accelerator Design

Mengdi Wang, Bing Li, Ying Wang, Cheng Liu, Xiaohan Ma, Xiandong Zhao, Lei Zhang

2021

Multi-task learning systems are commonly adopted in real-world AI applications such as intelligent robots and self-driving vehicles. Instead of improving single-network performance, this work proposes a specialized Multi-Task Deep Learning Accelerator architecture, MT-DLA, which improves the performance of concurrent networks by exploiting the features and parameters shared across these models. Our evaluation with realistic multi-task workloads shows that MT-DLA largely eliminates the memory and computation overhead caused by shared parameters, activations, and computation results. In experiments with real-world multi-task learning workloads, MT-DLA delivers a 1.4x-7.0x energy-efficiency improvement over a baseline neural network accelerator without multi-task support.
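
To illustrate why sharing matters, here is a minimal sketch that counts multiply-accumulate (MAC) operations for several task networks with a common front end, once when every task recomputes the shared layers and once when they are computed a single time and reused. The layer sizes, the function names `mac_count` and `multitask_macs`, and the dense-layer cost model are all assumptions made for illustration. The sketch does not model the MT-DLA hardware or its energy behavior.

```python
def mac_count(layers):
    """Total MACs for a list of dense layers given as (in_features, out_features)."""
    return sum(n_in * n_out for n_in, n_out in layers)


def multitask_macs(shared_layers, task_heads, reuse_shared):
    """MACs for one input across all tasks.

    If reuse_shared is False, each task recomputes the shared layers
    (a baseline without multi-task support). If True, the shared layers
    run once and every task head consumes the same result.
    """
    shared_cost = mac_count(shared_layers)
    head_cost = sum(mac_count(head) for head in task_heads)
    shared_runs = 1 if reuse_shared else len(task_heads)
    return shared_runs * shared_cost + head_cost


# Hypothetical workload: three tasks sharing a two-layer front end.
shared_layers = [(1024, 512), (512, 256)]
task_heads = [[(256, 10)], [(256, 4)], [(256, 2)]]

baseline = multitask_macs(shared_layers, task_heads, reuse_shared=False)
shared = multitask_macs(shared_layers, task_heads, reuse_shared=True)
print(baseline, shared, baseline / shared)
```

For this made-up workload the baseline needs 1,970,176 MACs and the shared version 659,456, roughly a 3x reduction in operations. That figure reflects only the toy layer sizes chosen here and is unrelated to the energy-efficiency range reported above.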

Keywords:

Overhead (computing)
task
Applications of artificial intelligence
Multi-task learning
Deep learning
Artificial intelligence
Feature (machine learning)
Artificial neural network
Computer architecture
Efficient energy use
Computer science

