Runtime DNN performance scaling through resource management on heterogeneous embedded platforms
2021
DNN inference is increasingly being executed locally on embedded platforms, due to the clear advantages in latency, privacy and connectivity. Because modern SoCs typically execute a combination of different and dynamic workloads concurrently, it is challenging to consistently meet latency/energy budgets: the local computing resources available to the DNN vary considerably. In this poster, we show how resource management can be applied to optimise the performance of DNN workloads by monitoring and tuning both software and hardware constantly at runtime. This work shows how dynamic DNNs trade off accuracy with latency/energy/power on heterogeneous embedded CPU-GPU platforms.
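The poster itself gives no implementation detail, but the idea of monitoring at runtime and trading accuracy for latency can be illustrated with a minimal sketch. The sketch below assumes a set of DNN variants ordered by cost/accuracy and a latency budget; a simple governor measures per-inference latency and switches variants as available resources change. All names (LatencyGovernor, budget_s, the sleep-based stand-in workloads) are hypothetical and not from the poster.

```python
import time
from typing import Callable, List

class LatencyGovernor:
    """Hypothetical runtime governor: picks a DNN variant to meet a latency budget."""

    def __init__(self, variants: List[Callable[[], None]], budget_s: float):
        # Variants are ordered from cheapest/least accurate to most accurate.
        self.variants = variants
        self.budget_s = budget_s
        self.level = len(variants) - 1  # start with the most accurate variant

    def infer(self) -> float:
        """Run one inference with the current variant, then adapt the level."""
        start = time.perf_counter()
        self.variants[self.level]()
        latency = time.perf_counter() - start

        # Over budget: fall back to a cheaper variant (trade accuracy for speed).
        if latency > self.budget_s and self.level > 0:
            self.level -= 1
        # Comfortably under budget: step up to a more accurate variant.
        elif latency < 0.5 * self.budget_s and self.level < len(self.variants) - 1:
            self.level += 1
        return latency

if __name__ == "__main__":
    # Stand-in workloads: sleeps emulate small/medium/large DNN variants.
    variants = [
        lambda: time.sleep(0.01),
        lambda: time.sleep(0.03),
        lambda: time.sleep(0.08),
    ]
    gov = LatencyGovernor(variants, budget_s=0.05)
    for i in range(10):
        lat = gov.infer()
        print(f"step {i}: variant {gov.level}, latency {lat * 1000:.1f} ms")
```

In a real deployment the same feedback loop would also steer hardware knobs (e.g. CPU/GPU frequency or core mapping) alongside the model choice, which is the software-plus-hardware tuning the abstract describes.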