ESprint: QoS-Aware Management for Effective Computational Sprinting in Data Centers

2019 
In the era of 'dark silicon', modern data centers have to provision additional hardware resources to guarantee the Quality of Service (QoS) of applications in case of bursty workloads that typically occur in low frequency but high intensity. Fortunately, Computational Sprinting has proven to be an effective approach to boost the computing performance of many-core processor chips, which allows a chip to exceed its power and thermal limits temporarily by turning on all processor cores and absorbing the extra heat dissipation with novel phase-changing materials. Consequently, it offers a promising way to deal with these occasional workload bursts by unleashing the full potentials of hardware, avoiding deploying extra computing resources. In this work, we propose ESprint, a QoS-aware management system based on an effective feedback control mechanism for latency-critical applications in data centers. ESprint can perform computational sprinting by precisely scheduling core count, frequency levels, and sprinting duration, serving bursty workloads without QoS violation under the thermal constraint. Specifically, ESprint effectively predict load intensity in the next time interval, and further dynamically allocates appropriate computing resources to minimize actual power consumption. Our prototype-based evaluation results show that ESprint achieves up to 1.92x improvement on energy efficiency for typical workloads while ensuring QoS, over the non-sprinting strategy. We also explore the design space among energy efficiency, core count/frequency scaling techniques, workload characteristics, burst intensity, and QoS requirements, and draw several key insights to guide the effective use of computational sprinting in data centers.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    30
    References
    1
    Citations
    NaN
    KQI
    []