Evaluating the Use of Policy Gradient Optimization Approach for Automatic Cloud Resource Provisioning

2020 
Reinforcement learning is a very active field of research with many practical applications. Success in many cases is driven by combining it with Deep Learning. In this paper we present results of our attempt to use modern advancements in this area for automated management of resources used to host distributed software. We describe the use of three policy training algorithms from the policy gradient optimization family, to create a policy used to control the behavior of an autonomous management agent. The agent is interacting with a simulated cloud computing environment, which is processing a stream of computing jobs. We discuss and compare the policy performance aspects and the feasibility to use them in real-world scenarios.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    28
    References
    3
    Citations
    NaN
    KQI
    []