An application of multi-objective reinforcement learning for efficient model-free control of canals deployed with IoT networks

2021 
Abstract Canals have been widely constructed to deliver water from rich areas to poor areas to ease water shortages. Efficient controlling of canals is essential for high-performance water diversion. Many efforts have been made and lots of model-based approaches have been proposed to achieve efficient control of canals. However, when reliable physical models are unavailable or the performances of predictive models are unpromising for model-based approaches, model-free approaches (e.g., reinforcement learning, genetic algorithm) could be adopted to obtain desirable control policies. Most canal-control approaches based on reinforcement learning are developed by means of manually designed reward functions, which could face difficulties especially when multiple control objectives of canals need to be satisfied. This paper proposes a novel control approach where a Reward Network (R-Network) is developed to collaborate with the deep Q learning network. The weights of sub-rewards corresponding to multiple objectives are learned via interacting with the environment (canal simulator), and the R-Network approximates the reward function and outputs desired rewards for the reinforcement learning agent to obtain desired control policies. Extensive experiments are conducted via numerical simulation on the case study of the Chinese South to North Water Diversion Project, which is deployed with numerous sensors connected as Internet of Things. Experimental results show that our approach can achieve desirable performance improvements over the state-of-the-art model-free approaches under different test scenarios.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    42
    References
    0
    Citations
    NaN
    KQI
    []