Reinforcement learning for optimal policy learning in condition-based maintenance

Aniket Adsule,Makarand S. Kulkarni,Asim Tewari

Reinforcement learning for optimal policy learning in condition-based maintenance

2020

Aniket Adsule
Makarand S. Kulkarni
Asim Tewari

Condition-based maintenance (CBM) involves taking decisions on maintenance or repair based on the actual deterioration conditions of the components. The long-run average cost is minimised by choosing the right maintenance action at the right time. In this study, the CBM decision-making problem is modelled as a continuous semi-Markov decision process (CSMDP). It consists of a chain of states representing various stages of deterioration, a set of maintenance actions, their costs and scheduled inspection policy. The application of a reinforcement learning (RL) algorithm based on the average reward for CSMDPs in CBM is described. The RL algorithm is used to learn the optimal maintenance decisions and inspection schedule based on the current health state of the component.

Keywords:

Policy learning
Operations research
Reinforcement learning
decision process
Condition-based maintenance
Optimal maintenance
Average cost
Maintenance actions
Computer science

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations