An Intelligent Toolkit for Benchmarking Data-Driven Aerospace Prognostics

2019 
Machine Learning (ML) has been largely employed to sensor data for predicting the Remaining Useful Life (RUL) of aircraft components with promising results. A review of the literature, however, has revealed a lack of consensus regarding evaluation metrics adopted, the state-of-the-art methods employed for performance comparison, the approaches to address data overfitting, and statistical tests to assess results’ significance. These weaknesses in methodological approaches to experimental design, results evaluation, comparison and reporting of findings can result in misleading outcomes and ultimately produce less effective predictors. Arbitrary choices of approaches for novel method’s evaluation, the potential bias that can be introduced, and the lack of systematic replication and comparison of outcomes might affect the findings reported and misguide future research. For further advances in this area, there is therefore an urgent need for appropriate benchmarking methodologies to assist evaluating novel methods and to produce fair performance rankings. In this paper we introduce an open-source, extensible benchmarking library to address this gap in aerospace prognosis. The library will assist researchers to conduct a proper and fair evaluation of their novel ML RUL predictive models. In addition, it will assist stimulating better practices and a more rigorous experimental design approach across the field. Our library contains 13 state-of-the-art ML methods, 12 metrics for algorithm performance evaluation and tests for statistical significance. To demonstrate the library’s functionalities, we apply it to gas turbine engine prognostic datasets.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    26
    References
    1
    Citations
    NaN
    KQI
    []