A Two-Stage Data-Driven Spatiotemporal Analysis to Predict Failure Risk of Urban Sewer Systems Leveraging Machine Learning Algorithms.

2021 
Risk-informed asset management is key to maintaining optimal performance and efficiency of urban sewer systems. Although sewer system failures are spatiotemporal in nature, previous studies analyzed failure risk from a unidimensional aspect (either spatial or temporal), not accounting for bidimensional spatiotemporal complexities. This is owing to the insufficiency of good-quality data, which ultimately leads to under-/overestimation of failure risk. Here, we propose a generalized methodology/framework to facilitate a robust spatiotemporal analysis of urban sewer system failure risk, overcoming the intrinsic challenges of data imperfections-e.g., missing data, outliers, and imbalanced information. The framework includes a two-stage data-driven modeling technique that efficiently models the highly right-skewed sewer system failure data to predict the failure risk, leveraging a bidimensional space-time approach. We implemented our analysis for Bogota, the capital city of Colombia. We train, test, and validate a battery of machine learning algorithms-logistic regression, decision trees, random forests, and XGBoost-and select the best model in terms of goodness-of-fit and predictive accuracy. Finally, we illustrate the applicability of the framework in planning/scheduling sewer system maintenance operations using state-of-the-art optimization techniques. Our proposed framework can help stakeholders to analyze the failure-risk models' performance under different discrimination thresholds, and provide managerial insights on the model's adequate spatial resolution and appropriateness of decentralized management for sewer system maintenance.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    95
    References
    5
    Citations
    NaN
    KQI
    []