Prediction-based outlier detection with explanations

2012 
General outlier detection strategies, be a distribution-based, clustering-based, or distance-based method, all resort to the comparison among instances to define abnormality. In this paper we introduce an additional dimension into the outlier definition. That is, we not only consider externally how one instance differs from others but internally the dependency and abnormality among its own attributes, denoted as the prediction-based outlier detection. Prediction-based outliers possess certain attributes which are difficult to be predicted based on the neighborhood information. Furthermore, we propose three neighborhood functions to generate predictions. Finally, acknowledging the lack of the gold standard to evaluate an outlier detection system, we propose four general evaluation strategies. Experiments conducted on several real-world datasets demonstrate the validity, novelty, power-law distribution, and robustness of our method.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    15
    References
    0
    Citations
    NaN
    KQI
    []