A Hybrid Approach for Preprocessing of Imbalanced Data in Credit Scoring Systems

Uma R. Salunkhe,Suresh N. Mali

A Hybrid Approach for Preprocessing of Imbalanced Data in Credit Scoring Systems

2018

Uma R. Salunkhe
Suresh N. Mali

During the last few years, classification task in machine learning is commonly used by various real-life applications. One of the common applications is credit scoring systems where the ability to accurately predict creditworthy or non-creditworthy applicants is critically important because incorrect predictions can cause major financial loss. In this paper, we aim to focus on skewed data distribution issue faced by credit scoring system. To reduce the imbalance between the classes, we apply preprocessing on the dataset which makes combined use of random re-sampling and dimensionality reduction. Experimental results on Australian and German credit datasets with the presented preprocessing technique has shown significant performance improvement in terms of AUC and F-measure.

Keywords:

Dimensionality reduction
Preprocessor
Performance improvement
Computer science
Pattern recognition
Artificial intelligence
skewed data
imbalanced data
combined use
hybrid approach
Machine learning
scoring system

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations