Mixed feature selection in incomplete decision table

2014 
Abstract Feature selection in incomplete decision table has gained considerable attention in recently. However many feature selection methods are mainly designed for incomplete data with categorical features. In this paper, we introduce an extended rough set model, which is based on neighborhood-tolerance relation and is applicable to incomplete data with mixed categorical and numerical features. Neighborhood-tolerance conditional entropy is proposed from this model, which is an uncertainty measure and can be used to evaluate feature subset. It is known that dependency is an important feature evaluation measure based on rough set theory. The comparison and analysis of classification complexity are made between the two measures and it is indicated that neighborhood-tolerance conditional entropy is a more effective feature evaluation criterion than dependency in incomplete decision table. Then the heuristic feature selection algorithm based on neighborhood-tolerance conditional entropy is constructed. Experimental results show that our proposal is applicable and effective to incomplete mixed data.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    24
    References
    28
    Citations
    NaN
    KQI
    []