New Hybrid Method for Efficient Imputation of Discrete Missing Attributes

2021 
In this paper, we present a hybrid method for efficiently estimating missing discrete attributes that arise during data manipulation or processing. The method first determines the segment to which the missing value belongs and then estimates the value by majority vote when possible. Otherwise, the average of the missing attribute is computed from the complete data of the segment. Several cases may arise. In the first case, the non-missing attributes have the same modality (they lie in the same interval); it is handled by calculating the centre M of the class and the average m of the non-missing attributes. If m is less than M, the value e of the missing attribute is estimated by the value of the non-missing attribute that lies within the interval [a, M), where a is the lower bound of the modality. Otherwise, the value of the other non-missing attribute is used for the estimation. In the second case, the non-missing attributes have different modalities; it is handled by calculating the average m of the non-missing attributes and then estimating the missing value e by the non-missing attribute that has the same modality as m. Finally, an error test based on the root-mean-square error (RMSE) demonstrates the effectiveness of our method.
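The segment-level step and the error test can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names impute_in_segment and rmse are hypothetical, the segment is assumed to be already known, "when possible" is assumed to mean that the segment has a unique most frequent value, and the fallback average is rounded so that the estimate stays discrete. The case rules based on modalities are not covered here.

```python
from collections import Counter
from math import sqrt


def impute_in_segment(observed):
    """Estimate a missing discrete value from the complete values of its segment.

    `observed` holds the values of the same attribute in the complete
    records of the segment (hypothetical input layout).
    """
    if not observed:
        raise ValueError("segment has no complete values")
    counts = Counter(observed).most_common()
    # Assumption: a majority vote is "possible" only when the mode is unique.
    if len(counts) == 1 or counts[0][1] > counts[1][1]:
        return counts[0][0]
    # Otherwise fall back to the segment average, rounded to an integer
    # (Python's round() sends exact halves to the nearest even integer).
    return round(sum(observed) / len(observed))


def rmse(true_values, estimates):
    """Root-mean-square error between true and estimated values."""
    if len(true_values) != len(estimates) or not true_values:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = [(t - e) ** 2 for t, e in zip(true_values, estimates)]
    return sqrt(sum(squared) / len(squared))


if __name__ == "__main__":
    print(impute_in_segment([2, 3, 3, 4]))  # unique mode, prints 3
    print(impute_in_segment([1, 1, 5, 5]))  # tie, mean is 3.0, prints 3
    print(rmse([3, 3], [3, 5]))             # sqrt(2)
```

A lower RMSE on values that were deliberately hidden and then re-estimated indicates a better imputation, which is how an error test of this kind is typically read.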