An Efficient Approach for Filling Incomplete Data

P. M. Kiran,A. Prakash Rao,B. Ratnamala

An Efficient Approach for Filling Incomplete Data

2012

P. M. Kiran
A. Prakash Rao
B. Ratnamala

Good data preparation is a key prerequisite to successful data mining. Conventional wisdom suggests that data preparation takes about 60 to 80% of the time involved in a data mining exercise. There have been good reviews of the problems associated with data preparation. However the data preprocessing is a crucial step used for variety of data warehousing and mining. Real world data is noisy and can often suffer from corruptions or incomplete values that may impact the models created from the data. Accuracy of any mining algorithm greatly depends on the input datasets. In this paper we describe a novel idea of predicting the missing values in the dataset by a well known principle of Maximum likelihood EM (Expectation

Keywords:

Missing data
Conventional wisdom
Maximum likelihood
Data pre-processing
Data preparation
Data warehouse
Data mining
Computer science
real world data
data mining algorithm

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations