Extracting sample data based on Poisson distribution

2017 
Sampling methods are in growing demand due to the rapid growth of big data applications. The term “Big Data” refers not only to large data volumes but also to the high speed of data generation, which strains many existing data mining and analytics applications because of their limited capacity to process large volumes of data for real-time analysis. Consequently, according to Cormode and Duffield, demand is increasing for sampling that generates summary data sets supporting rapid queries. State-of-the-art sampling methods have been successfully applied in various areas, including network traffic and social networks [1]. In this paper, a novel Poisson-based sampling method is introduced to provide a comprehensive data set for real-time analysis. The proposed method extends the previous Normal Distribution sampling method [2]. The experimental results demonstrate the efficiency of the proposed method.
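The abstract does not spell out the sampling procedure, so the following is only an illustrative sketch of one way a Poisson distribution could drive sample extraction. It is not the authors' algorithm. It assumes the data arrive as an ordered sequence and that the number of records skipped between two kept records is drawn from a Poisson distribution. The names `poisson_draw`, `poisson_gap_sample`, and `mean_gap` are hypothetical and do not come from the paper.

```python
import math
import random


def poisson_draw(lam, rng):
    """Draw one Poisson(lam) variate using Knuth's multiplication method."""
    threshold = math.exp(-lam)
    count, product = 0, rng.random()
    # Multiply uniform draws until the product falls to or below e^(-lam).
    while product > threshold:
        count += 1
        product *= rng.random()
    return count


def poisson_gap_sample(records, mean_gap, seed=None):
    """Keep records whose positions are 1 + Poisson(mean_gap) steps apart.

    Assumption for illustration only: mean_gap is the average number of
    records skipped between two kept records, so roughly one record in
    (1 + mean_gap) is retained.
    """
    rng = random.Random(seed)
    sample = []
    index = poisson_draw(mean_gap, rng)  # offset of the first kept record
    while index < len(records):
        sample.append(records[index])
        index += 1 + poisson_draw(mean_gap, rng)  # always advance by at least one
    return sample


data = list(range(10_000))
subset = poisson_gap_sample(data, mean_gap=9.0, seed=42)
```

With `mean_gap=9.0` the sketch keeps roughly one record in ten. The fixed seed makes the run repeatable, which helps when comparing summary data sets across experiments.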