Exploiting clustering algorithms in a multiple-level fashion

2016 
A two-phase methodology to analyze real-world data collections.Cluster content characterized through sequential patterns.A mobile application to allow ubiquitous data classification.Evaluation in the health care scenario using a collection of diabetic patients. Clustering real-world data is a challenging task, since many real-data collections are characterized by an inherent sparseness and variable distribution. An appealing domain that generates such data collections is the medical care scenario where collected data include a large cardinality of patient records and a variety of medical treatments usually adopted for a given disease pathology.This paper proposes a two-phase data mining methodology (MLC) to iteratively analyze different dataset portions and locally identify groups of objects with common properties. Discovered cohesive clusters are then analyzed using sequential patterns to characterize temporal relationships among data features. To support an automatic classification of new data objects within one of the discovered groups, a classification model is created starting from the computed cluster set. A mobile application has been also designed and developed to visualize and update data under analysis as well as categorizing new unlabeled data objects.The experimental evaluation conducted on real datasets in the medical care scenario showed the effectiveness of MLC to discover interesting knowledge items and to easily exploit them through a mobile application. Results have been also discussed from a medical perspective.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    30
    References
    11
    Citations
    NaN
    KQI
    []