Feature extraction based on principal component analysis for text categorization

2017 
Over the past 20 years, data has increased in a large scale in various fields. Internet of Things (IoT), for instance, comprises billions of devices and the data streams coming from these devices challenge the traditional approaches to data management and contribute to the emerging paradigm of big data. To be able to handle such data adequately, it is necessary to reduce their dimensionality to a size more compatible with the resolution methods, even if this reduction can lead to a slight loss of information. The aim of this paper is to study the potential of dimensionality reduction in text categorization of a publicly available dataset CNAE-9.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    32
    References
    6
    Citations
    NaN
    KQI
    []