Shannon sampling and nonlinear dynamics on graphs for representation, regularization and visualization of complex data

2010 
Data is now produced faster than it can be meaningfully analyzed. Many modern data sets present unprecedented analytical challenges, not merely because of their size but by their inherent complexity and information richness. Large numbers of astronomical objects now have dozens or hundreds of useful parameters describing each one. Traditional color-color plots using a limited number of symbols and some color-coding are clearly inadequate for finding all useful correlations given such large numbers of parameters. To capitalize on the opportunities provided by these data sets one needs to be able to organize, analyze and visualize them in fundamentally new ways. The identification and extraction of useful information in multiparametric, high-dimensional data sets - data mining - is greatly facilitated by finding simpler, that is, lower-dimensional abstract mathematical representations of the data sets that are more amenable to analysis. Dimensionality reduction consists of finding a lower-dimensional representation of high-dimensional data by constructing a set of basis functions that capture patterns intrinsic to a particular state space. Traditional methods of dimension reduction and pattern recognition often fail to work well when performed upon data sets as complex as those that now confront astronomy. We present here our developments of data compression, sampling, nonlinear dimensionality reduction, and clustering, which are important steps in the analysis of large-scale, complex datasets.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    2
    References
    0
    Citations
    NaN
    KQI
    []