Fast Binary Embedding for High-Dimensional Data

2015 
Binary embedding of high-dimensional data requires long codes to preserve the discriminative power of the input space. Traditional binary coding methods often suffer from very high computation and storage costs in such a scenario. To address this problem, we propose two solutions which improve over existing approaches. The first method, Bilinear Binary Embedding (BBE), converts high-dimensional data to compact similarity-preserving binary codes using compact bilinear projections . Compared to methods that use unstructured matrices for projection, it improves the time complexity from \(\mathcal {O}(d^2)\) to \(\mathcal {O}(d^{1.5})\), and the space complexity from \(\mathcal {O}(d^2)\) to \(\mathcal {O}(d)\) where \(d\) is the input dimensionality. The second method, Circulant Binary Embedding (CBE), generates binary codes by projecting the data with a circulant matrix . The circulant structure enables the use of Fast Fourier Transformation to speed up the computation. This further improves the time complexity to \(\mathcal {O}(d\log {d})\). For both BBE and CBE, we propose to learn the projections in a data-dependent fashion. We show by extensive experiments that the proposed approaches give much better performance than the state of the arts for fixed time, and provides much faster computation with no performance degradation for fixed number of bits. The BBE and CBE methods were previously presented in [6, 38]. In this book chapter, we present the two approaches in a unified framework, covering randomized binary embedding, learning-based binary embedding , and learning with dimension reductions. We also discuss the choice of algorithms.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    42
    References
    1
    Citations
    NaN
    KQI
    []