A Communication-Efficient Distributed Data Structure for Top- k and k -Select Queries

2017 
We consider the scenario of n sensor nodes observing streams of data. The nodes are connected to a central server whose task it is to compute some function over all data items observed by the nodes. In our case, there exists a total order on the data items observed by the nodes. Our goal is to compute the k currently lowest observed values or a value with rank in \([(1-\varepsilon )k,(1+\varepsilon )k]\) with probability \((1-\delta )\). We propose solutions for these problems in an extension of the distributed monitoring model where the server can send broadcast messages to all nodes for unit cost. We want to minimize communication over multiple time steps where there are m updates to a node’s value in between queries. The result is composed of two main parts, which each may be of independent interest: 1. Protocols which answer Top-\(k\) and \(k\)-Select queries. These protocols are memoryless in the sense that they gather all information at the time of the request. 2. A dynamic data structure which tracks for every k an element close to k.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    21
    References
    1
    Citations
    NaN
    KQI
    []