Distributed graph query processing in dynamic networks
2012
In this paper we examine a popular network computational model (BSP: Bulk Synchronous Parallel) that has been adopted by the Google Pregel system to support large scale graph processing. We show that the synchronicity assumption made by the BSP model, while acceptable in data center like environments with strong and persistent network connectivity, can result in severe performance penalties in the context of dynamic networks. We introduce a new computational model (BAP: Bulk Asynchronous Parallel) that preserves the bulk and parallel nature of the BSP model but extends the model to asynchronous network communication. We consider two popular classes of graph queries (random walk queries and shortest path queries), present both BSP and BAP algorithms for these queries and evaluate their performance using realistic graphs datasets (DBLP and Flickr) and dynamic network datasets (Infocom06 and MIT Reality dataset). Our initial results show that in dynamic networks BAP algorithms can achieve several orders of magnitude in improvement for various QoI metrics such as accuracy and latency of (partial and complete) query evaluation.
Keywords:
- Heuristic (computer science)
- Computer science
- Computer network
- Distributed computing
- Bulk synchronous parallel
- Graph theory
- Dynamic network analysis
- Synchronicity
- Asynchronous communication
- Shortest path problem
- Random walk
- Machine learning
- Artificial intelligence
- Algorithm design
- Telecommunications network
- Theoretical computer science
- Correction
- Source
- Cite
- Save
- Machine Reading By IdeaReader
15
References
2
Citations
NaN
KQI