SShare: a simulator for studying and evaluating decentralized SPARQL query processing

2015 
Previously, we proposed efficient, scalable decentralized processing of SPARQL queries for an ad hoc Semantic Web data sharing system and explored optimization techniques. However, it has proven to be difficult to measure the performance of the proposed query processing in a decentralized setting with existing tools. This is because assessments on SPARQL query performance were typically targeted at a centralized or single-machine settings, and node-to-node communication costs occurring when (sub-)queries were forwarded among multiple nodes have rarely been taken into consideration. We hereby developed a simulator, SShare, that bridges Jena, a Java framework that supports querying RDF data with SPARQL, and ns-3 (network simulator 3), a discrete-event network simulator using C++ and Python. With SShare, one can submit any proper SPARQL query that involves RDF data of interest scattered around distributed hosts (the details of which are unknown to the query initiator), evaluate important performance metrics (e.g., the inter-site data transmission volume and communication delay) obtained at the network level, and finally get visualized results. We anticipated that SShare would be beneficial to others who are keen on better capturing and analyzing the inherent feature of various distributed and decentralized SPARQL processing mechanisms over a large-scale network.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    14
    References
    0
    Citations
    NaN
    KQI
    []