Skip to:

e-Science 2008 4th IEEE International Conference on e-Science

Main Conference Sessions

Scalable Semantics—The Silver Lining of Cloud Computing

Authors

  • Andrew Newman, University of Queensland
  • Yuan-Fang Li, The University of Queensland
  • Jane Hunter, University of Queensland

Abstract

Semantic inferencing and querying across large-scale RDF triple stores is notoriously slow. Our objective is to expedite this process by employing Google’s MapReduce framework to implement scale-out distributed querying and reasoning. This approach requires RDF graphs to be decomposed into smaller units that are distributed across computational nodes. RDF molecules appear to offer an ideal approach, providing an intermediate level of granularity between RDF graphs and triples. However, the original RDF molecule definition has inherent limitations that will adversely affect performance. In this paper, we propose a number of extensions to RDF molecules (hierarchy and ordering) to overcome these limitations. We then present some implementation details for our MapReduce-based RDF molecule store. Finally we evaluate the benefits of our approach in the context of the BioMANTA project, an application that requires integration and querying across large-scale protein-protein interaction datasets.

Date and Time

Wednesday, December 10, 10:45 a.m. to 11:15 a.m.

Room Number

208

More Information

Show your support for e-Science 2008

Add one of our badges to your site:

  • Teal eScience 2008 Web badge
  • Green eScience 2008 Web badge
  • Orange eScience 2008 Web badge