Taming Subgraph Isomorphism for RDF Query Processing

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

RDF data are used to model knowledge in various areas such as life sciences, Semantic Web, bioinformatics, and social graphs. The size of real RDF data reaches billions of triples. This calls for a framework for efficiently processing RDF data. The core function of processing RDF data is subgraph pattern matching. There have been two completely different directions for supporting efficient subgraph pattern matching. One direction is to develop specialized RDF query processing engines exploiting the properties of RDF data for the last decade, while the other direction is to develop efficient subgraph isomorphism algorithms for general, labeled graphs for over 30 years. Although both directions have a similar goal (i.e., finding subgraphs in data graphs for a given query graph), they have been independently researched without clear reason. We argue that a subgraph isomorphism algorithm can be easily modified to handle the graph homomorphism, which is the RDF pattern matching semantics, by just removing the injectivity constraint. In this paper, based on the state-of-the-art subgraph isomorphism algorithm, we propose an in-memory solution, TurboHOM++, which is tamed for the RDF processing, and we compare it with the representative RDF processing engines for several RDF benchmarks in a server machine where billions of triples can be loaded in memory. In order to speed up TurboHOM++, we also provide a simple yet effective transformation and a series of optimization techniques. Extensive experiments using several RDF benchmarks show that TurboHOM++ consistently and significantly outperforms the representative RDF engines. Specifically, TurboHOM++ outperforms its competitors by up to five orders of magnitude.

Related collections

Most cited references 17

Record: found
Abstract: not found
Article: not found

An Algorithm for Subgraph Isomorphism

J. R. Ullmann (1976)

0 comments Cited 208 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

A (sub)graph isomorphism algorithm for matching large graphs.

Pasquale Foggia, C Sansone, Luigi Cordella … (2004)

We present an algorithm for graph isomorphism and subgraph isomorphism suited for dealing with large graphs. A first version of the algorithm has been presented in a previous paper, where we examined its performance for the isomorphism of small and medium size graphs. The algorithm is improved here to reduce its spatial complexity and to achieve a better performance on large graphs; its features are analyzed in detail with special reference to time and memory requirements. The results of a testing performed on a publicly available database of synthetically generated graphs and on graphs relative to a real application dealing with technical drawings are presented, confirming the effectiveness of the approach, especially when working with large graphs.

0 comments Cited 161 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: not found
Article: not found

LUBM: A benchmark for OWL knowledge base systems

Jeff Heflin, Zhengxiang Pan, Yuanbo Guo (2005)

0 comments Cited 97 times – based on 0 reviews      Review now

Bookmark

All references

Author and article information

Journal

Publication date Created: 2015-06-05

Publication date Updated: 2015-06-10

Article

ArXiV ID: 1506.01973

SO-VID: 28bd7625-2deb-477a-9dd4-d3984dea6455

License:

http://arxiv.org/licenses/nonexclusive-distrib/1.0/

History

Custom metadata

Categories cs.DB

ScienceOpen disciplines: Databases

Data availability:

ScienceOpen disciplines: Databases

Taming Subgraph Isomorphism for RDF Query Processing

Read this article at

Abstract

Related collections

Radiology and Natural Language Processing

Most cited references 17

An Algorithm for Subgraph Isomorphism

A (sub)graph isomorphism algorithm for matching large graphs.

LUBM: A benchmark for OWL knowledge base systems

Author and article information

Journal

Article

History

Custom metadata

Comments

Comment on this article

Similar content 57

Cited by 1

Most referenced authors 147