SIREn: Entity Retrieval System for the Web of Data

We present ongoing work on the Semantic Information Retrieval Engine (SIREn), an “entity retrieval system” specifically designed to meet the requirements of indexing and searching a large amount of semi-structured data, e.g. the entire Web of Data. SIREn supports efficient full text search with semi-structural queries and exhibits a concise index, constant time updates and inherits Information Retrieval features such as top-k queries, efficient caching and scalability via distribution over shards. We demonstrate how SIREn can effectively answer queries over 10 billion triples on single commodity machine. The prototype is currently in use in the Sindice search engine which index at the present time more than 50 million harvested documents containing semi-structured data.

Content

Author and article information

Contributors

Renaud Delbru

Conference

Publication date: September 2009

Publication date (Print): September 2009

Pages: 29-35

Affiliations

[0001]Digital Enterprise Research Institute

National University of Ireland

Galway, Ireland

Article

DOI: 10.14236/ewic/FDIA2009.6

SO-VID: a2cd55d1-e98c-41cf-9043-b10b7791a37f

License:

This work is licensed under a Creative Commons Attribution 4.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Conference name: Third BCS-IRSG Symposium on Future Directions in Information Access (FDIA 2009)

Conference acronym: FDIA

Conference number: 3

Conference location: Padua, Italy

Conference date: 1 September 2009

Conference sponsor: Electronic Workshops in Computing (eWiC)

Conference theme: Computers XXIII Celebrating People and Technology

History

Product

1477-9358 BCS Learning & Development

Self URI (article page): https://www.scienceopen.com/hosted-document?doi=10.14236/ewic/FDIA2009.6

Self URI (journal page): https://ewic.bcs.org/

Celebrating 65 years of The Computer Journal - free-to-read perspectives - bcs.org/tcj65

SIREn: Entity Retrieval System for the Web of Data

Abstract