8
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      An Integration-Oriented Ontology to Govern Evolution in Big Data Ecosystems

      Preprint

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Big Data architectures allow to flexibly store and process heterogeneous data, from multiple sources, in their original format. The structure of those data, commonly supplied by means of REST APIs, is continuously evolving. Thus data analysts need to adapt their analytical processes after each API release. This gets more challenging when performing an integrated or historical analysis. To cope with such complexity, in this paper, we present the Big Data Integration ontology, the core construct to govern the data integration process under schema evolution by systematically annotating it with information regarding the schema of the sources. We present a query rewriting algorithm that, using the annotated ontology, converts queries posed over the ontology to queries over the sources. To cope with syntactic evolution in the sources, we present an algorithm that semi-automatically adapts the ontology upon new releases. This guarantees ontology-mediated queries to correctly retrieve data from the most recent schema version as well as correctness in historical queries. A functional and performance evaluation on real-world APIs is performed to validate our approach.

          Related collections

          Most cited references11

          • Record: found
          • Abstract: not found
          • Conference Proceedings: not found

          Data integration

            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Data exchange: semantics and query answering

              Bookmark
              • Record: found
              • Abstract: not found
              • Book Chapter: not found

              Linking Data to Ontologies

                Bookmark

                Author and article information

                Journal
                16 January 2018
                Article
                1801.05161
                77ee9f73-6137-4090-aee8-459eaf3274c5

                http://arxiv.org/licenses/nonexclusive-distrib/1.0/

                History
                Custom metadata
                Preprint submitted to Information Systems. 35 pages
                cs.DB

                Comments

                Comment on this article