18
views
0
recommends
+1 Recommend
1 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Liberating links between datasets using lightweight data publishing: an example using plant names and the taxonomic literature

      research-article
      1 ,
      Biodiversity Data Journal
      Pensoft Publishers
      datasette, linked data, data publishing, biodiversity knowledge graph, taxonomic names

      Read this article at

      ScienceOpenPublisherPMC
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Abstract

          Constructing a biodiversity knowledge graph will require making millions of cross links between diversity entities in different datasets. Researchers trying to bootstrap the growth of the biodiversity knowledge graph by constructing databases of links between these entities lack obvious ways to publish these sets of links. One appealing and lightweight approach is to create a "datasette", a database that is wrapped together with a simple web server that enables users to query the data. Datasettes can be packaged into Docker containers and hosted online with minimal effort. This approach is illustrated using a dataset of links between globally unique identifiers for plant taxonomic namesand identifiers for the taxonomic articles that published those names.

          Related collections

          Most cited references20

          • Record: found
          • Abstract: not found
          • Article: not found

          Wikidata

            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            The anatomy of a nanopublication

            As the amount of scholarly communication increases, it is increasingly difficult for specific core scientific statements to be found, connected and curated. Additionally, the redundancy of these statements in multiple fora makes it difficult to determine attribution, quality and provenance. To tackle these challenges, the Concept Web Alliance has promoted the notion of nanopublications (core scientific statements with associated context). In this document, we present a model of nanopublications along with a Named Graph/RDF serialization of the model. Importantly, the serialization is defined completely using already existing community-developed technologies. Finally, we discuss the importance of aggregating nanopublications and the role that the Concept Wiki plays in facilitating it.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              Taxonomic information exchange and copyright: the Plazi approach

              Background A large part of our knowledge on the world's species is recorded in the corpus of biodiversity literature with well over hundred million pages, and is represented in natural history collections estimated at 2 – 3 billion specimens. But this body of knowledge is almost entirely in paper-print form and is not directly accessible through the Internet. For the digitization of this literature, new territories have to be chartered in the fields of technical, legal and social issues that presently impede its advance. The taxonomic literature seems especially destined for such a transformation. Discussion Plazi was founded as an association with the primary goal of transforming both the printed and, more recently, "born-digital" taxonomic literature into semantically enabled, enhanced documents. This includes the creation of a test body of literature, an XML schema modeling its logic content (TaxonX), the development of a mark-up editor (GoldenGATE) allowing also the enhancement of documents with links to external resources via Life Science Identifiers (LSID), a repository for publications and issuance of bibliographic identifiers, a dedicated server to serve the marked up content (the Plazi Search and Retrieval Server, SRS) and semantic tools to mine information. Plazi's workflow is designed to respect copyright protection and achieves extraction by observing exceptions and limitations existent in international copyright law. Conclusion The information found in Plazi's databases – taxonomic treatments as well as the metadata of the publications – are in the public domain and can therefore be used for further scientific research without any restriction, whether or not contained in copyrighted publications.
                Bookmark

                Author and article information

                Contributors
                Journal
                Biodivers Data J
                Biodivers Data J
                Biodiversity Data Journal
                Biodiversity Data Journal
                Biodiversity Data Journal
                Pensoft Publishers
                1314-2828
                2018
                23 July 2018
                : 6
                Affiliations
                [1 ] University of Glasgow, Glasgow, United Kingdom University of Glasgow Glasgow United Kingdom
                Author notes
                Corresponding author: Roderic Page ( rdmpage@ 123456gmail.com ).

                Academic editor: Anne Thessen

                Article
                Biodiversity Data Journal 9918
                10.3897/BDJ.6.e27539
                6066477
                b792d3c8-b8f0-4033-b7b0-2ebecc59358e
                Roderic Page

                This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

                Page count
                Figures: 1, Tables: 1, References: 23
                Categories
                Forum Paper
                Plantae
                Bioinformatics
                Data Management
                Nomenclature

                datasette,linked data,data publishing,biodiversity knowledge graph,taxonomic names

                Comments

                Comment on this article