1
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Towards Long-term and Archivable Reproducibility

      Preprint

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Reproducible workflow solutions commonly use high-level technologies that were popular when they were created, providing an immediate solution which is unlikely to be sustainable in the long term. We therefore introduce a set of criteria to address this problem and demonstrate their practicality and implementation. The criteria have been tested in several research publications and can be summarized as: completeness (no dependency beyond a POSIX-compatible operating system, no administrator privileges, no network connection and storage primarily in plain text); modular design; minimal complexity; scalability; verifiable inputs and outputs; temporal provenance; linking analysis with narrative; and free-and-open-source software. As a proof of concept, we have implemented "Maneage", a solution which stores the project in machine-actionable and human-readable plain-text, enables version-control, cheap archiving, automatic parsing to extract data provenance, and peer-reviewable verification. We show that requiring longevity of a reproducible workflow solution is realistic, without sacrificing immediate or short-term reproducibility and discuss the benefits of the criteria for scientific progress. This paper has itself been written in Maneage, with snapshot 1637cce.

          Related collections

          Author and article information

          Journal
          04 June 2020
          Article
          2006.03018
          223d6044-b13b-4aff-978b-8bb87b8b3f53

          http://creativecommons.org/licenses/by-sa/4.0/

          History
          Custom metadata
          The downloadable source (on arXiv) includes the full reproduction info (scripts, config files and input data links) and can reproduce the paper automatically. Supplementary datasets and source also available on Zenodo.3872248: https://doi.org/10.5281/zenodo.3872248
          cs.DL

          Information & Library science
          Information & Library science

          Comments

          Comment on this article