Blog
About


16
views
3
recommends
+1 Recommend
0
shares
  • Review: found
Is Open Access

Review of 'Data preservation at the Fermilab Tevatron'

Bookmark
3
A concise description of data preservation challenges in a specific case
Average rating:
    Rated 3.5 of 5.
Level of importance:
    Rated 3 of 5.
Level of validity:
    Rated 3 of 5.
Level of completeness:
    Rated 4 of 5.
Level of comprehensibility:
    Rated 3 of 5.
Competing interests:
None

Reviewed article

  • Record: found
  • Abstract: found
  • Article: found
Is Open Access

Data preservation at the Fermilab Tevatron

The Fermilab Tevatron collider's data-taking run ended in September 2011, yielding a dataset with rich scientific potential. The CDF and D0 experiments each have approximately 9 PB of collider and simulated data stored on tape. A large computing infrastructure consisting of tape storage, disk cache, and distributed grid computing for physics analysis with the Tevatron data is present at Fermilab. The Fermilab Run II data preservation project intends to keep this analysis capability sustained through the year 2020 and beyond. To achieve this goal, we have implemented a system that utilizes virtualization, automated validation, and migration to new standards in both software and data storage technology and leverages resources available from currently-running experiments at Fermilab. These efforts have also provided useful lessons in ensuring long-term data access for numerous experiments, and enable high-quality scientific output for years to come.
    Bookmark

    Review information

    10.14293/S2199-1006.1.SOR-PHYS.A1701.07773.v1.RHUVIH
    ScienceOpen disciplines:

    Review text

    This article describes the challenges faced by a large physics experiment at one of the premier facilities in the world for physics research. The issues and measures described apply to the need to preserve data beyond the lifetime of the active data-taking period, and beyond the average lifetime of participation of individuals in the collaborations.

    Aspects of preservation of data consistency as well as software and operating environments are considered.

    This article serves a s a good reference for those involved in planning data preservation and related infrastructures, and may inform certain important decisions made.

    Comments

    Comment on this review