
      Building a multi-scaled geospatial temporal ecology database from disparate data sources: fostering open science and data reuse

      research-article


          Abstract

          Although there are considerable site-based data for individual or groups of ecosystems, these datasets are widely scattered, have different data formats and conventions, and often have limited accessibility. At the broader scale, national datasets exist for a large number of geospatial features of land, water, and air that are needed to fully understand variation among these ecosystems. However, such datasets originate from different sources and have different spatial and temporal resolutions. By taking an open-science perspective and by combining site-based ecosystem datasets and national geospatial datasets, science gains the ability to ask important research questions related to grand environmental challenges that operate at broad scales. Documentation of such complicated database integration efforts, through peer-reviewed papers, is recommended to foster reproducibility and future use of the integrated database. Here, we describe the major steps, challenges, and considerations in building an integrated database of lake ecosystems, called LAGOS (LAke multi-scaled GeOSpatial and temporal database), that was developed at the sub-continental study extent of 17 US states (1,800,000 km²). LAGOS includes two modules: LAGOS GEO, with geospatial data on every lake with surface area larger than 4 ha in the study extent (~50,000 lakes), including climate, atmospheric deposition, land use/cover, hydrology, geology, and topography measured across a range of spatial and temporal extents; and LAGOS LIMNO, with lake water quality data compiled from ~100 individual datasets for a subset of lakes in the study extent (~10,000 lakes). Procedures for the integration of datasets included: creating a flexible database design; authoring and integrating metadata; documenting data provenance; quantifying spatial measures of geographic data; quality-controlling integrated and derived data; and extensively documenting the database.
Our procedures make a large, complex, and integrated database reproducible and extensible, allowing users to ask new research questions with the existing database or through the addition of new data. The largest challenge of this task was the heterogeneity of the data, formats, and metadata. Many steps of data integration need manual input from experts in diverse fields, requiring close collaboration.

          Electronic supplementary material

          The online version of this article (doi:10.1186/s13742-015-0067-4) contains supplementary material, which is available to authorized users.


                Author and article information

                Contributors
                soranno@anr.msu.edu
                bissell3@msu.edu
                ksc@msu.edu
                schristel@wisc.edu
                colli636@msu.edu
                fergusca@msu.edu
                filstrup@iastate.edu
                jfrancoislapierre@gmail.com
                nrlottig@gmail.com
                skoliver@wisc.edu
                scottca6@msu.edu
                nicole.j.smith@gmail.com
                stopyak.scott@gmail.com
                yuanshu2@msu.edu
                bremigan@msu.edu
                downing@iastate.edu
                cgries@wisc.edu
                emily.henry@oregonstate.edu
                Nicholas.skaff@gmail.com
                ehstanley@wisc.edu
                craig.stow@noaa.gov
                ptan@cse.msu.edu
                txw19@psu.edu
                websterk@tcd.ie
                Journal
                GigaScience
                BioMed Central (London )
                2047-217X
                1 July 2015
                2015
                Volume: 4
                Article number: 28
                Affiliations
                Department of Fisheries and Wildlife, Michigan State University, East Lansing, MI 48824 USA
                Center for Limnology, University of Wisconsin-Madison, Madison, WI 53706 USA
                Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA 50011 USA
                Center for Limnology Trout Lake Station, University of Wisconsin-Madison, Boulder Junction, WI 54512 USA
                Oregon State University, Tillamook County, Tillamook, OR 97141 USA
                NOAA Great Lakes Laboratory, Ann Arbor, MI 48108 USA
                Department of Computer Science and Engineering, Michigan State University, East Lansing, MI 48824 USA
                US Geological Survey, Pennsylvania Cooperative Fish and Wildlife Research Unit, Pennsylvania State University, University Park, PA 16802 USA
                School of Natural Sciences, Trinity College Dublin, Dublin, Ireland
                Article
                67
                DOI: 10.1186/s13742-015-0067-4
                PMC ID: 4488039
                PubMed ID: 26140212
                7962800c-15d9-405d-8aae-48a09f546c4a
                © Soranno et al. 2015

                This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

                History
                Received: 31 December 2014
                Accepted: 9 June 2015
                Categories
                Review

                LAGOS, integrated database, data harmonization, database documentation, data reuse, data sharing, ecoinformatics, macrosystems ecology, landscape limnology, water quality
