8
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Kernel methods and their derivatives: Concept and perspectives for the earth system sciences

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Kernel methods are powerful machine learning techniques which use generic non-linear functions to solve complex tasks. They have a solid mathematical foundation and exhibit excellent performance in practice. However, kernel machines are still considered black-box models as the kernel feature mapping cannot be accessed directly thus making the kernels difficult to interpret. The aim of this work is to show that it is indeed possible to interpret the functions learned by various kernel methods as they can be intuitive despite their complexity. Specifically, we show that derivatives of these functions have a simple mathematical formulation, are easy to compute, and can be applied to various problems. The model function derivatives in kernel machines is proportional to the kernel function derivative and we provide the explicit analytic form of the first and second derivatives of the most common kernel functions with regard to the inputs as well as generic formulas to compute higher order derivatives. We use them to analyze the most used supervised and unsupervised kernel learning methods: Gaussian Processes for regression, Support Vector Machines for classification, Kernel Entropy Component Analysis for density estimation, and the Hilbert-Schmidt Independence Criterion for estimating the dependency between random variables. For all cases we expressed the derivative of the learned function as a linear combination of the kernel function derivative. Moreover we provide intuitive explanations through illustrative toy examples and show how these same kernel methods can be applied to applications in the context of spatio-temporal Earth system data cubes. This work reflects on the observation that function derivatives may play a crucial role in kernel methods analysis and understanding.

          Related collections

          Most cited references52

          • Record: found
          • Abstract: not found
          • Article: not found

          A tutorial on support vector regression

            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Theory of reproducing kernels

              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found
              Is Open Access

              GLEAM v3: satellite-based land evaporation and root-zone soil moisture

              The Global Land Evaporation Amsterdam Model (GLEAM) is a set of algorithms dedicated to the estimation of terrestrial evaporation and root-zone soil moisture from satellite data. Ever since its development in 2011, the model has been regularly revised, aiming at the optimal incorporation of new satellite-observed geophysical variables, and improving the representation of physical processes. In this study, the next version of this model (v3) is presented. Key changes relative to the previous version include (1) a revised formulation of the evaporative stress, (2) an optimized drainage algorithm, and (3) a new soil moisture data assimilation system. GLEAM v3 is used to produce three new data sets of terrestrial evaporation and root-zone soil moisture, including a 36-year data set spanning 1980–2015, referred to as v3a (based on satellite-observed soil moisture, vegetation optical depth and snow-water equivalent, reanalysis air temperature and radiation, and a multi-source precipitation product), and two satellite-based data sets. The latter share most of their forcing, except for the vegetation optical depth and soil moisture, which are based on observations from different passive and active C- and L-band microwave sensors (European Space Agency Climate Change Initiative, ESA CCI) for the v3b data set (spanning 2003–2015) and observations from the Soil Moisture and Ocean Salinity (SMOS) satellite in the v3c data set (spanning 2011–2015). Here, these three data sets are described in detail, compared against analogous data sets generated using the previous version of GLEAM (v2), and validated against measurements from 91 eddy-covariance towers and 2325 soil moisture sensors across a broad range of ecosystems. Results indicate that the quality of the v3 soil moisture is consistently better than the one from v2: average correlations against in situ surface soil moisture measurements increase from 0.61 to 0.64 in the case of the v3a data set and the representation of soil moisture in the second layer improves as well, with correlations increasing from 0.47 to 0.53. Similar improvements are observed for the v3b and c data sets. Despite regional differences, the quality of the evaporation fluxes remains overall similar to the one obtained using the previous version of GLEAM, with average correlations against eddy-covariance measurements ranging between 0.78 and 0.81 for the different data sets. These global data sets of terrestrial evaporation and root-zone soil moisture are now openly available at www.GLEAM.eu and may be used for large-scale hydrological applications, climate studies, or research on land–atmosphere feedbacks.
                Bookmark

                Author and article information

                Contributors
                Role: ConceptualizationRole: Data curationRole: Formal analysisRole: MethodologyRole: ResourcesRole: SoftwareRole: Validation
                Role: ConceptualizationRole: Formal analysisRole: InvestigationRole: SupervisionRole: ValidationRole: Visualization
                Role: ConceptualizationRole: Formal analysisRole: InvestigationRole: SoftwareRole: Validation
                Role: ConceptualizationRole: Data curationRole: ResourcesRole: Supervision
                Role: ConceptualizationRole: Formal analysisRole: Funding acquisitionRole: InvestigationRole: ResourcesRole: SupervisionRole: Validation
                Role: Editor
                Journal
                PLoS One
                PLoS One
                plos
                plosone
                PLoS ONE
                Public Library of Science (San Francisco, CA USA )
                1932-6203
                2020
                29 October 2020
                : 15
                : 10
                : e0235885
                Affiliations
                [001] Image Processing Laboratory, Universitat de València, València, Spain
                University of Bradford, UNITED KINGDOM
                Author notes

                Competing Interests: The authors have declared that no competing interests exist.

                [¤]

                Current address: Max Planck Institute for Biogeochemistry, Jena, Germany

                Author information
                https://orcid.org/0000-0002-6739-0053
                https://orcid.org/0000-0002-8258-4454
                Article
                PONE-D-19-17584
                10.1371/journal.pone.0235885
                7595302
                33119617
                e6cb42f6-7942-4c92-ac02-d67011fb59a0
                © 2020 Johnson et al

                This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

                History
                : 10 July 2019
                : 8 June 2020
                Page count
                Figures: 12, Tables: 4, Pages: 30
                Funding
                Funded by: funder-id http://dx.doi.org/10.13039/501100000781, European Research Council;
                Award ID: 647423
                Award Recipient :
                GCV 647423 European Research Council https://erc.europa.eu/ The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
                Categories
                Research Article
                Physical Sciences
                Mathematics
                Operator Theory
                Kernel Functions
                Physical Sciences
                Mathematics
                Applied Mathematics
                Algorithms
                Kernel Methods
                Research and Analysis Methods
                Simulation and Modeling
                Algorithms
                Kernel Methods
                Computer and Information Sciences
                Artificial Intelligence
                Machine Learning
                Support Vector Machines
                Computer and Information Sciences
                Artificial Intelligence
                Machine Learning
                Earth Sciences
                Geography
                Physical Geography
                Earth Systems
                Physical Sciences
                Mathematics
                Algebra
                Linear Algebra
                Eigenvectors
                Physical Sciences
                Mathematics
                Probability Theory
                Random Variables
                Computer and Information Sciences
                Data Management
                Data Visualization
                Custom metadata
                All toy example code is reproducible and is available at: https://github.com/IPL-UV/sakame All applied data is open source and available at: https://www.earthsystemdatalab.net/.

                Uncategorized
                Uncategorized

                Comments

                Comment on this article