MaxEnt’s parameter configuration and small samples: are we paying attention to recommendations? A systematic review

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

Environmental niche modeling (ENM) is commonly used to develop probabilistic maps of species distribution. Among available ENM techniques, MaxEnt has become one of the most popular tools for modeling species distribution, with hundreds of peer-reviewed articles published each year. MaxEnt’s popularity is mainly due to the use of a graphical interface and automatic parameter configuration capabilities. However, recent studies have shown that using the default automatic configuration may not be always appropriate because it can produce non-optimal models; particularly when dealing with a small number of species presence points. Thus, the recommendation is to evaluate the best potential combination of parameters (feature classes and regularization multiplier) to select the most appropriate model. In this work we reviewed 244 articles published between 2013 and 2015 to assess whether researchers are following recommendations to avoid using the default parameter configuration when dealing with small sample sizes, or if they are using MaxEnt as a “black box tool.” Our results show that in only 16% of analyzed articles authors evaluated best feature classes, in 6.9% evaluated best regularization multipliers, and in a meager 3.7% evaluated simultaneously both parameters before producing the definitive distribution model. We analyzed 20 articles to quantify the potential differences in resulting outputs when using software default parameters instead of the alternative best model. Results from our analysis reveal important differences between the use of default parameters and the best model approach, especially in the total area identified as suitable for the assessed species and the specific areas that are identified as suitable by both modelling approaches. These results are worrying, because publications are potentially reporting over-complex or over-simplistic models that can undermine the applicability of their results. Of particular importance are studies used to inform policy making. Therefore, researchers, practitioners, reviewers and editors need to be very judicious when dealing with MaxEnt, particularly when the modelling process is based on small sample sizes.

Related collections

Most cited references 35

Record: found
Abstract: not found
Article: not found

Making better Maxentmodels of species distributions: complexity, overfitting and evaluation

Aleksandar Radosavljevic, Robert Anderson, Tâmara Miguel Araújo (2014)

0 comments Cited 417 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

Ecological niche modeling in Maxent: the importance of model complexity and the performance of model selection criteria.

René L. Warren, N Seifert (2011)

Maxent, one of the most commonly used methods for inferring species distributions and environmental tolerances from occurrence data, allows users to fit models of arbitrary complexity. Model complexity is typically constrained via a process known as L1 regularization, but at present little guidance is available for setting the appropriate level of regularization, and the effects of inappropriately complex or simple models are largely unknown. In this study, we demonstrate the use of information criterion approaches to setting regularization in Maxent, and we compare models selected using information criteria to models selected using other criteria that are common in the literature. We evaluate model performance using occurrence data generated from a known "true" initial Maxent model, using several different metrics for model quality and transferability. We demonstrate that models that are inappropriately complex or inappropriately simple show reduced ability to infer habitat quality, reduced ability to infer the relative importance of variables in constraining species' distributions, and reduced transferability to other time periods. We also demonstrate that information criteria may offer significant advantages over the methods commonly used in the literature.

0 comments Cited 274 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: found

Is Open Access

The Effects of Sampling Bias and Model Complexity on the Predictive Performance of MaxEnt Species Distribution Models

Mindy Syfert, Matthew Smith, David A. Coomes (2013)

Species distribution models (SDMs) trained on presence-only data are frequently used in ecological research and conservation planning. However, users of SDM software are faced with a variety of options, and it is not always obvious how selecting one option over another will affect model performance. Working with MaxEnt software and with tree fern presence data from New Zealand, we assessed whether (a) choosing to correct for geographical sampling bias and (b) using complex environmental response curves have strong effects on goodness of fit. SDMs were trained on tree fern data, obtained from an online biodiversity data portal, with two sources that differed in size and geographical sampling bias: a small, widely-distributed set of herbarium specimens and a large, spatially clustered set of ecological survey records. We attempted to correct for geographical sampling bias by incorporating sampling bias grids in the SDMs, created from all georeferenced vascular plants in the datasets, and explored model complexity issues by fitting a wide variety of environmental response curves (known as “feature types” in MaxEnt). In each case, goodness of fit was assessed by comparing predicted range maps with tree fern presences and absences using an independent national dataset to validate the SDMs. We found that correcting for geographical sampling bias led to major improvements in goodness of fit, but did not entirely resolve the problem: predictions made with clustered ecological data were inferior to those made with the herbarium dataset, even after sampling bias correction. We also found that the choice of feature type had negligible effects on predictive performance, indicating that simple feature types may be sufficient once sampling bias is accounted for. Our study emphasizes the importance of reducing geographical sampling bias, where possible, in datasets used to train SDMs, and the effectiveness and essentialness of sampling bias correction within MaxEnt.

0 comments Cited 192 times – based on 0 reviews      Review now

Bookmark

All references

Author and article information

Contributors

Narkis S. Morales

Ignacio C. Fernández

Journal

Journal ID (nlm-ta): PeerJ

Journal ID (iso-abbrev): PeerJ

Journal ID (publisher-id): peerj

Journal ID (pmc): peerj

Title: PeerJ

Publisher: PeerJ Inc. (San Francisco, USA )

ISSN (Electronic): 2167-8359

Publication date (Electronic): 14 March 2017

Publication date Collection: 2017

Volume: 5

Electronic Location Identifier: e3093

Affiliations

[1 ]Department of Biological Sciences, Faculty of Science and Engineering, Macquarie University , Sydney, New South Wales, Australia

[2 ]Fundación Ecomabi , Santiago, Región Metropolitana, Chile

[3 ]Landscape Ecology & Sustainability Laboratory, Arizona State University , Tempe, AZ, United States

[4 ]Facultad de Ciencias Biológicas, Universidad Complutense de Madrid , Madrid, Spain

Article

Publisher ID: 3093

DOI: 10.7717/peerj.3093

PMC ID: 5354112

PubMed ID: 28316894

SO-VID: a81ba24c-d50d-4fc1-80dd-785969de9891

License:

This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.

History

Date received : 16 October 2016

Date accepted : 14 February 2017

Funding

The authors received no funding for this work.

Comments

Comment on this article

scite_

Cited by 107

See all cited by

Most referenced authors 1,210

See all reference authors

MaxEnt’s parameter configuration and small samples: are we paying attention to recommendations? A systematic review

Read this article at

Abstract

Related collections

SAXS

Most cited references 35

Making better Maxentmodels of species distributions: complexity, overfitting and evaluation

Ecological niche modeling in Maxent: the importance of model complexity and the performance of model selection criteria.

The Effects of Sampling Bias and Model Complexity on the Predictive Performance of MaxEnt Species Distribution Models

Author and article information

Contributors

Journal

Affiliations

Article

History

Funding

Categories

Comments

Comment on this article

Similar content 64

Cited by 107

Most referenced authors 1,210