24
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Hide and seek shark teeth in Random Forests: machine learning applied to Scyliorhinus canicula populations

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Shark populations that are distributed alongside a latitudinal gradient often display body size differences at sexual maturity and vicariance patterns related to their number of tooth files. Previous works have demonstrated that Scyliorhinus canicula populations differ between the northeastern Atlantic Ocean and the Mediterranean Sea based on biological features and genetic analysis. In this study, we sample more than 3,000 teeth from 56 S. canicula specimens caught incidentally off Roscoff and Banyuls-sur-Mer. We investigate population differences based on tooth shape and form by using two approaches. Classification results show that the classical geometric morphometric framework is outperformed by an original Random Forests-based framework. Visually, both S. canicula populations share similar ontogenetic trends and timing of gynandric heterodonty emergence but the Atlantic population has bigger, blunter teeth, and less numerous accessory cusps than the Mediterranean population. According to the models, the populations are best differentiated based on their lateral tooth edges, which bear accessory cusps, and the tooth centroid sizes significantly improve classification performances. The differences observed are discussed in light of dietary and behavioural habits of the populations considered. The method proposed in this study could be further adapted to complement DNA analyses to identify shark species or populations based on tooth morphologies. This process would be of particular interest for fisheries management and identification of shark fossils.

          Related collections

          Most cited references91

          • Record: found
          • Abstract: not found
          • Article: not found

          Random Forests

            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            Gene selection and classification of microarray data using random forest

            Background Selection of relevant genes for sample classification is a common task in most gene expression studies, where researchers try to identify the smallest possible set of genes that can still achieve good predictive performance (for instance, for future use with diagnostic purposes in clinical practice). Many gene selection approaches use univariate (gene-by-gene) rankings of gene relevance and arbitrary thresholds to select the number of genes, can only be applied to two-class problems, and use gene selection ranking criteria unrelated to the classification algorithm. In contrast, random forest is a classification algorithm well suited for microarray data: it shows excellent performance even when most predictive variables are noise, can be used when the number of variables is much larger than the number of observations and in problems involving more than two classes, and returns measures of variable importance. Thus, it is important to understand the performance of random forest with microarray data and its possible use for gene selection. Results We investigate the use of random forest for classification of microarray data (including multi-class problems) and propose a new method of gene selection in classification problems based on random forest. Using simulated and nine microarray data sets we show that random forest has comparable performance to other classification methods, including DLDA, KNN, and SVM, and that the new gene selection procedure yields very small sets of genes (often smaller than alternative methods) while preserving predictive accuracy. Conclusion Because of its performance and features, random forest and gene selection using random forest should probably become part of the "standard tool-box" of methods for class prediction and gene selection with microarray data.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Extinction risk and conservation of the world’s sharks and rays

              The rapid expansion of human activities threatens ocean-wide biodiversity. Numerous marine animal populations have declined, yet it remains unclear whether these trends are symptomatic of a chronic accumulation of global marine extinction risk. We present the first systematic analysis of threat for a globally distributed lineage of 1,041 chondrichthyan fishes—sharks, rays, and chimaeras. We estimate that one-quarter are threatened according to IUCN Red List criteria due to overfishing (targeted and incidental). Large-bodied, shallow-water species are at greatest risk and five out of the seven most threatened families are rays. Overall chondrichthyan extinction risk is substantially higher than for most other vertebrates, and only one-third of species are considered safe. Population depletion has occurred throughout the world’s ice-free waters, but is particularly prevalent in the Indo-Pacific Biodiversity Triangle and Mediterranean Sea. Improved management of fisheries and trade is urgently needed to avoid extinctions and promote population recovery. DOI: http://dx.doi.org/10.7554/eLife.00590.001
                Bookmark

                Author and article information

                Contributors
                Journal
                PeerJ
                PeerJ
                PeerJ
                PeerJ
                PeerJ Inc. (San Diego, USA )
                2167-8359
                4 July 2022
                2022
                : 10
                : e13575
                Affiliations
                [1 ]Institut de Génomique Fonctionnelle de Lyon, École Normale Supérieure de Lyon , CNRS, UCBL, Lyon, France
                [2 ]Institut des Sciences de l’Évolution de Montpellier, CNRS, IRD, EPHE, Université de Montpellier , Montpellier, France
                [3 ]Université de Bordeaux, Bordeaux INP, CNRS, LaBRI , Talence, France
                [4 ]Department of Visual and Data-Centric Computing, Zuse Institute Berlin , Berlin, Germany
                Author information
                http://orcid.org/0000-0003-0810-9783
                Article
                13575
                10.7717/peerj.13575
                9261926
                35811817
                8d93bda8-18ac-41e3-a9b0-ca5e1dd24562
                © 2022 Berio et al.

                This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.

                History
                : 18 January 2022
                : 22 May 2022
                Funding
                Funded by: Attractivité Nouveaux professeurs
                Award ID: ENS de Lyon
                This work was supported by the ENS de Lyon “Attractivité Nouveaux professeurs” fund. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
                Categories
                Aquaculture, Fisheries and Fish Science
                Ecology
                Marine Biology
                Zoology
                Data Mining and Machine Learning

                machine learning,geometric morphometrics,tooth morphology,scyliorhinus canicula,random forests,linear discriminant analysis,sharks

                Comments

                Comment on this article