23
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Application of Machine Learning Algorithms in Plant Breeding: Predicting Yield From Hyperspectral Reflectance in Soybean

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Recent substantial advances in high-throughput field phenotyping have provided plant breeders with affordable and efficient tools for evaluating a large number of genotypes for important agronomic traits at early growth stages. Nevertheless, the implementation of large datasets generated by high-throughput phenotyping tools such as hyperspectral reflectance in cultivar development programs is still challenging due to the essential need for intensive knowledge in computational and statistical analyses. In this study, the robustness of three common machine learning (ML) algorithms, multilayer perceptron (MLP), support vector machine (SVM), and random forest (RF), were evaluated for predicting soybean ( Glycine max) seed yield using hyperspectral reflectance. For this aim, the hyperspectral reflectance data for the whole spectra ranged from 395 to 1005 nm, which were collected at the R4 and R5 growth stages on 250 soybean genotypes grown in four environments. The recursive feature elimination (RFE) approach was performed to reduce the dimensionality of the hyperspectral reflectance data and select variables with the largest importance values. The results indicated that R5 is more informative stage for measuring hyperspectral reflectance to predict seed yields. The 395 nm reflectance band was also identified as the high ranked band in predicting the soybean seed yield. By considering either full or selected variables as the input variables, the ML algorithms were evaluated individually and combined-version using the ensemble–stacking (E–S) method to predict the soybean yield. The RF algorithm had the highest performance with a value of 84% yield classification accuracy among all the individual tested algorithms. Therefore, by selecting RF as the metaClassifier for E–S method, the prediction accuracy increased to 0.93, using all variables, and 0.87, using selected variables showing the success of using E–S as one of the ensemble techniques. This study demonstrated that soybean breeders could implement E–S algorithm using either the full or selected spectra reflectance to select the high-yielding soybean genotypes, among a large number of genotypes, at early growth stages.

          Related collections

          Most cited references127

          • Record: found
          • Abstract: not found
          • Article: not found

          Random Forests

            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Bagging predictors

              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Building Predictive Models inRUsing thecaretPackage

                Bookmark

                Author and article information

                Contributors
                Journal
                Front Plant Sci
                Front Plant Sci
                Front. Plant Sci.
                Frontiers in Plant Science
                Frontiers Media S.A.
                1664-462X
                12 January 2021
                2020
                : 11
                : 624273
                Affiliations
                [1] 1Department of Plant Agriculture, University of Guelph , Guelph, ON, Canada
                [2] 2Department of Animal Biosciences, University of Guelph , Guelph, ON, Canada
                Author notes

                Edited by: Yiannis Ampatzidis, University of Florida, United States

                Reviewed by: Omar Vergara-Diaz, University of Barcelona, Spain; Michael Gomez Selvaraj, Consultative Group on International Agricultural Research (CGIAR), United States

                *Correspondence: Milad Eskandari, meskanda@ 123456uoguelph.ca

                This article was submitted to Technical Advances in Plant Science, a section of the journal Frontiers in Plant Science

                Article
                10.3389/fpls.2020.624273
                7835636
                33510761
                0088cac9-31d8-467e-9959-8834181bd674
                Copyright © 2021 Yoosefzadeh-Najafabadi, Earl, Tulpan, Sulik and Eskandari.

                This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

                History
                : 31 October 2020
                : 10 December 2020
                Page count
                Figures: 9, Tables: 2, Equations: 4, References: 119, Pages: 14, Words: 0
                Categories
                Plant Science
                Original Research

                Plant science & Botany
                artificial intelligence,data-driven model,ensemble methods,high-throughput phenotyping,random forest,recursive feature elimination

                Comments

                Comment on this article