8
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Machine Learning Approach for Prescriptive Plant Breeding

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          We explored the capability of fusing high dimensional phenotypic trait (phenomic) data with a machine learning (ML) approach to provide plant breeders the tools to do both in-season seed yield (SY) prediction and prescriptive cultivar development for targeted agro-management practices (e.g., row spacing and seeding density). We phenotyped 32 SoyNAM parent genotypes in two independent studies each with contrasting agro-management treatments (two row spacing, three seeding densities). Phenotypic trait data (canopy temperature, chlorophyll content, hyperspectral reflectance, leaf area index, and light interception) were generated using an array of sensors at three growth stages during the growing season and seed yield (SY) determined by machine harvest. Random forest (RF) was used to train models for SY prediction using phenotypic traits (predictor variables) to identify the optimal temporal combination of variables to maximize accuracy and resource allocation. RF models were trained using data from both experiments and individually for each agro-management treatment. We report the most important traits agnostic of agro-management practices. Several predictor variables showed conditional importance dependent on the agro-management system. We assembled predictive models to enable in-season SY prediction, enabling the development of a framework to integrate phenomics information with powerful ML for prediction enabled prescriptive plant breeding.

          Related collections

          Most cited references47

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          Gene selection and classification of microarray data using random forest

          Background Selection of relevant genes for sample classification is a common task in most gene expression studies, where researchers try to identify the smallest possible set of genes that can still achieve good predictive performance (for instance, for future use with diagnostic purposes in clinical practice). Many gene selection approaches use univariate (gene-by-gene) rankings of gene relevance and arbitrary thresholds to select the number of genes, can only be applied to two-class problems, and use gene selection ranking criteria unrelated to the classification algorithm. In contrast, random forest is a classification algorithm well suited for microarray data: it shows excellent performance even when most predictive variables are noise, can be used when the number of variables is much larger than the number of observations and in problems involving more than two classes, and returns measures of variable importance. Thus, it is important to understand the performance of random forest with microarray data and its possible use for gene selection. Results We investigate the use of random forest for classification of microarray data (including multi-class problems) and propose a new method of gene selection in classification problems based on random forest. Using simulated and nine microarray data sets we show that random forest has comparable performance to other classification methods, including DLDA, KNN, and SVM, and that the new gene selection procedure yields very small sets of genes (often smaller than alternative methods) while preserving predictive accuracy. Conclusion Because of its performance and features, random forest and gene selection using random forest should probably become part of the "standard tool-box" of methods for class prediction and gene selection with microarray data.
            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Detection of Influential Observation in Linear Regression

              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Stage of Development Descriptions for Soybeans, Glycine Max (L.) Merrill1

                Bookmark

                Author and article information

                Contributors
                singhak@iastate.edu
                Journal
                Sci Rep
                Sci Rep
                Scientific Reports
                Nature Publishing Group UK (London )
                2045-2322
                20 November 2019
                20 November 2019
                2019
                : 9
                : 17132
                Affiliations
                [1 ]ISNI 0000 0004 1936 7312, GRID grid.34421.30, Department of Agronomy, Iowa State University, ; Ames, IA USA
                [2 ]ISNI 0000 0004 1936 7312, GRID grid.34421.30, Department of Mechanical Engineering, Iowa State University, ; Ames, IA USA
                Author information
                http://orcid.org/0000-0002-7522-037X
                Article
                53451
                10.1038/s41598-019-53451-4
                6868245
                31748577
                f35a5b45-3c84-4337-97d3-b5b58cb5c48e
                © The Author(s) 2019

                Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

                History
                : 1 May 2019
                : 29 October 2019
                Funding
                Funded by: FundRef https://doi.org/10.13039/100011461, Iowa Soybean Association (ISA);
                Funded by: R F Baker for Plant Breeding at Iowa State University (ISU), Monsanto Chair in Soybean Breeding at ISU, Plant Sciences Institute at ISU, Presidential Interdisciplinary Research Initiative at ISU, Iowa Crop Improvement Association, and USDA IOW04403
                Categories
                Article
                Custom metadata
                © The Author(s) 2019

                Uncategorized
                high-throughput screening,plant breeding
                Uncategorized
                high-throughput screening, plant breeding

                Comments

                Comment on this article