15
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Crop Yield Prediction Using Deep Neural Networks

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Crop yield is a highly complex trait determined by multiple factors such as genotype, environment, and their interactions. Accurate yield prediction requires fundamental understanding of the functional relationship between yield and these interactive factors, and to reveal such relationship requires both comprehensive datasets and powerful algorithms. In the 2018 Syngenta Crop Challenge, Syngenta released several large datasets that recorded the genotype and yield performances of 2,267 maize hybrids planted in 2,247 locations between 2008 and 2016 and asked participants to predict the yield performance in 2017. As one of the winning teams, we designed a deep neural network (DNN) approach that took advantage of state-of-the-art modeling and solution techniques. Our model was found to have a superior prediction accuracy, with a root-mean-square-error (RMSE) being 12% of the average yield and 50% of the standard deviation for the validation dataset using predicted weather data. With perfect weather data, the RMSE would be reduced to 11% of the average yield and 46% of the standard deviation. We also performed feature selection based on the trained DNN model, which successfully decreased the dimension of the input space without significant drop in the prediction accuracy. Our computational results suggested that this model significantly outperformed other popular methods such as Lasso, shallow neural networks (SNN), and regression tree (RT). The results also revealed that environmental factors had a greater effect on the crop yield than genotype.

          Related collections

          Most cited references29

          • Record: found
          • Abstract: not found
          • Conference Proceedings: not found

          Feature selection, L1 vs. L2 regularization, and rotational invariance

            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Integrating environmental covariates and crop modeling into the genomic selection framework to predict genotype by environment interactions.

            Development of models to predict genotype by environment interactions, in unobserved environments, using environmental covariates, a crop model and genomic selection. Application to a large winter wheat dataset. Genotype by environment interaction (G*E) is one of the key issues when analyzing phenotypes. The use of environment data to model G*E has long been a subject of interest but is limited by the same problems as those addressed by genomic selection methods: a large number of correlated predictors each explaining a small amount of the total variance. In addition, non-linear responses of genotypes to stresses are expected to further complicate the analysis. Using a crop model to derive stress covariates from daily weather data for predicted crop development stages, we propose an extension of the factorial regression model to genomic selection. This model is further extended to the marker level, enabling the modeling of quantitative trait loci (QTL) by environment interaction (Q*E), on a genome-wide scale. A newly developed ensemble method, soft rule fit, was used to improve this model and capture non-linear responses of QTL to stresses. The method is tested using a large winter wheat dataset, representative of the type of data available in a large-scale commercial breeding program. Accuracy in predicting genotype performance in unobserved environments for which weather data were available increased by 11.1% on average and the variability in prediction accuracy decreased by 10.8%. By leveraging agronomic knowledge and the large historical datasets generated by breeding programs, this new model provides insight into the genetic architecture of genotype by environment interactions and could predict genotype performance based on past and future weather scenarios.
              Bookmark
              • Record: found
              • Abstract: not found
              • Conference Proceedings: not found

              Tensorflow: a system for large-scale machine learning.

                Bookmark

                Author and article information

                Contributors
                Journal
                Front Plant Sci
                Front Plant Sci
                Front. Plant Sci.
                Frontiers in Plant Science
                Frontiers Media S.A.
                1664-462X
                22 May 2019
                2019
                : 10
                : 621
                Affiliations
                Industrial and Manufacturing Systems Engineering, Iowa State University , Ames, IA, United States
                Author notes

                Edited by: Alfredo Pulvirenti, Università degli Studi di Catania, Italy

                Reviewed by: Sheldon Du, University of Wisconsin-Madison, United States; Dong Xu, University of Missouri, United States

                *Correspondence: Saeed Khaki skhaki@ 123456iastate.edu

                This article was submitted to Bioinformatics and Computational Biology, a section of the journal Frontiers in Plant Science

                Article
                10.3389/fpls.2019.00621
                6540942
                31191564
                d8617c1f-9ba9-42e3-8354-4bb0b4b70496
                Copyright © 2019 Khaki and Wang.

                This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

                History
                : 06 February 2019
                : 26 April 2019
                Page count
                Figures: 9, Tables: 4, Equations: 1, References: 46, Pages: 10, Words: 6233
                Categories
                Plant Science
                Original Research

                Plant science & Botany
                yield prediction,machine learning,deep learning,feature selection,weather prediction

                Comments

                Comment on this article