Enhancing materials property prediction by leveraging computational and experimental data using deep transfer learning

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

The current predictive modeling techniques applied to Density Functional Theory (DFT) computations have helped accelerate the process of materials discovery by providing significantly faster methods to scan materials candidates, thereby reducing the search space for future DFT computations and experiments. However, in addition to prediction error against DFT-computed properties, such predictive models also inherit the DFT-computation discrepancies against experimentally measured properties. To address this challenge, we demonstrate that using deep transfer learning, existing large DFT-computational data sets (such as the Open Quantum Materials Database (OQMD)) can be leveraged together with other smaller DFT-computed data sets as well as available experimental observations to build robust prediction models. We build a highly accurate model for predicting formation energy of materials from their compositions; using an experimental data set of \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$1,963$$\end{document} observations, the proposed approach yields a mean absolute error (MAE) of \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$0.06$$\end{document} eV/atom, which is significantly better than existing machine learning (ML) prediction modeling based on DFT computations and is comparable to the MAE of DFT-computation itself.

Abstract

Machine-learning approaches based on DFT computations can greatly enhance materials discovery. Here the authors leverage existing large DFT-computational data sets and experimental observations by deep transfer learning to predict the formation energy of materials from their elemental compositions with high accuracy.

Related collections

Most cited references 43

Record: found
Abstract: found
Article: found

Is Open Access

Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning

Hoo-Chang Shin, Holger R. Roth, Mingchen Gao … (2016)

Remarkable progress has been made in image recognition, primarily due to the availability of large-scale annotated datasets and deep convolutional neural networks (CNNs). CNNs enable learning data-driven, highly representative, hierarchical image features from sufficient training data. However, obtaining datasets as comprehensively annotated as ImageNet in the medical imaging domain remains a challenge. There are currently three major techniques that successfully employ CNNs to medical image classification: training the CNN from scratch, using off-the-shelf pre-trained CNN features, and conducting unsupervised CNN pre-training with supervised fine-tuning. Another effective method is transfer learning, i.e., fine-tuning CNN models pre-trained from natural image dataset to medical image tasks. In this paper, we exploit three important, but previously understudied factors of employing deep convolutional neural networks to computer-aided detection problems. We first explore and evaluate different CNN architectures. The studied models contain 5 thousand to 160 million parameters, and vary in numbers of layers. We then evaluate the influence of dataset scale and spatial image context on performance. Finally, we examine when and why transfer learning from pre-trained ImageNet (via fine-tuning) can be useful. We study two specific computer-aided detection (CADe) problems, namely thoraco-abdominal lymph node (LN) detection and interstitial lung disease (ILD) classification. We achieve the state-of-the-art performance on the mediastinal LN detection, and report the first five-fold cross-validation classification results on predicting axial CT slices with ILD categories. Our extensive empirical evaluation, CNN model analysis and valuable insights can be extended to the design of high performance CAD systems for other medical imaging tasks.

0 comments Cited 1031 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: not found
Article: not found

SchNet – A deep learning architecture for molecules and materials

K.-R. Muller, H. E. Sauceda, P-J Kindermans … (2018)

0 comments Cited 523 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: not found
Article: not found

Materials Design and Discovery with High-Throughput Density Functional Theory: The Open Quantum Materials Database (OQMD)

James E. Saal, Scott Kirklin, Muratahan Aykol … (2013)

0 comments Cited 476 times – based on 0 reviews      Review now

Bookmark

All references

Author and article information

Contributors

Ankit Agrawal: ankitag@eecs.northwestern.edu

Journal

Journal ID (nlm-ta): Nat Commun

Journal ID (iso-abbrev): Nat Commun

Title: Nature Communications

Publisher: Nature Publishing Group UK (London )

ISSN (Electronic): 2041-1723

Publication date (Electronic): 22 November 2019

Publication date PMC-release: 22 November 2019

Publication date Collection: 2019

Volume: 10

Electronic Location Identifier: 5316

Affiliations

[1 ]ISNI 0000 0001 2299 3507, GRID grid.16753.36, Department of Electrical and Computer Engineering, , Northwestern University, ; Evanston, IL 60208 USA

[2 ]ISNI 000000012158463X, GRID grid.94225.38, Thermodynamics and Kinetics Group, , National Institute of Standards and Technology, ; Gaithersburg, MD 20899 USA

Article

Publisher ID: 13297

DOI: 10.1038/s41467-019-13297-w

PMC ID: 6874674

PubMed ID: 31757948

SO-VID: 0e9952be-1f04-466f-bbe6-45b5499e949b

License:

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

History

Date received : 6 March 2019

Date accepted : 24 October 2019

Funding

Funded by: FundRef https://doi.org/10.13039/100000190, U.S. Department of Commerce (United States Department of Commerce);

Award ID: 70NANB19H005

Award Recipient : Ankit Agrawal

Funded by: FundRef https://doi.org/10.13039/100000015, U.S. Department of Energy (DOE);

Award ID: DE-SC0014330

Award ID: DE-SC0019358

Award Recipient : Ankit Agrawal

Custom metadata

ScienceOpen disciplines: Uncategorized

Keywords: inorganic chemistry,materials chemistry,density functional theory,theory and computation

Data availability:

ScienceOpen disciplines: Uncategorized

Keywords: inorganic chemistry, materials chemistry, density functional theory, theory and computation

Enhancing materials property prediction by leveraging computational and experimental data using deep transfer learning

Read this article at

Abstract

Abstract

Related collections

Genome Engineering using CRISPR

Most cited references 43

Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning

SchNet – A deep learning architecture for molecules and materials

Materials Design and Discovery with High-Throughput Density Functional Theory: The Open Quantum Materials Database (OQMD)

Author and article information

Contributors

Journal

Affiliations

Article

History

Funding

Categories

Custom metadata

Comments

Comment on this article

Similar content 69

Cited by 66

Most referenced authors 956