
      Perspective: Materials informatics and big data: Realization of the “fourth paradigm” of science in materials science

APL Materials, AIP Publishing



Most cited references (40)


The random subspace method for constructing decision forests

Tin Kam Ho (1998)

Instance-based learning algorithms

David Aha, Dennis Kibler, Marc Albert (1991)

Rotation forest: A new classifier ensemble method

Juan J. Rodríguez, Ludmila I. Kuncheva, Carlos J. Alonso (2006)

We propose a method for generating classifier ensembles based on feature extraction. To create the training data for a base classifier, the feature set is randomly split into K subsets (K is a parameter of the algorithm) and Principal Component Analysis (PCA) is applied to each subset. All principal components are retained in order to preserve the variability information in the data. Thus, K axis rotations take place to form the new features for a base classifier. The idea of the rotation approach is to simultaneously encourage individual accuracy and diversity within the ensemble. Diversity is promoted through the feature extraction for each base classifier. Decision trees were chosen here because they are sensitive to rotation of the feature axes, hence the name "forest." Accuracy is sought by keeping all principal components and also by using the whole data set to train each base classifier. Using WEKA, we examined the Rotation Forest ensemble on a random selection of 33 benchmark data sets from the UCI repository and compared it with Bagging, AdaBoost, and Random Forest. The results were favorable to Rotation Forest and prompted an investigation into the diversity-accuracy landscape of the ensemble models. Diversity-error diagrams revealed that Rotation Forest ensembles construct individual classifiers which are more accurate than those in AdaBoost and Random Forest, and more diverse than those in Bagging, sometimes more accurate as well.
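To make the feature-extraction idea in the abstract above concrete, here is a minimal sketch in Python, assuming NumPy and scikit-learn are available. The function names (build_rotation_matrix, build_rotation_forest, predict_ensemble) and all defaults are illustrative, and the sketch simplifies the published algorithm (for example, it skips bootstrapping each feature subset before PCA); it is not the authors' reference implementation.

```python
# Sketch of the rotation-forest idea: PCA on random feature subsets,
# assembled into a rotation matrix, then a decision tree per rotation.
import numpy as np
from sklearn.decomposition import PCA
from sklearn.tree import DecisionTreeClassifier
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split


def build_rotation_matrix(X, n_subsets, rng):
    """Split features into random subsets, run PCA on each subset, and
    assemble a block-diagonal rotation matrix from all components."""
    n_features = X.shape[1]
    subsets = np.array_split(rng.permutation(n_features), n_subsets)
    rotation = np.zeros((n_features, n_features))
    for subset in subsets:
        pca = PCA()  # keep every component to preserve variability
        pca.fit(X[:, subset])
        rotation[np.ix_(subset, subset)] = pca.components_.T
    return rotation


def build_rotation_forest(X, y, n_trees=10, n_subsets=3, seed=0):
    """Train one decision tree per randomly rotated copy of the data."""
    rng = np.random.default_rng(seed)
    ensemble = []
    for _ in range(n_trees):
        R = build_rotation_matrix(X, n_subsets, rng)
        tree = DecisionTreeClassifier(random_state=int(rng.integers(1 << 31)))
        tree.fit(X @ R, y)  # whole training set, rotated features
        ensemble.append((tree, R))
    return ensemble


def predict_ensemble(ensemble, X):
    """Majority vote over the rotated base classifiers."""
    votes = np.stack([tree.predict(X @ R) for tree, R in ensemble])
    return np.apply_along_axis(lambda v: np.bincount(v).argmax(), 0, votes)


if __name__ == "__main__":
    X, y = load_iris(return_X_y=True)
    X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)
    forest = build_rotation_forest(X_tr, y_tr)
    acc = (predict_ensemble(forest, X_te) == y_te).mean()
    print(f"rotation-forest sketch accuracy: {acc:.3f}")
```

Keeping every principal component (PCA with no n_components argument) mirrors the abstract's point about preserving variability, while the per-tree random split of the feature set is what injects diversity into the ensemble.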

Author and article information

Journal: APL Materials (APL Mater.), AIP Publishing
Journal ID: AMPADS
ISSN: 2166-532X
Published: May 01 2016
Volume: 4
Issue: 5
Article number: 053208
DOI: 10.1063/1.4946894
Record ID: 35586923-f421-41fd-ac07-4d8a182296fb
© 2016
