Privacy-preserving federated machine learning on FAIR health data: A real-world application

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

Objective

This paper introduces a privacy-preserving federated machine learning (ML) architecture built upon Findable, Accessible, Interoperable, and Reusable (FAIR) health data. It aims to devise an architecture for executing classification algorithms in a federated manner, enabling collaborative model-building among health data owners without sharing their datasets.

Materials and methods

Utilizing an agent-based architecture, a privacy-preserving federated ML algorithm was developed to create a global predictive model from various local models. This involved formally defining the algorithm in two steps: data preparation and federated model training on FAIR health data and constructing the architecture with multiple components facilitating algorithm execution. The solution was validated by five healthcare organizations using their specific health datasets.

Results

Five organizations transformed their datasets into Health Level 7 Fast Healthcare Interoperability Resources via a common FAIRification workflow and software set, thereby generating FAIR datasets. Each organization deployed a Federated ML Agent within its secure network, connected to a cloud-based Federated ML Manager. System testing was conducted on a use case aiming to predict 30-day readmission risk for chronic obstructive pulmonary disease patients and the federated model achieved an accuracy rate of 87%.

Discussion

The paper demonstrated a practical application of privacy-preserving federated ML among five distinct healthcare entities, highlighting the value of FAIR health data in machine learning when utilized in a federated manner that ensures privacy protection without sharing data.

Conclusion

This solution effectively leverages FAIR datasets from multiple healthcare organizations for federated ML while safeguarding sensitive health datasets, meeting legislative privacy and security requirements.

Graphical Abstract

Related collections

Most cited references 26

Record: found
Abstract: found
Article: found

Is Open Access

The FAIR Guiding Principles for scientific data management and stewardship

Mark D Wilkinson, Michel Dumontier, IJsbrand Jan Aalbersberg … (2017)

There is an urgent need to improve the infrastructure supporting the reuse of scholarly data. A diverse set of stakeholders—representing academia, industry, funding agencies, and scholarly publishers—have come together to design and jointly endorse a concise and measureable set of principles that we refer to as the FAIR Data Principles. The intent is that these may act as a guideline for those wishing to enhance the reusability of their data holdings. Distinct from peer initiatives that focus on the human scholar, the FAIR Principles put specific emphasis on enhancing the ability of machines to automatically find and use the data, in addition to supporting its reuse by individuals. This Comment is the first formal publication of the FAIR Principles, and includes the rationale behind them, and some exemplar implementations in the community.

0 comments Cited 3730 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: not found
Article: not found

Federated Machine Learning: Concept and Applications

Qiang Yang, Yang Liu, Tianjian Chen … (2019)

0 comments Cited 500 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: not found
Article: not found

Advances and Open Problems in Federated Learning

Edited Kairouz, H. McMahan (2021)

0 comments Cited 186 times – based on 0 reviews      Review now

Bookmark

All references

Author and article information

Contributors

A. Anil Sinaci

Journal

Journal ID (nlm-ta): Comput Struct Biotechnol J

Journal ID (iso-abbrev): Comput Struct Biotechnol J

Title: Computational and Structural Biotechnology Journal

Publisher: Research Network of Computational and Structural Biotechnology

ISSN (Electronic): 2001-0370

Publication date PMC-release: 17 February 2024

Publication date Collection: December 2024

Publication date (Electronic): 17 February 2024

Volume: 24

Pages: 136-145

Affiliations

[a ]SRDC Software Research Development and Consultancy Corporation, Ankara, Turkey

[b ]Department of Computer Engineering, Middle East Technical University, Ankara, Turkey

[c ]Group of Research and Innovation in Biomedical Informatics, Biomedical Engineering and Health Economy, Institute of Biomedicine of Seville, IBiS / Virgen del Rocío University Hospital / CSIC / University of Seville, Seville, Spain

[d ]Languages and Systems Department, University of Seville, Seville, Spain

Author notes

[* ]Correspondence to: SRDC A.S. ODTU Teknokent Silikon Bina K1-16, Cankaya, 06800 Ankara, Turkey. anil@ 123456srdc.com.tr

[1]

0000-0003-4397-3382

[2]

0000-0003-2697-5722

[3]

0000-0001-8647-9515

[4]

0000-0002-6201-3849

[5]

0000-0001-5614-7747

[6]

0000-0002-6435-1497

[7]

0000-0003-2609-575X

Article

Publisher Item ID: S2001-0370(24)00038-2

DOI: 10.1016/j.csbj.2024.02.014

PMC ID: 10904920

PubMed ID: 38434250

SO-VID: a1759972-d431-456e-afd7-0ace08a8084e

License:

This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

History

Date received : 29 November 2023

Date revision received : 15 February 2024

Date accepted : 15 February 2024

Comments

Comment on this article

scite_

Cited by 5

See all cited by

Most referenced authors 743

See all reference authors

Privacy-preserving federated machine learning on FAIR health data: A real-world application

Read this article at

Abstract

Objective

Materials and methods

Results

Discussion

Conclusion

Graphical Abstract

Related collections

The FAIR data principles

Most cited references 26

The FAIR Guiding Principles for scientific data management and stewardship

Federated Machine Learning: Concept and Applications

Advances and Open Problems in Federated Learning

Author and article information

Contributors

Journal

Affiliations

Author notes

Article

History

Categories

Comments

Comment on this article

Similar content 107

Cited by 5

Most referenced authors 743