8
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Privacy-preserving federated machine learning on FAIR health data: A real-world application

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Objective

          This paper introduces a privacy-preserving federated machine learning (ML) architecture built upon Findable, Accessible, Interoperable, and Reusable (FAIR) health data. It aims to devise an architecture for executing classification algorithms in a federated manner, enabling collaborative model-building among health data owners without sharing their datasets.

          Materials and methods

          Utilizing an agent-based architecture, a privacy-preserving federated ML algorithm was developed to create a global predictive model from various local models. This involved formally defining the algorithm in two steps: data preparation and federated model training on FAIR health data and constructing the architecture with multiple components facilitating algorithm execution. The solution was validated by five healthcare organizations using their specific health datasets.

          Results

          Five organizations transformed their datasets into Health Level 7 Fast Healthcare Interoperability Resources via a common FAIRification workflow and software set, thereby generating FAIR datasets. Each organization deployed a Federated ML Agent within its secure network, connected to a cloud-based Federated ML Manager. System testing was conducted on a use case aiming to predict 30-day readmission risk for chronic obstructive pulmonary disease patients and the federated model achieved an accuracy rate of 87%.

          Discussion

          The paper demonstrated a practical application of privacy-preserving federated ML among five distinct healthcare entities, highlighting the value of FAIR health data in machine learning when utilized in a federated manner that ensures privacy protection without sharing data.

          Conclusion

          This solution effectively leverages FAIR datasets from multiple healthcare organizations for federated ML while safeguarding sensitive health datasets, meeting legislative privacy and security requirements.

          Graphical Abstract

          Related collections

          Most cited references26

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          The FAIR Guiding Principles for scientific data management and stewardship

          There is an urgent need to improve the infrastructure supporting the reuse of scholarly data. A diverse set of stakeholders—representing academia, industry, funding agencies, and scholarly publishers—have come together to design and jointly endorse a concise and measureable set of principles that we refer to as the FAIR Data Principles. The intent is that these may act as a guideline for those wishing to enhance the reusability of their data holdings. Distinct from peer initiatives that focus on the human scholar, the FAIR Principles put specific emphasis on enhancing the ability of machines to automatically find and use the data, in addition to supporting its reuse by individuals. This Comment is the first formal publication of the FAIR Principles, and includes the rationale behind them, and some exemplar implementations in the community.
            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Federated Machine Learning: Concept and Applications

              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Advances and Open Problems in Federated Learning

                Bookmark

                Author and article information

                Contributors
                Journal
                Comput Struct Biotechnol J
                Comput Struct Biotechnol J
                Computational and Structural Biotechnology Journal
                Research Network of Computational and Structural Biotechnology
                2001-0370
                17 February 2024
                December 2024
                17 February 2024
                : 24
                : 136-145
                Affiliations
                [a ]SRDC Software Research Development and Consultancy Corporation, Ankara, Turkey
                [b ]Department of Computer Engineering, Middle East Technical University, Ankara, Turkey
                [c ]Group of Research and Innovation in Biomedical Informatics, Biomedical Engineering and Health Economy, Institute of Biomedicine of Seville, IBiS / Virgen del Rocío University Hospital / CSIC / University of Seville, Seville, Spain
                [d ]Languages and Systems Department, University of Seville, Seville, Spain
                Author notes
                [* ]Correspondence to: SRDC A.S. ODTU Teknokent Silikon Bina K1-16, Cankaya, 06800 Ankara, Turkey. anil@ 123456srdc.com.tr
                [1]

                0000-0003-4397-3382

                [2]

                0000-0003-2697-5722

                [3]

                0000-0001-8647-9515

                [4]

                0000-0002-6201-3849

                [5]

                0000-0001-5614-7747

                [6]

                0000-0002-6435-1497

                [7]

                0000-0003-2609-575X

                Article
                S2001-0370(24)00038-2
                10.1016/j.csbj.2024.02.014
                10904920
                38434250
                a1759972-d431-456e-afd7-0ace08a8084e
                © 2024 The Authors

                This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

                History
                : 29 November 2023
                : 15 February 2024
                : 15 February 2024
                Categories
                Research Article

                privacy-preserving machine learning,federated machine learning,fair data,distributed datasets

                Comments

                Comment on this article