2
views
0
recommends
+1 Recommend
1 collections
    0
    shares

      Submit your digital health research with an established publisher
      - celebrating 25 years of open access

      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Privacy-Preserving Analysis of Distributed Biomedical Data: Designing Efficient and Secure Multiparty Computations Using Distributed Statistical Learning Theory

      research-article

      Read this article at

      ScienceOpenPublisherPMC
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background

          Biomedical research often requires large cohorts and necessitates the sharing of biomedical data with researchers around the world, which raises many privacy, ethical, and legal concerns. In the face of these concerns, privacy experts are trying to explore approaches to analyzing the distributed data while protecting its privacy. Many of these approaches are based on secure multiparty computations (SMCs). SMC is an attractive approach allowing multiple parties to collectively carry out calculations on their datasets without having to reveal their own raw data; however, it incurs heavy computation time and requires extensive communication between the involved parties.

          Objective

          This study aimed to develop usable and efficient SMC applications that meet the needs of the potential end-users and to raise general awareness about SMC as a tool that supports data sharing.

          Methods

          We have introduced distributed statistical computing (DSC) into the design of secure multiparty protocols, which allows us to conduct computations on each of the parties’ sites independently and then combine these computations to form 1 estimator for the collective dataset, thus limiting communication to the final step and reducing complexity. The effectiveness of our privacy-preserving model is demonstrated through a linear regression application.

          Results

          Our secure linear regression algorithm was tested for accuracy and performance using real and synthetic datasets. The results showed no loss of accuracy (over nonsecure regression) and very good performance (20 min for 100 million records).

          Conclusions

          We used DSC to securely calculate a linear regression model over multiple datasets. Our experiments showed very good performance (in terms of the number of records it can handle). We plan to extend our method to other estimators such as logistic regression.

          Related collections

          Most cited references33

          • Record: found
          • Abstract: not found
          • Conference Proceedings: not found

          Privacy-Preserving Ridge Regression on Hundreds of Millions of Records

            Bookmark
            • Record: found
            • Abstract: not found
            • Conference Proceedings: not found

            Foundations of garbled circuits

              Bookmark
              • Record: found
              • Abstract: not found
              • Book Chapter: not found

              Secret-Sharing Schemes: A Survey

                Bookmark

                Author and article information

                Contributors
                Journal
                JMIR Med Inform
                JMIR Med Inform
                JMI
                JMIR Medical Informatics
                JMIR Publications (Toronto, Canada )
                2291-9694
                Apr-Jun 2019
                29 April 2019
                : 7
                : 2
                : e12702
                Affiliations
                [1 ] United Arab Emirates University Abu Dhabi United Arab Emirates
                [2 ] Independent Scientist Ottawa, ON Canada
                [3 ] Sidra Medicine Doha Qatar
                Author notes
                Corresponding Author: Fida K Dankar fida.dankar@ 123456uaeu.ac.ae
                Author information
                http://orcid.org/0000-0002-1248-3295
                http://orcid.org/0000-0003-1736-7426
                http://orcid.org/0000-0001-8282-756X
                http://orcid.org/0000-0003-2734-3356
                Article
                v7i2e12702
                10.2196/12702
                6658266
                31033449
                407f4ecb-045a-49ea-af05-9b375d63e418
                ©Fida K Dankar, Nisha Madathil, Samar K Dankar, Sabri Boughorbel. Originally published in JMIR Medical Informatics (http://medinform.jmir.org), 29.04.2019.

                This is an open-access article distributed under the terms of the Creative Commons Attribution License ( https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on http://medinform.jmir.org/.as well as this copyright and license information must be included.

                History
                : 6 November 2018
                : 27 December 2018
                : 14 January 2019
                : 15 February 2019
                Categories
                Original Paper
                Original Paper

                data analytics,data aggregation,personal genetic information,patient data privacy

                Comments

                Comment on this article