1
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Use of Non-invasive Parameters and Machine-Learning Algorithms for Predicting Future Risk of Type 2 Diabetes: A Retrospective Cohort Study of Health Data From Kuwait

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Objective: In recent decades, the Arab population has experienced an increase in the prevalence of type 2 diabetes (T2DM), particularly within the Gulf Cooperation Council. In this context, early intervention programmes rely on an ability to identify individuals at risk of T2DM. We aimed to build prognostic models for the risk of T2DM in the Arab population using machine-learning algorithms vs. conventional logistic regression (LR) and simple non-invasive clinical markers over three different time scales (3, 5, and 7 years from the baseline).

          Design: This retrospective cohort study used three models based on LR, k-nearest neighbours (k-NN), and support vector machines (SVM) with five-fold cross-validation. The models included the following baseline non-invasive parameters: age, sex, body mass index (BMI), pre-existing hypertension, family history of hypertension, and T2DM.

          Setting: This study was based on data from the Kuwait Health Network (KHN), which integrated primary health and hospital laboratory data into a single system.

          Participants: The study included 1,837 native Kuwaiti Arab individuals (equal proportion of men and women) with mean age as 59.5 ± 11.4 years. Among them, 647 developed T2DM within 7 years of the baseline non-invasive measurements.

          Analytical methods: The discriminatory power of each model for classifying people at risk of T2DM within 3, 5, or 7 years and the area under the receiver operating characteristic curve (AUC) were determined.

          Outcome measures: Onset of T2DM at 3, 5, and 7 years.

          Results: The k-NN machine-learning technique, which yielded AUC values of 0.83, 0.82, and 0.79 for 3-, 5-, and 7-year prediction horizons, respectively, outperformed the most commonly used LR method and other previously reported methods. Comparable results were achieved using the SVM and LR models with corresponding AUC values of (SVM: 0.73, LR: 0.74), (SVM: 0.68, LR: 0.72), and (SVM: 0.71, LR: 0.70) for 3-, 5-, and 7-year prediction horizons, respectively. For all models, the discriminatory power decreased as the prediction horizon increased from 3 to 7 years.

          Conclusions: Machine-learning techniques represent a useful addition to the commonly reported LR technique. Our prognostic models for the future risk of T2DM could be used to plan and implement early prevention programmes for at risk groups in the Arab population.

          Related collections

          Most cited references24

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          Risk models and scores for type 2 diabetes: systematic review

          Objective To evaluate current risk models and scores for type 2 diabetes and inform selection and implementation of these in practice. Design Systematic review using standard (quantitative) and realist (mainly qualitative) methodology. Inclusion criteria Papers in any language describing the development or external validation, or both, of models and scores to predict the risk of an adult developing type 2 diabetes. Data sources Medline, PreMedline, Embase, and Cochrane databases were searched. Included studies were citation tracked in Google Scholar to identify follow-on studies of usability or impact. Data extraction Data were extracted on statistical properties of models, details of internal or external validation, and use of risk scores beyond the studies that developed them. Quantitative data were tabulated to compare model components and statistical properties. Qualitative data were analysed thematically to identify mechanisms by which use of the risk model or score might improve patient outcomes. Results 8864 titles were scanned, 115 full text papers considered, and 43 papers included in the final sample. These described the prospective development or validation, or both, of 145 risk prediction models and scores, 94 of which were studied in detail here. They had been tested on 6.88 million participants followed for up to 28 years. Heterogeneity of primary studies precluded meta-analysis. Some but not all risk models or scores had robust statistical properties (for example, good discrimination and calibration) and had been externally validated on a different population. Genetic markers added nothing to models over clinical and sociodemographic factors. Most authors described their score as “simple” or “easily implemented,” although few were specific about the intended users and under what circumstances. Ten mechanisms were identified by which measuring diabetes risk might improve outcomes. Follow-on studies that applied a risk score as part of an intervention aimed at reducing actual risk in people were sparse. Conclusion Much work has been done to develop diabetes risk models and scores, but most are rarely used because they require tests not routinely available or they were developed without a specific user or clear use in mind. Encouragingly, recent research has begun to tackle usability and the impact of diabetes risk scores. Two promising areas for further research are interventions that prompt lay people to check their own diabetes risk and use of risk scores on population datasets to identify high risk “hotspots” for targeted public health interventions.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Non-communicable diseases in the Arab world.

            According to the results of the Global Burden of Disease Study 2010, the burden of non-communicable diseases (cardiovascular disease, cancer, chronic lung diseases, and diabetes) in the Arab world has increased, with variations between countries of different income levels. Behavioural risk factors, including tobacco use, unhealthy diets, and physical inactivity are prevalent, and obesity in adults and children has reached an alarming level. Despite epidemiological evidence, the policy response to non-communicable diseases has been weak. So far, Arab governments have not placed a sufficiently high priority on addressing the high prevalence of non-communicable diseases, with variations in policies between countries and overall weak implementation. Cost-effective and evidence-based prevention and treatment interventions have already been identified. The implementation of these interventions, beginning with immediate action on salt reduction and stricter implementation of tobacco control measures, will address the rise in major risk factors. Implementation of an effective response to the non-communicable-disease crisis will need political commitment, multisectoral action, strengthened health systems, and continuous monitoring and assessment of progress. Arab governments should be held accountable for their UN commitments to address the crisis. Engagement in the global monitoring framework for non-communicable diseases should promote accountability for effective action. The human and economic burden leaves no room for inaction. Copyright © 2014 Elsevier Ltd. All rights reserved.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              Predicting Diabetes Mellitus With Machine Learning Techniques

              Diabetes mellitus is a chronic disease characterized by hyperglycemia. It may cause many complications. According to the growing morbidity in recent years, in 2040, the world’s diabetic patients will reach 642 million, which means that one of the ten adults in the future is suffering from diabetes. There is no doubt that this alarming figure needs great attention. With the rapid development of machine learning, machine learning has been applied to many aspects of medical health. In this study, we used decision tree, random forest and neural network to predict diabetes mellitus. The dataset is the hospital physical examination data in Luzhou, China. It contains 14 attributes. In this study, five-fold cross validation was used to examine the models. In order to verity the universal applicability of the methods, we chose some methods that have the better performance to conduct independent test experiments. We randomly selected 68994 healthy people and diabetic patients’ data, respectively as training set. Due to the data unbalance, we randomly extracted 5 times data. And the result is the average of these five experiments. In this study, we used principal component analysis (PCA) and minimum redundancy maximum relevance (mRMR) to reduce the dimensionality. The results showed that prediction with random forest could reach the highest accuracy (ACC = 0.8084) when all the attributes were used.
                Bookmark

                Author and article information

                Contributors
                Journal
                Front Endocrinol (Lausanne)
                Front Endocrinol (Lausanne)
                Front. Endocrinol.
                Frontiers in Endocrinology
                Frontiers Media S.A.
                1664-2392
                11 September 2019
                2019
                : 10
                : 624
                Affiliations
                [1] 1Research Division, Dasman Diabetes Institute , Kuwait City, Kuwait
                [2] 2Department of Primary Health Care, Ministry of Health , Kuwait City, Kuwait
                [3] 3Department of Pediatrics, Farwaniya Hospital , Al Farwaniyah, Kuwait
                [4] 4Department of Pediatrics, Faculty of Medicine, Kuwait University , Kuwait City, Kuwait
                Author notes

                Edited by: Undurti Narasimha Das, UND Life Sciences LLC, United States

                Reviewed by: Charumathi Sabanayagam, Singapore Eye Research Institute, Singapore; Jaakko Tuomilehto, National Institute for Health and Welfare, Finland

                *Correspondence: Thangavel Alphonse Thanaraj alphonse.thangavel@ 123456dasmaninstitute.org

                This article was submitted to Clinical Diabetes, a section of the journal Frontiers in Endocrinology

                †Present Address: Bassam Farran, McLaren Applied Technologies, London, United Kingdom

                Article
                10.3389/fendo.2019.00624
                6749017
                31572303
                38ac3965-1671-4980-81d7-edfdbdf1df3b
                Copyright © 2019 Farran, AlWotayan, Alkandari, Al-Abdulrazzaq, Channanath and Thanaraj.

                This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

                History
                : 14 November 2018
                : 28 August 2019
                Page count
                Figures: 1, Tables: 5, Equations: 0, References: 42, Pages: 11, Words: 7905
                Categories
                Endocrinology
                Original Research

                Endocrinology & Diabetes
                body mass index,prognosis,type 2 diabetes,hypertension,logistic regression,support vector machine,k-nearest neighbours

                Comments

                Comment on this article