6
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Predicting childhood obesity using electronic health records and publicly available data

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background

          Because of the strong link between childhood obesity and adulthood obesity comorbidities, and the difficulty in decreasing body mass index (BMI) later in life, effective strategies are needed to address this condition in early childhood. The ability to predict obesity before age five could be a useful tool, allowing prevention strategies to focus on high risk children. The few existing prediction models for obesity in childhood have primarily employed data from longitudinal cohort studies, relying on difficult to collect data that are not readily available to all practitioners. Instead, we utilized real-world unaugmented electronic health record (EHR) data from the first two years of life to predict obesity status at age five, an approach not yet taken in pediatric obesity research.

          Methods and findings

          We trained a variety of machine learning algorithms to perform both binary classification and regression. Following previous studies demonstrating different obesity determinants for boys and girls, we similarly developed separate models for both groups. In each of the separate models for boys and girls we found that weight for length z-score, BMI between 19 and 24 months, and the last BMI measure recorded before age two were the most important features for prediction. The best performing models were able to predict obesity with an Area Under the Receiver Operator Characteristic Curve (AUC) of 81.7% for girls and 76.1% for boys.

          Conclusions

          We were able to predict obesity at age five using EHR data with an AUC comparable to cohort-based studies, reducing the need for investment in additional data collection. Our results suggest that machine learning approaches for predicting future childhood obesity using EHR data could improve the ability of clinicians and researchers to drive future policy, intervention design, and the decision-making process in a clinical setting.

          Related collections

          Most cited references32

          • Record: found
          • Abstract: found
          • Article: not found

          Health consequences of obesity in youth: childhood predictors of adult disease.

          W Dietz (1998)
          Obesity now affects one in five children in the United States. Discrimination against overweight children begins early in childhood and becomes progressively institutionalized. Because obese children tend to be taller than their nonoverweight peers, they are apt to be viewed as more mature. The inappropriate expectations that result may have an adverse effect on their socialization. Many of the cardiovascular consequences that characterize adult-onset obesity are preceded by abnormalities that begin in childhood. Hyperlipidemia, hypertension, and abnormal glucose tolerance occur with increased frequency in obese children and adolescents. The relationship of cardiovascular risk factors to visceral fat independent of total body fat remains unclear. Sleep apnea, pseudotumor cerebri, and Blount's disease represent major sources of morbidity for which rapid and sustained weight reduction is essential. Although several periods of increased risk appear in childhood, it is not clear whether obesity with onset early in childhood carries a greater risk of adult morbidity and mortality. Obesity is now the most prevalent nutritional disease of children and adolescents in the United States. Although obesity-associated morbidities occur more frequently in adults, significant consequences of obesity as well as the antecedents of adult disease occur in obese children and adolescents. In this review, I consider the adverse effects of obesity in children and adolescents and attempt to outline areas for future research. I refer to obesity as a body mass index greater than the 95th percentile for children of the same age and gender.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            A simple algorithm for identifying negated findings and diseases in discharge summaries.

            Narrative reports in medical records contain a wealth of information that may augment structured data for managing patient information and predicting trends in diseases. Pertinent negatives are evident in text but are not usually indexed in structured databases. The objective of the study reported here was to test a simple algorithm for determining whether a finding or disease mentioned within narrative medical reports is present or absent. We developed a simple regular expression algorithm called NegEx that implements several phrases indicating negation, filters out sentences containing phrases that falsely appear to be negation phrases, and limits the scope of the negation phrases. We compared NegEx against a baseline algorithm that has a limited set of negation phrases and a simpler notion of scope. In a test of 1235 findings and diseases in 1000 sentences taken from discharge summaries indexed by physicians, NegEx had a specificity of 94.5% (versus 85.3% for the baseline), a positive predictive value of 84.5% (versus 68.4% for the baseline) while maintaining a reasonable sensitivity of 77.8% (versus 88.3% for the baseline). We conclude that with little implementation effort a simple regular expression algorithm for determining whether a finding or disease is absent can identify a large portion of the pertinent negatives from discharge summaries.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              A targeted real-time early warning score (TREWScore) for septic shock

              Sepsis is a leading cause of death in the United States, with mortality highest among patients who develop septic shock. Early aggressive treatment decreases morbidity and mortality. Although automated screening tools can detect patients currently experiencing severe sepsis and septic shock, none predict those at greatest risk of developing shock. We analyzed routinely available physiological and laboratory data from intensive care unit patients and developed "TREWScore," a targeted real-time early warning score that predicts which patients will develop septic shock. TREWScore identified patients before the onset of septic shock with an area under the ROC (receiver operating characteristic) curve (AUC) of 0.83 [95% confidence interval (CI), 0.81 to 0.85]. At a specificity of 0.67, TREWScore achieved a sensitivity of 0.85 and identified patients a median of 28.2 [interquartile range (IQR), 10.6 to 94.2] hours before onset. Of those identified, two-thirds were identified before any sepsis-related organ dysfunction. In comparison, the Modified Early Warning Score, which has been used clinically for septic shock prediction, achieved a lower AUC of 0.73 (95% CI, 0.71 to 0.76). A routine screening protocol based on the presence of two of the systemic inflammatory response syndrome criteria, suspicion of infection, and either hypotension or hyperlactatemia achieved a lower sensitivity of 0.74 at a comparable specificity of 0.64. Continuous sampling of data from the electronic health records and calculation of TREWScore may allow clinicians to identify patients at risk for septic shock and provide earlier interventions that would prevent or mitigate the associated morbidity and mortality.
                Bookmark

                Author and article information

                Contributors
                Role: ConceptualizationRole: Data curationRole: Formal analysisRole: InvestigationRole: MethodologyRole: SoftwareRole: ValidationRole: VisualizationRole: Writing – original draftRole: Writing – review & editing
                Role: ConceptualizationRole: MethodologyRole: Writing – original draftRole: Writing – review & editing
                Role: ConceptualizationRole: Project administrationRole: ResourcesRole: Writing – original draftRole: Writing – review & editing
                Role: ConceptualizationRole: MethodologyRole: SupervisionRole: VisualizationRole: Writing – review & editing
                Role: ConceptualizationRole: Project administrationRole: ResourcesRole: Writing – original draftRole: Writing – review & editing
                Role: ConceptualizationRole: MethodologyRole: SupervisionRole: Writing – review & editing
                Role: ConceptualizationRole: MethodologyRole: Writing – review & editing
                Role: ConceptualizationRole: Methodology
                Role: ConceptualizationRole: Project administrationRole: Writing – review & editing
                Role: ConceptualizationRole: Data curationRole: Formal analysisRole: MethodologyRole: SoftwareRole: SupervisionRole: ValidationRole: VisualizationRole: Writing – original draftRole: Writing – review & editing
                Role: ConceptualizationRole: MethodologyRole: Project administrationRole: SupervisionRole: Writing – original draftRole: Writing – review & editing
                Role: Editor
                Journal
                PLoS One
                PLoS ONE
                plos
                plosone
                PLoS ONE
                Public Library of Science (San Francisco, CA USA )
                1932-6203
                22 April 2019
                2019
                : 14
                : 4
                : e0215571
                Affiliations
                [1 ] NYU Langone Comprehensive Program on Obesity, NYU School of Medicine, New York, New York, United States of America
                [2 ] Department of Cell Biology, NYU School of Medicine, New York, New York, United States of America
                [3 ] Department of Population Health, NYU School of Medicine, New York, New York, United States of America
                [4 ] Department of Pediatrics, NYU School of Medicine, Bellevue Hospital Center, New York, New York, United States of America
                [5 ] Department of Medicine, NYU School of Medicine, New York, New York, United States of America
                [6 ] Department of Radiology, NYU School of Medicine, New York, New York, United States of America
                [7 ] NYU Wagner Graduate School of Public Service, New York, New York, United States of America
                Ben-Gurion University of the Negev, ISRAEL
                Author notes

                Competing Interests: The authors have declared that no competing interests exist.

                Author information
                http://orcid.org/0000-0003-4476-6406
                http://orcid.org/0000-0002-1915-6094
                http://orcid.org/0000-0002-9922-6370
                http://orcid.org/0000-0003-1615-9430
                Article
                PONE-D-18-18646
                10.1371/journal.pone.0215571
                6476510
                31009509
                8eacac05-bc39-4575-a1a6-86de000dab5c
                © 2019 Hammond et al

                This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

                History
                : 21 June 2018
                : 5 April 2019
                Page count
                Figures: 3, Tables: 7, Pages: 18
                Funding
                The authors received no specific funding for this work.
                Categories
                Research Article
                Custom metadata
                The data are electronic health records owned by NYU Langone Health and contain protected health information and personally identifiable information. They were not anonymized for this study. It is a restricted data set and public sharing of these data would violate the HIPPA security rule, however a deidentified data set will be available by request through https://www.icpsr.umich.edu/icpsrweb/. Our code and the subsequent analyses can be viewed on our GitHub page at https://github.com/NYUMedML/ObesityPY.

                Uncategorized
                Uncategorized

                Comments

                Comment on this article