4
views
0
recommends
+1 Recommend
1 collections
    0
    shares

          The flagship journal of the Society for Endocrinology. Learn more

      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Optimizing the Dutch newborn screening for congenital hypothyroidism by incorporating amino acids and acylcarnitines in a machine learning-based model

      research-article

      Read this article at

      ScienceOpenPublisherPMC
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Objective

          Congenital hypothyroidism (CH) is an inborn thyroid hormone (TH) deficiency mostly caused by thyroidal (primary CH) or hypothalamic/pituitary (central CH) disturbances. Most CH newborn screening (NBS) programs are thyroid-stimulating-hormone (TSH) based, thereby only detecting primary CH. The Dutch NBS is based on measuring total thyroxine (T4) from dried blood spots, aiming to detect primary and central CH at the cost of more false-positive referrals (FPRs) (positive predictive value (PPV) of 21% in 2007–2017). An artificial PPV of 26% was yielded when using a machine learning-based model on the adjusted dataset described based on the Dutch CH NBS. Recently, amino acids (AAs) and acylcarnitines (ACs) have been shown to be associated with TH concentration. We therefore aimed to investigate whether AAs and ACs measured during NBS can contribute to better performance of the CH screening in the Netherlands by using a revised machine learning-based model.

          Methods

          Dutch NBS data between 2007 and 2017 (CH screening results, AAs and ACs) from 1079 FPRs, 515 newborns with primary (431) and central CH (84) and data from 1842 healthy controls were used. A random forest model including these data was developed.

          Results

          The random forest model with an artificial sensitivity of 100% yielded a PPV of 48% and AUROC of 0.99. Besides T4 and TSH, tyrosine, and succinylacetone were the main parameters contributing to the model’s performance.

          Conclusions

          The PPV improved significantly (26–48%) by adding several AAs and ACs to our machine learning-based model, suggesting that adding these parameters benefits the current algorithm.

          Related collections

          Most cited references27

          • Record: found
          • Abstract: not found
          • Article: not found

          Random Forests

            • Record: found
            • Abstract: found
            • Article: not found

            SMOTE: Synthetic Minority Over-sampling Technique

            An approach to the construction of classifiers from imbalanced datasets is described. A dataset is imbalanced if the classification categories are not approximately equally represented. Often real-world data sets are predominately composed of ``normal'' examples with only a small percentage of ``abnormal'' or ``interesting'' examples. It is also the case that the cost of misclassifying an abnormal (interesting) example as a normal example is often much higher than the cost of the reverse error. Under-sampling of the majority (normal) class has been proposed as a good means of increasing the sensitivity of a classifier to the minority class. This paper shows that a combination of our method of over-sampling the minority (abnormal) class and under-sampling the majority (normal) class can achieve better classifier performance (in ROC space) than only under-sampling the majority class. This paper also shows that a combination of our method of over-sampling the minority class and under-sampling the majority class can achieve better classifier performance (in ROC space) than varying the loss ratios in Ripper or class priors in Naive Bayes. Our method of over-sampling the minority class involves creating synthetic minority class examples. Experiments are performed using C4.5, Ripper and a Naive Bayes classifier. The method is evaluated using the area under the Receiver Operating Characteristic curve (AUC) and the ROC convex hull strategy.
              • Record: found
              • Abstract: found
              • Article: not found

              MissForest--non-parametric missing value imputation for mixed-type data.

              Modern data acquisition based on high-throughput technology is often facing the problem of missing data. Algorithms commonly used in the analysis of such large-scale data often depend on a complete set. Missing value imputation offers a solution to this problem. However, the majority of available imputation methods are restricted to one type of variable only: continuous or categorical. For mixed-type data, the different types are usually handled separately. Therefore, these methods ignore possible relations between variable types. We propose a non-parametric method which can cope with different types of variables simultaneously. We compare several state of the art methods for the imputation of missing values. We propose and evaluate an iterative imputation method (missForest) based on a random forest. By averaging over many unpruned classification or regression trees, random forest intrinsically constitutes a multiple imputation scheme. Using the built-in out-of-bag error estimates of random forest, we are able to estimate the imputation error without the need of a test set. Evaluation is performed on multiple datasets coming from a diverse selection of biological fields with artificially introduced missing values ranging from 10% to 30%. We show that missForest can successfully handle missing values, particularly in datasets including different types of variables. In our comparative study, missForest outperforms other methods of imputation especially in data settings where complex interactions and non-linear relations are suspected. The out-of-bag imputation error estimates of missForest prove to be adequate in all settings. Additionally, missForest exhibits attractive computational efficiency and can cope with high-dimensional data. The package missForest is freely available from http://stat.ethz.ch/CRAN/. stekhoven@stat.math.ethz.ch; buhlmann@stat.math.ethz.ch

                Author and article information

                Journal
                Eur Thyroid J
                Eur Thyroid J
                ETJ
                European Thyroid Journal
                Bioscientifica Ltd (Bristol )
                2235-0640
                2235-0802
                11 October 2023
                11 October 2023
                01 December 2023
                : 12
                : 6
                : e230141
                Affiliations
                [1 ]Department of Laboratory Medicine , Endocrine Laboratory, Amsterdam UMC location Vrije Universiteit Amsterdam, Boelelaan, Amsterdam, The Netherlands
                [2 ]Amsterdam Gastroenterology , Endocrinology and Metabolism, Amsterdam, The Netherlands
                [3 ]Department of Laboratory Medicine , Endocrine Laboratory, Amsterdam UMC location University of Amsterdam, Meibergdreef, Amsterdam, The Netherlands
                [4 ]Department of Computer Science , Vrije Universiteit, Boelelaan, Amsterdam, The Netherlands
                [5 ]Reference Laboratory Neonatal Screening , Center for Health protection, National Institute for Public Health and the Environment, Bilthoven, The Netherlands
                [6 ]Department of Laboratory Medicine , Laboratory Specialized Diagnostics & Research, Amsterdam UMC, University of Amsterdam, Meibergdreef, Amsterdam, The Netherlands
                [7 ]Amsterdam Public Health , Amsterdam, The Netherlands
                [8 ]Department of Endocrinology and Metabolism , Amsterdam UMC location University of Amsterdam, Meibergdreef, Amsterdam, The Netherlands
                [9 ]TNO - Child Health , Sylviusweg, Leiden, The Netherlands
                [10 ]Department of Paediatric Endocrinology , Emma Children’s Hospital, Amsterdam UMC, University of Amsterdam, Meibergdreef, Amsterdam, The Netherlands
                [11 ]Amsterdam Reproduction & Development Research Institute , Amsterdam, The Netherlands
                [12 ]Department of Pediatrics , Division of Metabolic Disorders, Emma Children’s Hospital, Amsterdam UMC, University of Amsterdam, Meibergdreef, Amsterdam, The Netherlands
                [13 ]Department of Laboratory Medicine , Amsterdam UMC, Vrije Universiteit, Boelelaan, Amsterdam, The Netherlands
                [14 ]Department of Laboratory Medicine , Amsterdam UMC, University of Amsterdam, Meibergdreef, Amsterdam, The Netherlands
                Author notes
                Correspondence should be addressed to A Boelen: a.boelen@ 123456amsterdamumc.nl
                Author information
                http://orcid.org/0000-0003-2359-2865
                http://orcid.org/0000-0001-6466-8497
                http://orcid.org/0000-0002-6712-9955
                http://orcid.org/0000-0002-4994-2918
                Article
                ETJ-23-0141
                10.1530/ETJ-23-0141
                10692681
                37855424
                bca88ee8-8a3b-42d7-b6fc-e275c478ca42
                © the author(s)

                This work is licensed under a Creative Commons Attribution 4.0 International License.

                History
                : 20 July 2023
                : 11 October 2023
                Categories
                Research

                congenital hypothyroidism,newborn screening,amino acids,acylcarnitines,machine learning based

                Comments

                Comment on this article

                Related Documents Log