9
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Using transfer learning and dimensionality reduction techniques to improve generalisability of machine-learning predictions of mosquito ages from mid-infrared spectra

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background

          Old mosquitoes are more likely to transmit malaria than young ones. Therefore, accurate prediction of mosquito population age can drastically improve the evaluation of mosquito-targeted interventions. However, standard methods for age-grading mosquitoes are laborious and costly. We have shown that Mid-infrared spectroscopy (MIRS) can be used to detect age-specific patterns in mosquito cuticles and thus can be used to train age-grading machine learning models. However, these models tend to transfer poorly across populations. Here, we investigate whether applying dimensionality reduction and transfer learning to MIRS data can improve the transferability of MIRS-based predictions for mosquito ages.

          Methods

          We reared adults of the malaria vector Anopheles arabiensis in two insectaries. The heads and thoraces of female mosquitoes were scanned using an attenuated total reflection-Fourier transform infrared spectrometer, which were grouped into two different age classes. The dimensionality of the spectra data was reduced using unsupervised principal component analysis or t-distributed stochastic neighbour embedding, and then used to train deep learning and standard machine learning classifiers. Transfer learning was also evaluated to improve transferability of the models when predicting mosquito age classes from new populations.

          Results

          Model accuracies for predicting the age of mosquitoes from the same population as the training samples reached 99% for deep learning and 92% for standard machine learning. However, these models did not generalise to a different population, achieving only 46% and 48% accuracy for deep learning and standard machine learning, respectively. Dimensionality reduction did not improve model generalizability but reduced computational time. Transfer learning by updating pre-trained models with 2% of mosquitoes from the alternate population improved performance to ~ 98% accuracy for predicting mosquito age classes in the alternative population.

          Conclusion

          Combining dimensionality reduction and transfer learning can reduce computational costs and improve the transferability of both deep learning and standard machine learning models for predicting the age of mosquitoes. Future studies should investigate the optimal quantities and diversity of training data necessary for transfer learning and the implications for broader generalisability to unseen datasets.

          Related collections

          Most cited references22

          • Record: found
          • Abstract: not found
          • Article: not found

          Principal component analysis

            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Scikit-learn: machine learning in Python

              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              The effect of malaria control on Plasmodium falciparum in Africa between 2000 and 2015

              Since the year 2000, a concerted campaign against malaria has led to unprecedented levels of intervention coverage across sub-Saharan Africa. Understanding the effect of this control effort is vital to inform future control planning. However, the effect of malaria interventions across the varied epidemiological settings of Africa remains poorly understood owing to the absence of reliable surveillance data and the simplistic approaches underlying current disease estimates. Here we link a large database of malaria field surveys with detailed reconstructions of changing intervention coverage to directly evaluate trends from 2000 to 2015 and quantify the attributable effect of malaria disease control efforts. We found that Plasmodium falciparum infection prevalence in endemic Africa halved and the incidence of clinical disease fell by 40% between 2000 and 2015. We estimate that interventions have averted 663 (542–753 credible interval) million clinical cases since 2000. Insecticide-treated nets, the most widespread intervention, were by far the largest contributor (68% of cases averted). Although still below target levels, current malaria interventions have substantially reduced malaria disease incidence across the continent. Increasing access to these interventions, and maintaining their effectiveness in the face of insecticide and drug resistance, should form a cornerstone of post-2015 control strategies.
                Bookmark

                Author and article information

                Contributors
                emwanga@ihi.or.tz
                Journal
                BMC Bioinformatics
                BMC Bioinformatics
                BMC Bioinformatics
                BioMed Central (London )
                1471-2105
                9 January 2023
                9 January 2023
                2023
                : 24
                : 11
                Affiliations
                [1 ]GRID grid.414543.3, ISNI 0000 0000 9144 642X, Environmental Health and Ecological Sciences Department, , Ifakara Health Institute, ; Morogoro, Tanzania
                [2 ]GRID grid.8756.c, ISNI 0000 0001 2193 314X, School of Biodiversity, One Health, and Veterinary Medicine, , University of Glasgow, ; Glasgow, G12 8QQ UK
                [3 ]GRID grid.8756.c, ISNI 0000 0001 2193 314X, School of Computing Science, , University of Glasgow, ; Glasgow, G12 8QQ UK
                [4 ]GRID grid.8756.c, ISNI 0000 0001 2193 314X, School of Chemistry, , University of Glasgow, ; Glasgow, G12 8QQ UK
                [5 ]Institute for Disease Modelling, Bellevue, WA 98005 USA
                [6 ]GRID grid.11951.3d, ISNI 0000 0004 1937 1135, School of Public Health, , University of Witwatersrand, ; Johannesburg, South Africa
                Article
                5128
                10.1186/s12859-022-05128-5
                9830685
                36624386
                551eb012-8bde-4756-9ded-4cde1c7a6b25
                © The Author(s) 2023

                Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

                History
                : 26 July 2022
                : 26 December 2022
                Funding
                Funded by: FundRef http://dx.doi.org/10.13039/100010269, Wellcome Trust;
                Award ID: 214643/Z/18/Z
                Award Recipient :
                Funded by: FundRef http://dx.doi.org/10.13039/501100000265, Medical Research Council;
                Award ID: MR/P025501/1
                Award ID: MR/P025501/1
                Award ID: MR/P025501/1
                Award ID: MR/P025501/1
                Award ID: MR/P025501/1
                Award ID: MR/P025501/1
                Award Recipient :
                Categories
                Research
                Custom metadata
                © The Author(s) 2023

                Bioinformatics & Computational biology
                anopheles arabiensis,convolutional neural network,standard machine learning,generalisability,dimensionality reduction,transfer learning

                Comments

                Comment on this article