9
views
0
recommends
+1 Recommend
1 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Classification of Complex Diseases using an Improved Binary Cuckoo Search and Conditional Mutual Information Maximization

      research-article
      ,
      Computación y Sistemas
      Centro de Investigación en computación, IPN
      Metaheuristic, CMIM, IBCS, feature selection, classification, SNP

      Read this article at

      SciELO
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Abstract: With the advancement of various computational techniques, there is an exponential growth in genomic data. To analyze such huge amount of data, there is necessity of efficient machine learning techniques. The genomic data usually suffers from “curse of dimensionality” problem, having large number of n (features) and small number of p (samples), which makes classification task very complex. In the present study, a new intelligent hybrid method based on CMIM (conditional mutual information maximization) and novel IBCS (Improved binary cuckoo search) is used for classifying various complex diseases. The CMIM is used to deal with dimensionality problem and IBCS is to select most informative features. Generally, the standard BCS (binary cuckoo search algorithm) is used for feature selection but it has problems like low optimization accuracy and low localized searching. The IBCS overcome the shortcomings of BCS, and improved the classification accuracy by choosing best informative feature subset. The proposed technique applied on five different SNPs dataset which are publically available on NCBI GEO. The proposed model attains high classification accuracy and outperformed other feature selection techniques. The IBCS was also compared with other metaheuristics algorithms such as Binary GA, Binary PSO and Binary ACO, and the result shows that it has better classification accuracy.

          Related collections

          Most cited references21

          • Record: found
          • Abstract: found
          • Article: not found

          Structural variation of chromosomes in autism spectrum disorder.

          Structural variation (copy number variation [CNV] including deletion and duplication, translocation, inversion) of chromosomes has been identified in some individuals with autism spectrum disorder (ASD), but the full etiologic role is unknown. We performed genome-wide assessment for structural abnormalities in 427 unrelated ASD cases via single-nucleotide polymorphism microarrays and karyotyping. With microarrays, we discovered 277 unbalanced CNVs in 44% of ASD families not present in 500 controls (and re-examined in another 1152 controls). Karyotyping detected additional balanced changes. Although most variants were inherited, we found a total of 27 cases with de novo alterations, and in three (11%) of these individuals, two or more new variants were observed. De novo CNVs were found in approximately 7% and approximately 2% of idiopathic families having one child, or two or more ASD siblings, respectively. We also detected 13 loci with recurrent/overlapping CNV in unrelated cases, and at these sites, deletions and duplications affecting the same gene(s) in different individuals and sometimes in asymptomatic carriers were also found. Notwithstanding complexities, our results further implicate the SHANK3-NLGN4-NRXN1 postsynaptic density genes and also identify novel loci at DPP6-DPP10-PCDH9 (synapse complex), ANKRD11, DPYD, PTCHD1, 15q24, among others, for a role in ASD susceptibility. Our most compelling result discovered CNV at 16p11.2 (p = 0.002) (with characteristics of a genomic disorder) at approximately 1% frequency. Some of the ASD regions were also common to mental retardation loci. Structural variants were found in sufficiently high frequency influencing ASD to suggest that cytogenetic and microarray analyses be considered in routine clinical workup.
            • Record: found
            • Abstract: not found
            • Article: not found

            Engineering optimisation by cuckoo search

              • Record: found
              • Abstract: found
              • Article: not found

              A map of human genome sequence variation containing 1.42 million single nucleotide polymorphisms.

              We describe a map of 1.42 million single nucleotide polymorphisms (SNPs) distributed throughout the human genome, providing an average density on available sequence of one SNP every 1.9 kilobases. These SNPs were primarily discovered by two projects: The SNP Consortium and the analysis of clone overlaps by the International Human Genome Sequencing Consortium. The map integrates all publicly available SNPs with described genes and other genomic features. We estimate that 60,000 SNPs fall within exon (coding and untranslated regions), and 85% of exons are within 5 kb of the nearest SNP. Nucleotide diversity varies greatly across the genome, in a manner broadly consistent with a standard population genetic model of human history. This high-density SNP map provides a public resource for defining haplotype variation across the genome, and should help to identify biomedically important genes for diagnosis and therapy.

                Author and article information

                Journal
                cys
                Computación y Sistemas
                Comp. y Sist.
                Centro de Investigación en computación, IPN (México, DF, Mexico )
                1405-5546
                2007-9737
                September 2020
                : 24
                : 3
                : 1121-1129
                Affiliations
                [1] orgnameGuru Jambheshwar University of Science and Technology India dharmindia24@ 123456gmail.com
                Article
                S1405-55462020000301121 S1405-5546(20)02400301121
                10.13053/cys-24-3-3354
                54c2fd08-c892-45d1-91a3-59813fe32745

                This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

                History
                : 22 May 2020
                : 02 April 2020
                Page count
                Figures: 0, Tables: 0, Equations: 0, References: 23, Pages: 9
                Product

                SciELO Mexico

                Categories
                Articles

                Metaheuristic,SNP,classification,feature selection,IBCS,CMIM
                Metaheuristic, SNP, classification, feature selection, IBCS, CMIM

                Comments

                Comment on this article

                Related Documents Log