165
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Annotating the human genome with Disease Ontology

      research-article
        1 , 2 , 3 , 2 , 2 , 5 , 6 , 2 , 4 ,
      BMC Genomics
      BioMed Central
      The 2008 International Conference on Bioinformatics & Computational Biology (BIOCOMP'08)
      14–17 July 2008

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background

          The human genome has been extensively annotated with Gene Ontology for biological functions, but minimally computationally annotated for diseases.

          Results

          We used the Unified Medical Language System (UMLS) MetaMap Transfer tool (MMTx) to discover gene-disease relationships from the GeneRIF database. We utilized a comprehensive subset of UMLS, which is disease-focused and structured as a directed acyclic graph (the Disease Ontology), to filter and interpret results from MMTx. The results were validated against the Homayouni gene collection using recall and precision measurements. We compared our results with the widely used Online Mendelian Inheritance in Man (OMIM) annotations.

          Conclusion

          The validation data set suggests a 91% recall rate and 97% precision rate of disease annotation using GeneRIF, in contrast with a 22% recall and 98% precision using OMIM. Our thesaurus-based approach allows for comparisons to be made between disease containing databases and allows for increased accuracy in disease identification through synonym matching. The much higher recall rate of our approach demonstrates that annotating human genome with Disease Ontology and GeneRIF for diseases dramatically increases the coverage of the disease annotation of human genome.

          Related collections

          Most cited references19

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          Emergence of scaling in random networks

          Systems as diverse as genetic networks or the world wide web are best described as networks with complex topology. A common property of many large networks is that the vertex connectivities follow a scale-free power-law distribution. This feature is found to be a consequence of the two generic mechanisms that networks expand continuously by the addition of new vertices, and new vertices attach preferentially to already well connected sites. A model based on these two ingredients reproduces the observed stationary scale-free distributions, indicating that the development of large networks is governed by robust self-organizing phenomena that go beyond the particulars of the individual systems.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            DAVID: Database for Annotation, Visualization, and Integrated Discovery.

            Functional annotation of differentially expressed genes is a necessary and critical step in the analysis of microarray data. The distributed nature of biological knowledge frequently requires researchers to navigate through numerous web-accessible databases gathering information one gene at a time. A more judicious approach is to provide query-based access to an integrated database that disseminates biologically rich information across large datasets and displays graphic summaries of functional information. Database for Annotation, Visualization, and Integrated Discovery (DAVID; http://www.david.niaid.nih.gov) addresses this need via four web-based analysis modules: 1) Annotation Tool - rapidly appends descriptive data from several public databases to lists of genes; 2) GoCharts - assigns genes to Gene Ontology functional categories based on user selected classifications and term specificity level; 3) KeggCharts - assigns genes to KEGG metabolic processes and enables users to view genes in the context of biochemical pathway maps; and 4) DomainCharts - groups genes according to PFAM conserved protein domains. Analysis results and graphical displays remain dynamically linked to primary data and external data repositories, thereby furnishing in-depth as well as broad-based data coverage. The functionality provided by DAVID accelerates the analysis of genome-scale datasets by facilitating the transition from data collection to biological meaning.
              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              The genetic association database.

                Bookmark

                Author and article information

                Conference
                BMC Genomics
                BMC Genomics
                BioMed Central
                1471-2164
                2009
                7 July 2009
                : 10
                : Suppl 1
                : S6
                Affiliations
                [1 ]Department of Microbiology, University of Alabama at Birmingham, Birmingham, AL 35294, USA
                [2 ]The Biomedical Informatics Center, Robert H. Lurie Comprehensive Cancer Center, Northwestern University, Chicago, IL 60611, USA
                [3 ]Department of Preventive Medicine, Robert H. Lurie Comprehensive Cancer Center, Northwestern University, Chicago, IL 60611, USA
                [4 ]The Center for Genetic Medicine, Robert H. Lurie Comprehensive Cancer Center, Northwestern University, Chicago, IL 60611, USA
                [5 ]Program in Gene Function and Expression, University of Massachusetts Medical School, Worcester, MA 01605, USA
                [6 ]Division of Clinical Immunology and Rheumatology, University of Alabama at Birmingham, Birmingham, AL 35294, USA
                Article
                1471-2164-10-S1-S6
                10.1186/1471-2164-10-S1-S6
                2709267
                19594883
                e60c443d-2ca9-4fb1-b5a6-4d2f8bcd3d89
                Copyright © 2009 Osborne et al; licensee BioMed Central Ltd.

                This is an open access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

                The 2008 International Conference on Bioinformatics & Computational Biology (BIOCOMP'08)
                Las Vegas, NV, USA
                14–17 July 2008
                History
                Categories
                Research

                Genetics
                Genetics

                Comments

                Comment on this article