1,079
views
0
recommends
+1 Recommend
0 collections
    25
    shares
      • Record: found
      • Abstract: found
      • Article: not found

      The Subsystems Approach to Genome Annotation and its Use in the Project to Annotate 1000 Genomes

      research-article
      1 , 16 , 10 , 3 , 17 , 12 , 13 , 3 , 12 , 1 , 7 , 8 , 1 , 18 , 2 , 1 , 2 , 3 , 14 , 15 , 5 , 17 , 3 , 12 , 11 , 3 , 3 , 3 , 3 , 9 , 12 , 1 , 8 , 17 , 1 , 6 , 4 , 17 , 2 , 12 , 17 , 1 , 8 , 1 , 1 , *
      Nucleic Acids Research
      Oxford University Press
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          The release of the 1000 th complete microbial genome will occur in the next two to three years. In anticipation of this milestone, the Fellowship for Interpretation of Genomes (FIG) launched the Project to Annotate 1000 Genomes. The project is built around the principle that the key to improved accuracy in high-throughput annotation technology is to have experts annotate single subsystems over the complete collection of genomes, rather than having an annotation expert attempt to annotate all of the genes in a single genome. Using the subsystems approach, all of the genes implementing the subsystem are analyzed by an expert in that subsystem. An annotation environment was created where populated subsystems are curated and projected to new genomes. A portable notion of a populated subsystem was defined, and tools developed for exchanging and curating these objects. Tools were also developed to resolve conflicts between populated subsystems. The SEED is the first annotation environment that supports this model of annotation. Here, we describe the subsystem approach, and offer the first release of our growing library of populated subsystems. The initial release of data includes 180 177 distinct proteins with 2133 distinct functional roles. This data comes from 173 subsystems and 383 different organisms.

          Related collections

          Most cited references33

          • Record: found
          • Abstract: found
          • Article: not found

          Gene Ontology: tool for the unification of biology

          Genomic sequencing has made it clear that a large fraction of the genes specifying the core biological functions are shared by all eukaryotes. Knowledge of the biological role of such shared proteins in one organism can often be transferred to other organisms. The goal of the Gene Ontology Consortium is to produce a dynamic, controlled vocabulary that can be applied to all eukaryotes even as knowledge of gene and protein roles in cells is accumulating and changing. To this end, three independent ontologies accessible on the World-Wide Web (http://www.geneontology.org) are being constructed: biological process, molecular function and cellular component.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Whole-genome random sequencing and assembly of Haemophilus influenzae Rd.

            An approach for genome analysis based on sequencing and assembly of unselected pieces of DNA from the whole chromosome has been applied to obtain the complete nucleotide sequence (1,830,137 base pairs) of the genome from the bacterium Haemophilus influenzae Rd. This approach eliminates the need for initial mapping efforts and is therefore applicable to the vast array of microbial species for which genome maps are unavailable. The H. influenzae Rd genome sequence (Genome Sequence DataBase accession number L42023) represents the only complete genome sequence from a free-living organism.
              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              A database for post-genome analysis.

                Bookmark

                Author and article information

                Journal
                Nucleic Acids Res
                Nucleic Acids Research
                Nucleic Acids Research
                Oxford University Press
                0305-1048
                1362-4962
                2005
                2005
                7 October 2005
                : 33
                : 17
                : 5691-5702
                Affiliations
                1Fellowship for Interpretation of Genomes 15W155 81st Street, Burr Ridge, IL 60527, USA
                2Mathematics and Computer Science Division, Argonne National Laboratory Argonne, IL 60439, USA
                3Center for Biotechnology, Institute for Genome Research, Bielefeld University 33594 Bielefeld, Germany, USA
                4International NRW Graduate School in Bioinformatics & Genome Research, Institute for Genome Research, Bielefeld University 33594 Bielefeld, Germany, USA
                5Emerson Hall, University of Florida PO Box 14425, Gainesville, FL 32604, USA
                6Institute for Information Transmission Problems, Russian Academy of Sciences Moscow, Russia
                7Center for Microbial Sciences, San Diego State University San Diego, CA 92813, USA
                8The Burnham Institute San Diego CA 92037, USA
                9Department of Microbiology, University of Illinois at Urbana-Champaign Urbana, IL 61801
                10Computer Science Dept, Middle Tennessee State University Murfreesboro, TN 37132, USA
                11Danish Genome Institute Gustav Wieds vej 10 C, DK-8000 Aarhus C, Denmark
                12Computation Institute, University of Chicago Chicago, IL 60637, USA
                13Departments of Microbiology and Cell Science, University of Florida Gainesville, FL 32611, USA
                14Department of Horticultural Science, University of Florida Gainesville, FL 32611, USA
                15Department of Chemistry, Portland State University Portland, OR 97207, USA
                16Department of Chemistry and Chemical Biology, Cornell University Ithaca, NY14853, USA
                17University of California San Diego, CA 92093, USA
                18Cleveland BioLabs, Inc. Cleveland, OH 44106, USA
                Author notes
                *To whom correspondence should be addressed. Tel: +1 630 325 4178; Fax: +1 630 325 4179; Email: Veronika@ 123456theFIG.info
                Article
                10.1093/nar/gki866
                1251668
                16214803
                4c33397b-9e53-4a60-bd20-5acaca279818
                © The Author 2005. Published by Oxford University Press. All rights reserved

                The online version of this article has been published under an open access model. Users are entitled to use, reproduce, disseminate, or display the open access version of this article for non-commercial purposes provided that: the original authorship is properly and fully attributed; the Journal and Oxford University Press are attributed as the original place of publication with the correct citation details given; if an article is subsequently reproduced or disseminated not in its entirety but only in part or as a derivative work this must be clearly indicated. For commercial re-use, please contact journals.permissions@ 123456oxfordjournals.org

                History
                : 09 June 2005
                : 08 September 2005
                : 08 September 2005
                Categories
                Article

                Genetics
                Genetics

                Comments

                Comment on this article