93
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: not found

      Escherichia coli K-12: a cooperatively developed annotation snapshot—2005

      research-article

      Read this article at

      ScienceOpenPublisherPMC
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          The goal of this group project has been to coordinate and bring up-to-date information on all genes of Escherichia coli K-12. Annotation of the genome of an organism entails identification of genes, the boundaries of genes in terms of precise start and end sites, and description of the gene products. Known and predicted functions were assigned to each gene product on the basis of experimental evidence or sequence analysis. Since both kinds of evidence are constantly expanding, no annotation is complete at any moment in time. This is a snapshot analysis based on the most recent genome sequences of two E.coli K-12 bacteria. An accurate and up-to-date description of E.coli K-12 genes is of particular importance to the scientific community because experimentally determined properties of its gene products provide fundamental information for annotation of innumerable genes of other organisms. Availability of the complete genome sequence of two K-12 strains allows comparison of their genotypes and mutant status of alleles.

          Related collections

          Most cited references40

          • Record: found
          • Abstract: found
          • Article: not found

          Gene Ontology: tool for the unification of biology

          Genomic sequencing has made it clear that a large fraction of the genes specifying the core biological functions are shared by all eukaryotes. Knowledge of the biological role of such shared proteins in one organism can often be transferred to other organisms. The goal of the Gene Ontology Consortium is to produce a dynamic, controlled vocabulary that can be applied to all eukaryotes even as knowledge of gene and protein roles in cells is accumulating and changing. To this end, three independent ontologies accessible on the World-Wide Web (http://www.geneontology.org) are being constructed: biological process, molecular function and cellular component.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            The Pfam protein families database.

            Pfam is a large collection of protein families and domains. Over the past 2 years the number of families in Pfam has doubled and now stands at 6190 (version 10.0). Methodology improvements for searching the Pfam collection locally as well as via the web are described. Other recent innovations include modelling of discontinuous domains allowing Pfam domain definitions to be closer to those found in structure databases. Pfam is available on the web in the UK (http://www.sanger.ac.uk/Software/Pfam/), the USA (http://pfam.wustl.edu/), France (http://pfam.jouy.inra.fr/) and Sweden (http://Pfam.cgb.ki.se/).
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              The TIGRFAMs database of protein families.

              TIGRFAMs is a collection of manually curated protein families consisting of hidden Markov models (HMMs), multiple sequence alignments, commentary, Gene Ontology (GO) assignments, literature references and pointers to related TIGRFAMs, Pfam and InterPro models. These models are designed to support both automated and manually curated annotation of genomes. TIGRFAMs contains models of full-length proteins and shorter regions at the levels of superfamilies, subfamilies and equivalogs, where equivalogs are sets of homologous proteins conserved with respect to function since their last common ancestor. The scope of each model is set by raising or lowering cutoff scores and choosing members of the seed alignment to group proteins sharing specific function (equivalog) or more general properties. The overall goal is to provide information with maximum utility for the annotation process. TIGRFAMs is thus complementary to Pfam, whose models typically achieve broad coverage across distant homologs but end at the boundaries of conserved structural domains. The database currently contains over 1600 protein families. TIGRFAMs is available for searching or downloading at www.tigr.org/TIGRFAMs.
                Bookmark

                Author and article information

                Journal
                Nucleic Acids Res
                Nucleic Acids Research
                Nucleic Acids Research
                Oxford University Press
                0305-1048
                1362-4962
                2006
                2006
                5 January 2006
                : 34
                : 1
                : 1-9
                Affiliations
                Josephine Bay Paul Center, Marine Biological Laboratory Woods Hole, MA 02543, USA
                1Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Research Organization of Information and Systems Yata 1111, Mishima, Shizuoka 411-8540, Japan
                2Department of Genetics, Candida Genome Database Stanford University School of Medicine Stanford, CA 94305-5120, USA
                3Department of Molecular, Cellular and Developmental Biology, Yale University New Haven, CT 06520-8103, USA
                4Genome Center of Wisconsin 425 Henry MallUniversity of Wisconsin Madison, WI 53706, USA
                5Division of Immunity and Infection, University of Birmingham Edgbaston, Birmingham B15 2TT, UK
                6National Institute for Basic Biology Nishigonaka 38, Myodaiji, Okazaki 444-8585 Aichi, Japan
                7SRI International 333 Ravenswood Avenue, Menlo Park, CA 94025-3493, USA
                8Graduate School of Biological Sciences, Nara Institute of Science and Technology Ikoma, Nara 630-0101, Japan
                9The Institute of Advanced Biosciences, Keio University Tsuruoka, Yamagata 997-0017, Japan
                10Department of Biochemistry and Molecular Biology, The University of Miami Miller School of Medicine Miami, FL 33140, USA
                11Department of Biology, University of York PO Box 373, York YO10 5YW, UK
                12The Wellcome Trust Sanger Institute Genome Campus, Hinxton, Cambridge CB10 1SA, UK
                13Department of Computing Science and Biological Sciences, 2-21 Athabasca Hall University of Alberta Edmonton, Alberta, Canada T6G 2E8
                14Department of Biological Sciences, Purdue University 915 W. State Street, West Lafayette, IN 47907-2054, USA
                Author notes
                *To whom correspondence should be addressed. Tel: +1 508 269 7388; Fax: +1 508 457 4727; Email: mriley@ 123456mbl.edu

                Correspondence may also be addressed to Barry L. Wanner. Tel: +1 765 494 8034; Fax: +1 765 494 0876; Email: blwanner@ 123456purdue.edu

                Article
                10.1093/nar/gkj405
                1325200
                16397293
                82a240fa-0eb2-4582-b6c7-3b6d6cb9c756
                © The Author 2006. Published by Oxford University Press. All rights reserved

                The online version of this article has been published under an open access model. Users are entitled to use, reproduce, disseminate, or display the open access version of this article for non-commercial purposes provided that: the original authorship is properly and fully attributed; the Journal and Oxford University Press are attributed as the original place of publication with the correct citation details given; if an article is subsequently reproduced or disseminated not in its entirety but only in part or as a derivative work this must be clearly indicated. For commercial re-use, please contact journals.permissions@ 123456oxfordjournals.org

                History
                : 05 November 2005
                : 05 December 2005
                : 05 December 2005
                Categories
                Article

                Genetics
                Genetics

                Comments

                Comment on this article