51
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: not found
      Is Open Access

      UniProt: a worldwide hub of protein knowledge.

      UniProt Consortium
      Nucleic acids research
      Oxford University Press (OUP)

      Read this article at

      ScienceOpenPublisherPMC
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          The UniProt Knowledgebase is a collection of sequences and annotations for over 120 million proteins across all branches of life. Detailed annotations extracted from the literature by expert curators have been collected for over half a million of these proteins. These annotations are supplemented by annotations provided by rule based automated systems, and those imported from other resources. In this article we describe significant updates that we have made over the last 2 years to the resource. We have greatly expanded the number of Reference Proteomes that we provide and in particular we have focussed on improving the number of viral Reference Proteomes. The UniProt website has been augmented with new data visualizations for the subcellular localization of proteins as well as their structure and interactions. UniProt resources are available under a CC-BY (4.0) license via the web at https://www.uniprot.org/.

          Related collections

          Most cited references31

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          UniProt: the universal protein knowledgebase

          (2016)
          The UniProt knowledgebase is a large resource of protein sequences and associated detailed annotation. The database contains over 60 million sequences, of which over half a million sequences have been curated by experts who critically review experimental and predicted data for each protein. The remainder are automatically annotated based on rule systems that rely on the expert curated knowledge. Since our last update in 2014, we have more than doubled the number of reference proteomes to 5631, giving a greater coverage of taxonomic diversity. We implemented a pipeline to remove redundant highly similar proteomes that were causing excessive redundancy in UniProt. The initial run of this pipeline reduced the number of sequences in UniProt by 47 million. For our users interested in the accessory proteomes, we have made available sets of pan proteome sequences that cover the diversity of sequences for each species that is found in its strains and sub-strains. To help interpretation of genomic variants, we provide tracks of detailed protein information for the major genome browsers. We provide a SPARQL endpoint that allows complex queries of the more than 22 billion triples of data in UniProt (http://sparql.uniprot.org/). UniProt resources can be accessed via the website at http://www.uniprot.org/.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            A METTL3-METTL14 complex mediates mammalian nuclear RNA N6-adenosine methylation

            N 6-methyladenosine (m6A) is the most prevalent and reversible internal modification in mammalian messenger and non-coding RNAs. We report here that human METTL14 catalyzes m6A RNA methylation. Together with METTL3, the only previously known m6A methyltransferase, these two proteins form a stable heterodimer core complex of METTL3-14 that functions in cellular m6A deposition on mammalian nuclear RNAs. WTAP, a mammalian splicing factor, can interact with this complex and affect this methylation.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              Ensembl 2018

              Abstract The Ensembl project has been aggregating, processing, integrating and redistributing genomic datasets since the initial releases of the draft human genome, with the aim of accelerating genomics research through rapid open distribution of public data. Large amounts of raw data are thus transformed into knowledge, which is made available via a multitude of channels, in particular our browser (http://www.ensembl.org). Over time, we have expanded in multiple directions. First, our resources describe multiple fields of genomics, in particular gene annotation, comparative genomics, genetics and epigenomics. Second, we cover a growing number of genome assemblies; Ensembl Release 90 contains exactly 100. Third, our databases feed simultaneously into an array of services designed around different use cases, ranging from quick browsing to genome-wide bioinformatic analysis. We present here the latest developments of the Ensembl project, with a focus on managing an increasing number of assemblies, supporting efforts in genome interpretation and improving our browser.
                Bookmark

                Author and article information

                Journal
                Nucleic Acids Res
                Nucleic acids research
                Oxford University Press (OUP)
                1362-4962
                0305-1048
                January 08 2019
                : 47
                : D1
                Article
                5160987
                10.1093/nar/gky1049
                6323992
                30395287
                42fd9466-139a-4a99-a103-bda97c18ecf3

                Comments

                Comment on this article