LigASite—a database of biologically relevant binding sites in proteins with known apo-structures

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

Better characterization of binding sites in proteins and the ability to accurately predict their location and energetic properties are major challenges which, if addressed, would have many valuable practical applications. Unfortunately, reliable benchmark datasets of binding sites in proteins are still sorely lacking. Here, we present LigASite (‘LIGand Attachment SITE’), a gold-standard dataset of binding sites in 550 proteins of known structures. LigASite consists exclusively of biologically relevant binding sites in proteins for which at least one apo- and one holo-structure are available. In defining the binding sites for each protein, information from all holo-structures is combined, considering in each case the quaternary structure defined by the PQS server. LigASite is built using simple criteria and is automatically updated as new structures become available in the PDB, thereby guaranteeing optimal data coverage over time. Both a redundant and a culled non-redundant version of the dataset is available at http://www.scmbb.ulb.ac.be/Users/benoit/LigASite. The website interface allows users to search the dataset by PDB identifiers, ligand identifiers, protein names or sequence, and to look for structural matches as defined by the CATH homologous superfamilies. The datasets can be downloaded from the website as Schema-validated XML files or comma-separated flat files.

Related collections

Most cited references 25

Record: found
Abstract: found
Article: not found

PISCES: a protein sequence culling server.

Guoli Wang, Roland L. Dunbrack (2003)

PISCES is a public server for culling sets of protein sequences from the Protein Data Bank (PDB) by sequence identity and structural quality criteria. PISCES can provide lists culled from the entire PDB or from lists of PDB entries or chains provided by the user. The sequence identities are obtained from PSI-BLAST alignments with position-specific substitution matrices derived from the non-redundant protein sequence database. PISCES therefore provides better lists than servers that use BLAST, which is unable to identify many relationships below 40% sequence identity and often overestimates sequence identity by aligning only well-conserved fragments. PDB sequences are updated weekly. PISCES can also cull non-PDB sequences provided by the user as a list of GenBank identifiers, a FASTA format file, or BLAST/PSI-BLAST output.

0 comments Cited 487 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

LIGPLOT: a program to generate schematic diagrams of protein-ligand interactions.

A. Wallace, R. A. Laskowski, J Thornton (1995)

The LIGPLOT program automatically generates schematic 2-D representations of protein-ligand complexes from standard Protein Data Bank file input. The output is a colour, or black-and-white, PostScript file giving a simple and informative representation of the intermolecular interactions and their strengths, including hydrogen bonds, hydrophobic interactions and atom accessibilities. The program is completely general for any ligand and can also be used to show other types of interaction in proteins and nucleic acids. It was designed to facilitate the rapid inspection of many enzyme complexes, but has found many other applications.

0 comments Cited 472 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

Q-SiteFinder: an energy-based method for the prediction of protein-ligand binding sites.

Alasdair Laurie, Richard Jackson (2005)

Identifying the location of ligand binding sites on a protein is of fundamental importance for a range of applications including molecular docking, de novo drug design and structural identification and comparison of functional sites. Here, we describe a new method of ligand binding site prediction called Q-SiteFinder. It uses the interaction energy between the protein and a simple van der Waals probe to locate energetically favourable binding sites. Energetically favourable probe sites are clustered according to their spatial proximity and clusters are then ranked according to the sum of interaction energies for sites within each cluster. There is at least one successful prediction in the top three predicted sites in 90% of proteins tested when using Q-SiteFinder. This success rate is higher than that of a commonly used pocket detection algorithm (Pocket-Finder) which uses geometric criteria. Additionally, Q-SiteFinder is twice as effective as Pocket-Finder in generating predicted sites that map accurately onto ligand coordinates. It also generates predicted sites with the lowest average volumes of the methods examined in this study. Unlike pocket detection, the volumes of the predicted sites appear to show relatively low dependence on protein volume and are similar in volume to the ligands they contain. Restricting the size of the pocket is important for reducing the search space required for docking and de novo drug design or site comparison. The method can be applied in structural genomics studies where protein binding sites remain uncharacterized since the 86% success rate for unbound proteins appears to be only slightly lower than that of ligand-bound proteins. Both Q-SiteFinder and Pocket-Finder have been made available online at http://www.bioinformatics.leeds.ac.uk/qsitefinder and http://www.bioinformatics.leeds.ac.uk/pocketfinder

0 comments Cited 257 times – based on 0 reviews      Review now

Bookmark

All references

Author and article information

Journal

Journal ID (nlm-ta): Nucleic Acids Res

Journal ID (iso-abbrev): Nucleic Acids Res

Journal ID (publisher-id): nar

Journal ID (hwp): nar

Title: Nucleic Acids Research

Publisher: Oxford University Press

ISSN (Print): 0305-1048

ISSN (Electronic): 1362-4962

Publication date (Print): January 2008

Publication date (Electronic): 11 October 2007

Publication date PMC-release: 11 October 2007

Volume: 36

Issue: Database issue , Database issue

Pages: D667-D673

Affiliations

¹Center for Structural Biology and Bioinformatics, Université Libre de Bruxelles (U. L. B.), Bld du Triomphe – CP 263, 1050 Bruxelles, Belgium, ²Biomolecular Structure and Modelling Unit, University College of London, Gower Street, London WC1E 6BT, UK and ³Structural Biology and Biochemistry Program, Hospital for Sick Children, 555 University Avenue, Toronto, Ontario M5G 1X8, Canada

Author notes

* To whom correspondence should be addressed.+44 (0) 20 7679 3890+44 (0) 20 7679 7193 benoit@ 123456biochem.ucl.ac.uk

Present address: Biomolecular Structure and Modelling Unit, University College of London, Gower Street, London WC1E 6BT, UK

Article

DOI: 10.1093/nar/gkm839

PMC ID: 2238865

PubMed ID: 17933762

SO-VID: b53a88b8-b938-4297-a169-33725b295089

License:

This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License ( http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

History

Date received : 13 August 2007

Date revision received : 24 September 2007

Date accepted : 25 September 2007

Comments

Comment on this article

scite_

Cited by 31

See all cited by

Most referenced authors 1,159

See all reference authors

LigASite—a database of biologically relevant binding sites in proteins with known apo-structures

Read this article at

Abstract

Related collections

Genomic Prediction

Most cited references 25

PISCES: a protein sequence culling server.

LIGPLOT: a program to generate schematic diagrams of protein-ligand interactions.

Q-SiteFinder: an energy-based method for the prediction of protein-ligand binding sites.

Author and article information

Journal

Affiliations

Author notes

Article

History

Categories

Comments

Comment on this article

Similar content 160

Cited by 31

Most referenced authors 1,159