MMsINC: a large-scale chemoinformatics database

Abstract

MMsINC (http://mms.dsfarm.unipd.it/MMsINC/search) is a database of non-redundant, richly annotated and biomedically relevant chemical structures. A primary goal of MMsINC is to guarantee the highest quality and the uniqueness of each entry. MMsINC then adds value to these entries by including the analysis of crucial chemical properties, such as ionization and tautomerization processes, and the in silico prediction of 24 important molecular properties in the biochemical profile of each structure. MMsINC is consequently a natural input for different chemoinformatics and virtual screening applications. In addition, MMsINC supports various types of queries, including substructure queries and the novel ‘molecular scissoring’ query. MMsINC is interfaced with other primary data collectors, such as PubChem, Protein Data Bank (PDB), the Food and Drug Administration database of approved drugs and ZINC.

Most cited references 12

Database resources of the National Center for Biotechnology Information

David Wheeler, Tanya Barrett, Dennis A Benson … (2008)

In addition to maintaining the GenBank(R) nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides analysis and retrieval resources for the data in GenBank and other biological data available through NCBI's web site. NCBI resources include Entrez, the Entrez Programming Utilities, My NCBI, PubMed, PubMed Central, Entrez Gene, the NCBI Taxonomy Browser, BLAST, BLAST Link, Electronic PCR, OrfFinder, Spidey, Splign, RefSeq, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, Cancer Chromosomes, Entrez Genome, Genome Project and related tools, the Trace, Assembly, and Short Read Archives, the Map Viewer, Model Maker, Evidence Viewer, Clusters of Orthologous Groups, Influenza Viral Resources, HIV-1/Human Protein Interaction Database, Gene Expression Omnibus, Entrez Probe, GENSAT, Database of Genotype and Phenotype, Online Mendelian Inheritance in Man, Online Mendelian Inheritance in Animals, the Molecular Modeling Database, the Conserved Domain Database, the Conserved Domain Architecture Retrieval Tool and the PubChem suite of small molecule databases. Augmenting the web applications are custom implementations of the BLAST program optimized to search specialized data sets. These resources can be accessed through the NCBI home page at www.ncbi.nlm.nih.gov.

Property distribution of drug-related chemical databases.

T Oprea (2000)

The process of compound selection and prioritization is crucial for both combinatorial chemistry (CBC) and high throughput screening (HTS). Compound libraries have to be screened for unwanted chemical structures, as well as for unwanted chemical properties. Property extrema can be eliminated by using property filters, in accordance with their actual distribution. Property distribution was examined in the following compound databases: MACCS-II Drug Data Report (MDDR), Current Patents Fast-alert, Comprehensive Medicinal Chemistry, Physician Desk Reference, New Chemical Entities, and the Available Chemical Directory (ACD). The ACDF and MDDRF subsets were created by removing reactive functionalities from the ACD and MDDR databases, respectively. The ACDF subset was further filtered by keeping only molecules with a 'drug-like' score [Ajay et al., J. Med. Chem., 41 (1998) 3314; Sadowski and Kubinyi, J. Med. Chem., 41 (1998) 3325] below 0.8. The following properties were examined: molecular weight (MW), the calculated octanol/water partition coefficient (CLOGP), the number of rotatable (RTB) and rigid bonds (RGB), the number of rings (RNG), and the number of hydrogen bond donors (HDO) and acceptors (HAC). Of these, MW and CLOGP follow a Gaussian distribution, whereas all other descriptors have an asymmetric (truncated Gaussian) distribution. Four out of five compounds in ACDF and MDDRF pass the 'rule of 5' test, a probability scheme that estimates oral absorption proposed by Lipinski et al. [Adv. Drug Deliv. Rev., 23 (1997) 3]. Because property distributions of HDO, HAC, MW and CLOGP (used in the 'rule of 5' test) do not differ significantly between these datasets, the 'rule of 5' does not distinguish 'drugs' from 'nondrugs'. Therefore, Pareto analyses were performed to examine skewed distributions in all compound collections. 
Seventy percent of the 'drug-like' compounds were found between the following limits: 0 [...] ≥ 3, and RGB ≥ 18, and only 24.73% of MDDRF compounds have 0 ≤ RNG ≤ 2 rings, and RGB ≤ 17. The probability of identifying 'drug-like' structures increases with molecular complexity.
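The 'rule of 5' property filter mentioned in this abstract can be sketched as follows. This is a minimal illustration, not the method used by Oprea or by MMsINC: the cut-offs (MW ≤ 500, CLOGP ≤ 5, HDO ≤ 5, HAC ≤ 10) and the convention of tolerating at most one violation are the values commonly quoted from Lipinski et al. and are assumed here, since the abstract does not state them. The `Descriptors` class, the function names and the sample values are hypothetical, and the descriptors are taken as precomputed inputs.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass(frozen=True)
class Descriptors:
    """Precomputed descriptors for one compound (field names follow the abstract)."""
    mw: float     # molecular weight (MW), in Da
    clogp: float  # calculated octanol/water partition coefficient (CLOGP)
    hdo: int      # number of hydrogen bond donors (HDO)
    hac: int      # number of hydrogen bond acceptors (HAC)


# Assumed upper limits, as commonly quoted from Lipinski et al. (1997);
# they are not given in the abstract above.
RULE_OF_5_LIMITS = {"mw": 500.0, "clogp": 5.0, "hdo": 5, "hac": 10}


def rule_of_5_violations(d: Descriptors) -> int:
    """Count how many of the four descriptors exceed their limit."""
    return sum(getattr(d, name) > limit for name, limit in RULE_OF_5_LIMITS.items())


def passes_rule_of_5(d: Descriptors, max_violations: int = 1) -> bool:
    """A compound passes if it has at most `max_violations` violations (assumed convention)."""
    return rule_of_5_violations(d) <= max_violations


def pass_fraction(compounds: Iterable[Descriptors]) -> float:
    """Fraction of a collection that passes the filter (0.0 for an empty collection)."""
    items = list(compounds)
    if not items:
        return 0.0
    return sum(passes_rule_of_5(c) for c in items) / len(items)


# Hypothetical descriptor values, for illustration only.
small = Descriptors(mw=180.2, clogp=1.2, hdo=1, hac=4)
large = Descriptors(mw=650.0, clogp=6.5, hdo=2, hac=8)
library = [small, large]
```

Applied to a whole collection, `pass_fraction(library)` gives the kind of summary figure the abstract reports for ACDF and MDDRF ("four out of five compounds"). As the abstract notes, such a filter removes property extrema but does not by itself separate 'drugs' from 'nondrugs'.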

Recent developments of the chemistry development kit (CDK) - an open-source java library for chemo- and bioinformatics.

Christoph Steinbeck, Stefan Kuhn, Matteo Floris … (2006)

The Chemistry Development Kit (CDK) provides methods for common tasks in molecular informatics, including 2D and 3D rendering of chemical structures, I/O routines, SMILES parsing and generation, ring searches, isomorphism checking, structure diagram generation, etc. Implemented in Java, it is used both for server-side computational services, possibly equipped with a web interface, as well as for applications and client-side applets. This article introduces the CDK's new QSAR capabilities and the recently introduced interface to statistical software.

Author and article information

Journal

Journal ID (nlm-ta): Nucleic Acids Res

Journal ID (iso-abbrev): Nucleic Acids Res

Journal ID (publisher-id): nar

Journal ID (hwp): nar

Title: Nucleic Acids Research

Publisher: Oxford University Press

ISSN (Print): 0305-1048

ISSN (Electronic): 1362-4962

Publication date Collection: January 2009

Publication date (Print): January 2009

Publication date (Electronic): 17 October 2008

Publication date PMC-release: 17 October 2008

Volume: 37

Issue: Database issue

Pages: D284-D290

Affiliations

¹CRS4 – Bioinformatics Laboratory, Parco Sardegna Ricerche, Pula (CA) 09010 and ²Molecular Modeling Section (MMS), Department of Pharmaceutical Sciences, University of Padova, PD 35131, Italy

Author notes

*To whom correspondence should be addressed. Tel: +39 049 8275704; Fax: +39 049 8275366; Email: stefano.moro@unipd.it

The authors wish it to be known that, in their opinion, the first two and last two authors should be regarded as joint First Authors.

Article

Publisher ID: gkn727

DOI: 10.1093/nar/gkn727

PMC ID: 2686567

PubMed ID: 18931373

SO-VID: 6e09115e-6a45-44c7-a21b-f31d58c1bf32

License:

This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

History

Date received: 11 August 2008

Date revision received: 27 September 2008

Date accepted: 1 October 2008
