Disease candidate gene identification and prioritization using protein interaction networks

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

Background

Although most of the current disease candidate gene identification and prioritization methods depend on functional annotations, the coverage of the gene functional annotations is a limiting factor. In the current study, we describe a candidate gene prioritization method that is entirely based on protein-protein interaction network (PPIN) analyses.

Results

For the first time, extended versions of the PageRank and HITS algorithms, and the K-Step Markov method are applied to prioritize disease candidate genes in a training-test schema. Using a list of known disease-related genes from our earlier study as a training set ("seeds"), and the rest of the known genes as a test list, we perform large-scale cross validation to rank the candidate genes and also evaluate and compare the performance of our approach. Under appropriate settings – for example, a back probability of 0.3 for PageRank with Priors and HITS with Priors, and step size 6 for K-Step Markov method – the three methods achieved a comparable AUC value, suggesting a similar performance.

Conclusion

Even though network-based methods are generally not as effective as integrated functional annotation-based methods for disease candidate gene prioritization, in a one-to-one comparison, PPIN-based candidate gene prioritization performs better than all other gene features or annotations. Additionally, we demonstrate that methods used for studying both social and Web networks can be successfully used for disease candidate gene prioritization.

Related collections

Most cited references 40

Record: found
Abstract: found
Article: not found

Emergence of scaling in random networks

Zoltan Barabasi, Albert Donnay, Albert Dionise … (1999)

Systems as diverse as genetic networks or the World Wide Web are best described as networks with complex topology. A common property of many large networks is that the vertex connectivities follow a scale-free power-law distribution. This feature was found to be a consequence of two generic mechanisms: (i) networks expand continuously by the addition of new vertices, and (ii) new vertices attach preferentially to sites that are already well connected. A model based on these two ingredients reproduces the observed stationary scale-free distributions, which indicates that the development of large networks is governed by robust self-organizing phenomena that go beyond the particulars of the individual systems.

0 comments Cited 642 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: not found
Article: not found

The genetic association database.

Kevin G. Becker, Kathleen C Barnes, Tiffani Bright … (2004)

0 comments Cited 435 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

BIND: the Biomolecular Interaction Network Database.

Gary Bader, Doron Betel, Christopher W. V. Hogue (2003)

The Biomolecular Interaction Network Database (BIND: http://bind.ca) archives biomolecular interaction, complex and pathway information. A web-based system is available to query, view and submit records. BIND continues to grow with the addition of individual submissions as well as interaction data from the PDB and a number of large-scale interaction and complex mapping experiments using yeast two hybrid, mass spectrometry, genetic interactions and phage display. We have developed a new graphical analysis tool that provides users with a view of the domain composition of proteins in interaction and complex records to help relate functional domains to protein interactions. An interaction network clustering tool has also been developed to help focus on regions of interest. Continued input from users has helped further mature the BIND data specification, which now includes the ability to store detailed information about genetic interactions. The BIND data specification is available as ASN.1 and XML DTD.

0 comments Cited 400 times – based on 0 reviews      Review now

Bookmark

All references

Author and article information

Journal

Journal ID (nlm-ta): BMC Bioinformatics

Title: BMC Bioinformatics

Publisher: BioMed Central

ISSN (Electronic): 1471-2105

Publication date Collection: 2009

Publication date (Electronic): 27 February 2009

Volume: 10

Page: 73

Affiliations

[1 ]Division of Biomedical Informatics, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, USA

[2 ]Department of Biomedical Engineering, University of Cincinnati, Cincinnati, OH, USA

[3 ]Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH, USA

Article

Publisher ID: 1471-2105-10-73

DOI: 10.1186/1471-2105-10-73

PMC ID: 2657789

PubMed ID: 19245720

SO-VID: 72063e7a-c765-463f-9c1c-a7d30f1af35f

License:

This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Disease candidate gene identification and prioritization using protein interaction networks

Read this article at

Abstract

Background

Results

Conclusion

Related collections

Genetoberfest

Most cited references 40

Emergence of scaling in random networks

The genetic association database.

BIND: the Biomolecular Interaction Network Database.

Author and article information

Journal

Affiliations

Article

History

Categories

Comments

Comment on this article

Similar content 168

Cited by 124

Most referenced authors 1,440