There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

Over recent years, there has been a growing interest in extracting entities and relations from biomedical literature. There are a vast number of systems and approaches being proposed to extract biological relations, but none of them achieves satisfactory results. These methodologies are either parsing-based or pattern-based, which are not competent to handle the grammatical complexities of biomedical texts, or too complicated to be adapted. It is well known that appositive, coordinative propositions and such grammatical structures are extremely common in biomedical texts, particularly in full texts. However, these problems are still untouched for most of researchers. In this paper, we have proposed a new approach, which is hybrid with both shallow parsing and pattern matching, to extract relations between proteins from scientific papers of biomedical themes. In the method, appositive and coordinative structures are interpreted based on the shallow parsing analysis, with both syntactic and semantic constraints. Then long sentences are splitted into sub-ones, from which relations are extracted by a greedy pattern matching algorithm, along with automatically generated patterns. Our approach is experimented to extract protein-protein interactions from full biomedical texts, and has achieved an average F-score of 80% on individual verbs, and 66% on all verbs. With the help of shallow parsing analysis, pattern matching is improved remarkably. Compared with the traditional pattern matching algorithm, our approach achieves about 7% improvement of both precision and F-score. In contrast to other systems, our approach achieves performance comparable to the best. A demo system has been available at http://spies.cs.tsinghua.edu.cn.

Related collections

Author and article information

Journal

PubMed ID:: 16095962

DOI:: 10.1016/j.ijmedinf.2005.06.010

ScienceOpen disciplines: Chemistry

Keywords: Abstracting and Indexing as Topic,methods,Artificial Intelligence,Biology,Database Management Systems,Databases, Bibliographic,Information Storage and Retrieval,Linguistics,Medicine,Natural Language Processing,Periodicals as Topic,Semantics,Terminology as Topic,Vocabulary, Controlled

A hybrid method for relation extraction from biomedical literature.

Read this article at

Abstract

Related collections

Radiology and Natural Language Processing

Author and article information

Journal

Comments

Comment on this article

Similar content 136

Cited by 8