Improving broad-coverage medical entity linking with semantic type prediction and large-scale datasets

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

Objectives:

Biomedical natural language processing tools are increasingly being applied for broad-coverage information extraction—extracting medical information of all types in a scientific document or a clinical note. In such broad-coverage settings, linking mentions of medical concepts to standardized vocabularies requires choosing the best candidate concepts from large inventories covering dozens of types. This study presents a novel semantic type prediction module for biomedical NLP pipelines and two automatically-constructed, large-scale datasets with broad coverage of semantic types.

Methods:

We experiment with five off-the-shelf biomedical NLP toolkits on four benchmark datasets for medical information extraction from scientific literature and clinical notes. All toolkits adopt a staged approach of mention detection followed by two stages of medical entity linking: (1) generating a list of candidate concepts, and (2) picking the best concept among them. We introduce a semantic type prediction module to alleviate the problem of overgeneration of candidate concepts by filtering out irrelevant candidate concepts based on the predicted semantic type of a mention. We present M edT ype, a fully modular semantic type prediction model which we integrate into the existing NLP toolkits. To address the dearth of broad-coverage training data for medical information extraction, we further present W ikiM ed and P ubM edDS, two large-scale datasets for medical entity linking.

Results:

Semantic type filtering improves medical entity linking performance across all toolkits and datasets, often by several percentage points of F-1. Further, pretraining M edT ype on our novel datasets achieves state-of-the-art performance for semantic type prediction in biomedical text.

Conclusions:

Semantic type prediction is a key part of building accurate NLP pipelines for broad-coverage information extraction from biomedical text. We make our source code and novel datasets publicly available to foster reproducible research.

Related collections

Most cited references 136

Record: found
Abstract: not found
Article: not found

ImageNet Large Scale Visual Recognition Challenge

Olga Russakovsky, Jia Deng, Jiang-hao Su … (2015)

0 comments Cited 2464 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

Adam: A Method for Stochastic Optimization

Diederik Kingma, Jimmy Lei Ba (2014)

We introduce Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments. The method is straightforward to implement, is computationally efficient, has little memory requirements, is invariant to diagonal rescaling of the gradients, and is well suited for problems that are large in terms of data and/or parameters. The method is also appropriate for non-stationary objectives and problems with very noisy and/or sparse gradients. The hyper-parameters have intuitive interpretations and typically require little tuning. Some connections to related algorithms, on which Adam was inspired, are discussed. We also analyze the theoretical convergence properties of the algorithm and provide a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework. Empirical results demonstrate that Adam works well in practice and compares favorably to other stochastic optimization methods. Finally, we discuss AdaMax, a variant of Adam based on the infinity norm. Published as a conference paper at the 3rd International Conference for Learning Representations, San Diego, 2015

0 comments Cited 1551 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead

Cynthia Rudin (2019)

Black box machine learning models are currently being used for high stakes decision-making throughout society, causing problems throughout healthcare, criminal justice, and in other domains. People have hoped that creating methods for explaining these black box models will alleviate some of these problems, but trying to explain black box models, rather than creating models that are interpretable in the first place, is likely to perpetuate bad practices and can potentially cause catastrophic harm to society. There is a way forward - it is to design models that are inherently interpretable. This manuscript clarifies the chasm between explaining black boxes and using inherently interpretable models, outlines several key reasons why explainable black boxes should be avoided in high-stakes decisions, identifies challenges to interpretable machine learning, and provides several example applications where interpretable models could potentially replace black box models in criminal justice, healthcare, and computer vision.

0 comments Cited 899 times – based on 0 reviews      Review now

Bookmark

All references

Author and article information

Journal

Journal ID (nlm-journal-id): 100970413

Journal ID (pubmed-jr-id): 22289

Journal ID (nlm-ta): J Biomed Inform

Journal ID (iso-abbrev): J Biomed Inform

Title: Journal of biomedical informatics

ISSN (Print): 1532-0464

ISSN (Electronic): 1532-0480

Publication date Nihms-submitted: 10 March 2022

Publication date (Print): September 2021

Publication date (Electronic): 12 August 2021

Publication date PMC-release: 25 March 2022

Volume: 121

Page: 103880

Affiliations

[a ]Carnegie Mellon University, 5000 Forbes Ave, Pittsburgh, PA, USA

[b ]University of Pittsburgh, 5607 Baum Blvd, Pittsburgh, PA, USA

Author notes

[* ]Corresponding author. shikharvashishth@ 123456gmail.com (S. Vashishth).

Article

Manuscript ID: NIHMS1786948

DOI: 10.1016/j.jbi.2021.103880

PMC ID: 8952339

PubMed ID: 34390853

SO-VID: 44239a91-25ae-451b-a312-96dfe842557b

License:

This is an open access article under the CC BY license ( http://creativecommons.org/licenses/by/4.0/).

Improving broad-coverage medical entity linking with semantic type prediction and large-scale datasets

Read this article at

Abstract

Objectives:

Methods:

Results:

Conclusions:

Related collections

Semantic Knowledge Base

Most cited references 136

ImageNet Large Scale Visual Recognition Challenge

Adam: A Method for Stochastic Optimization

Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead

Author and article information

Journal

Affiliations

Author notes

Article

History

Categories

Comments

Comment on this article

Similar content 44

Cited by 3

Most referenced authors 1,161