Extracting drug-drug interaction articles from MEDLINE to improve the content of drug databases.

PubMed (2005)

Stephany N. Duda, Constantin Aliferis, Randolph A. Miller, Alexander Statnikov, Kevin B. Johnson

Abstract:

Drug-drug interaction systems exhibit low signal-to-noise ratios because of the amount of clinically insignificant or inaccurate information they contain. MEDLINE represents a respected source of peer-reviewed biomedical citations that potentially might serve as a valuable source of drug-drug interaction information, if relevant articles could be pinpointed effectively and efficiently. We evaluated the classification capability of Support Vector Machines as a method for locating articles about drug interactions. We used a corpus of "positive" and "negative" drug interaction citations to generate datasets composed of MeSH terms, CUI-tagged title and abstract text, and stemmed text words. The study showed that automated classification techniques have the potential to perform at least as well as PubMed in identifying drug-drug interaction articles.

Keywords:

Drug-drug interaction

Topics:

Biomedical Text Mining and Ontologies

Advanced Text Analysis Techniques

Semantic Web and Ontologies

An algorithm for suffix stripping

Program electronic library and information systems (1980)

Martin Porter

The automatic removal of suffixes from words in English is of particular interest in the field of information retrieval. An algorithm for suffix stripping is described, which has been implemented as a short, fast program in BCPL. Although simple, it performs slightly better than a much more elaborate system with which it has been compared. It effectively works by treating complex suffixes as compounds made up of simple suffixes, and removing the simple suffixes in a number of steps. In each step the removal of the suffix is made to depend upon the form of the remaining stem, which usually involves a measure of its syllable length.
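The stepwise idea in this abstract can be illustrated with a minimal sketch. It is not the full published algorithm: the three rule tables below are a small hand-picked subset chosen for illustration, and the names `measure`, `strip_suffixes`, and `STEPS` are assumptions of this example. The sketch shows a compound suffix being removed one simple suffix at a time, with each removal conditioned on the measure (the count of vowel-to-consonant transitions) of the remaining stem.

```python
VOWELS = set("aeiou")

def is_consonant(word, i):
    """True if word[i] acts as a consonant; 'y' after a consonant acts as a vowel."""
    ch = word[i]
    if ch in VOWELS:
        return False
    if ch == "y":
        return i == 0 or not is_consonant(word, i - 1)
    return True

def measure(stem):
    """Count the vowel-run-to-consonant-run transitions in the stem."""
    m = 0
    prev_vowel = False
    for i in range(len(stem)):
        cons = is_consonant(stem, i)
        if cons and prev_vowel:
            m += 1
        prev_vowel = not cons
    return m

# Illustrative subset of rules (an assumption, not the complete rule set):
# each step is (minimum measure the stem must exceed, [(suffix, replacement)]).
STEPS = [
    (0, [("ization", "ize"), ("ational", "ate")]),
    (0, [("alize", "al"), ("icate", "ic")]),
    (1, [("al", ""), ("ate", ""), ("ize", "")]),
]

def strip_suffixes(word):
    """Apply each step in order; at most one rule is tried per step."""
    for min_measure, rules in STEPS:
        for suffix, replacement in rules:
            if word.endswith(suffix):
                stem = word[: -len(suffix)]
                if measure(stem) > min_measure:
                    word = stem + replacement
                break  # the matching suffix decides this step, even if its condition fails
    return word

print(strip_suffixes("generalization"))  # generalization -> generalize -> general -> gener
```

Under these simplified rules, "relational" stops at "relate" because the stem "rel" has measure 1 and the last step requires a measure above 1.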

Stripping (fiber)

Suffix array

Generalized suffix tree

Compressed suffix array

SIMPLE algorithm

10.1108/eb046814

Citations (8,177)

Automatic construction of a large-scale and accurate drug-side-effect association knowledge base from biomedical literature

Journal of Biomedical Informatics (2014)

Rong Xu, QuanQiu Wang

10.1016/j.jbi.2014.05.013

Citations (43)

LIBSVM

ACM Transactions on Intelligent Systems and Technology (2011)

Chih-Chung Chang, Chih-Jen Lin

LIBSVM is a library for Support Vector Machines (SVMs). We have been actively developing this package since the year 2000. The goal is to help users to easily apply SVM to their applications. LIBSVM has gained wide popularity in machine learning and many other areas. In this article, we present all implementation details of LIBSVM. Issues such as solving SVM optimization problems, theoretical convergence, multiclass classification, probability estimates, and parameter selection are discussed in detail.

Popularity

Multiclass classification

10.1145/1961189.1961199

Citations (40,754)

A drug-adverse event extraction algorithm to support pharmacovigilance knowledge mining from PubMed citations.

PubMed (2011)

Wei Wang, Krystl Haerian, Hojjat Salmasian, Rave Harpaz, Herbert Chase

Adverse drug events (ADEs) create a serious problem causing substantial harm to patients. An executable standardized knowledgebase of drug-ADE relations which is publicly available would be valuable so that it could be used for ADE detection. The literature is an important source that could be used to generate a knowledgebase of drug-ADE pairs. In this paper, we report on a method that automatically determines whether a specific adverse event (AE) is caused by a specific drug based on the content of PubMed citations. A drug-ADE classification method was initially developed to detect neutropenia based on a pre-selected set of drugs. This method was then applied to a different set of 76 drugs to determine if they caused neutropenia. For further proof of concept this method was applied to 48 drugs to determine whether they caused another AE, myocardial infarction. Results showed that AUROC was 0.93 and 0.86 respectively.

Adverse drug event

Adverse drug reaction

Executable

Citations (44)

Using a shallow linguistic kernel for drug–drug interaction extraction

Journal of Biomedical Informatics (2011)

Isabel Segura-Bedmar, Paloma Martínez, César de Pablo-Sánchez

Drug-drug interaction

Kernel (algebra)

10.1016/j.jbi.2011.04.005

Citations (143)

Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program.

PubMed (2001)

Alan R. Aronson

The UMLS Metathesaurus, the largest thesaurus in the biomedical domain, provides a representation of biomedical knowledge consisting of concepts classified by semantic type and both hierarchical and non-hierarchical relationships among the concepts. This knowledge has proved useful for many applications including decision support systems, management of patient records, information retrieval (IR) and data mining. Gaining effective access to the knowledge is critical to the success of these applications. This paper describes MetaMap, a program developed at the National Library of Medicine (NLM) to map biomedical text to the Metathesaurus or, equivalently, to discover Metathesaurus concepts referred to in text. MetaMap uses a knowledge intensive approach based on symbolic, natural language processing (NLP) and computational linguistic techniques. Besides being applied for both IR and data mining applications, MetaMap is one of the foundations of NLM's Indexing Initiative System which is being applied to both semi-automatic and fully automatic indexing of the biomedical literature at the library.

Automatic indexing

Thesaurus

Controlled vocabulary

National library

Representation

Citations (1,981)

Extraction and mapping of drug names from free text to a standardized nomenclature.

PubMed (2007)

Matthew A. Levin, Marina Krol, Ankur M. Doshi, David L. Reich

Free text fields are often used to store clinical drug data in electronic health records. The use of free text facilitates rapid data entry by the clinician. Errors in spelling, abbreviations, and jargon, however, limit the utility of these data. We designed and implemented an algorithm, using open source tools and RxNorm, to extract and normalize drug data stored in free text fields of an anesthesia electronic health record. The algorithm was developed using a training set containing drug data from 49,518 cases, and validated using a validation set containing data from 14,655 cases. Overall sensitivity and specificity for the validation set were 92.2% and 95.7% respectively. The main sources of error were misspellings and unknown but valid drug names. These preliminary results demonstrate that free text clinical drug data can be efficiently extracted and mapped to a controlled drug nomenclature.
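For readers unfamiliar with the two metrics reported in this abstract, the short sketch below shows how sensitivity and specificity follow from confusion-matrix counts. The counts used here are hypothetical and are not taken from the study; the function name `sensitivity_specificity` is likewise an assumption of this example.

```python
def sensitivity_specificity(tp, fn, tn, fp):
    """Return (sensitivity, specificity) from confusion-matrix counts.

    sensitivity = true positives / all actual positives
    specificity = true negatives / all actual negatives
    """
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return sensitivity, specificity

# Hypothetical counts, for illustration only (not the paper's data).
sens, spec = sensitivity_specificity(tp=90, fn=10, tn=180, fp=20)
print(f"sensitivity={sens:.1%}, specificity={spec:.1%}")
```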

Text Messaging

Jargon

Spelling

Data set

Data extraction

Citations (60)

A novel MEDLINE topic indexing method using image presentation

Journal of Visual Communication and Image Representation (2018)

Ye Wang, Lan Huang, Shuyu Guo, Leiguang Gong, Tian Bai

Presentation (obstetrics)

Bibliographic database

10.1016/j.jvcir.2018.11.022

Citations (7)

Predicting anti-cancer drug response by finding optimal subset of drugs

Bioinformatics (2021)

Fatemeh Yassaee Meybodi, Changiz Eslahchi

One of the most difficult challenges in precision medicine is determining the best treatment strategy for each patient based on personal information. Since drug response prediction in vitro is extremely expensive, time-consuming and virtually impossible, and because there are so many cell lines and drug data, computational methods are needed. MinDrug is a method for predicting anti-cancer drug response which tries to identify the best subset of drugs that are the most similar to other drugs. MinDrug predicts the anti-cancer drug response on a new cell line using information from drugs in this subset and their connections to other drugs. MinDrug employs a heuristic star algorithm to identify an optimal subset of drugs and a regression technique known as Elastic-Net to predict anti-cancer drug response in a new cell line. To test MinDrug, we use both statistical and biological methods to assess the selected drugs. MinDrug is also compared to four state-of-the-art approaches using various k-fold cross-validations on two large public datasets: GDSC and CCLE. MinDrug outperforms the other approaches in terms of precision, robustness and speed. Furthermore, we compare the evaluation results of all the approaches with an external dataset with a statistical distribution that is not exactly the same as the training data. The results show that MinDrug continues to outperform the other approaches. MinDrug's source code can be found at https://github.com/yassaee/MinDrug. Supplementary data are available at Bioinformatics online.

Robustness

Drug response

Elastic net regularization

Cancer cell lines

Cancer drugs

10.1093/bioinformatics/btab466

Citations (7)

Sequential result refinement for searching the biomedical literature

Journal of Biomedical Informatics (2009)

Len Tanaka, Jorge R Herskovic, M. Sriram Iyengar, Elmer V. Bernstam

Listing (finance)

10.1016/j.jbi.2009.02.009

Citations (4)