Interpretable segmentation of medical free-text records based on word embeddings

Adam Gabriel Dobrakowski, Agnieszka Mykowiecka, Małgorzata Marciniak, Wojciech Jaworski, Przemysław Biecek

2021

Is it true that patients with similar conditions get similar diagnoses? In this paper we present a natural language processing (NLP) method that can be used to validate this claim. We (1) introduce a method for representing medical visits based on the free-text descriptions recorded by doctors, (2) introduce a new method for segmenting patients’ visits, (3) apply the proposed method to a corpus of 100,000 medical visits, and (4) present tools for interpreting and exploring the derived knowledge representation. The proposed method yields stable, well-separated segments of visits, which were positively validated against medical diagnoses. We show how the presented algorithm may be used to aid doctors in their practice.

Keywords:

Interpretation (logic)
Segmentation
Word (computer architecture)
Natural language processing
Medical diagnosis
Computer science
Representation (mathematics)
Knowledge representation and reasoning
Artificial intelligence
text messaging
