Normalization of Long-tail Adverse Drug Reactions in Social Media

E. Manousogiannis, Sepideh Mesbah, Alessandro Bozzon, Robert-Jan Sips, Zoltán Szlávik, Selene Baez Santamaria

Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review


The automatic mapping of Adverse Drug Reaction (ADR) reports from user-generated content to concepts in a controlled medical vocabulary provides valuable insights for monitoring public health. While state-of-the-art deep learning-based sequence classification techniques achieve impressive performance for medical concepts with large amounts of training data, they show their limit with long-tail concepts that have a low number of training samples. The above hinders their adaptability to the changes of layman’s terminology and the constant emergence of new informal medical terms. Our objective in this paper is to tackle the problem of normalizing long-tail ADR mentions in user-generated content. In this paper, we exploit the implicit semantics of rare ADRs for which we have few training samples, in order to detect the most similar class for the given ADR. The evaluation results demonstrate that our proposed approach addresses the limitations of the existing techniques when the amount of training data is limited.
Original languageEnglish
Title of host publicationProceedings of the 11th International Workshop on Health Text Mining and Information Analysis
Publication statusPublished - 2020


Dive into the research topics of 'Normalization of Long-tail Adverse Drug Reactions in Social Media'. Together they form a unique fingerprint.

Cite this