A Trustworthy Approach to Classify and Analyze Epidemic-Related Information From Microblogs

verfasst von
Thi Huyen Nguyen, Marco Fisichella, Koustav Rudra
Abstract

Social media platforms, such as Twitter, are crucial resources to obtain situational information during disease outbreaks. Due to the sheer volume of user-generated content, providing tools that can automatically classify input texts into various types, such as symptoms, transmission, prevention measures, etc., and generate concise situational updates is necessary. Apart from high classification accuracy, interpretability is an important requirement when designing machine learning models for tasks in medical domain. In this article, we provide annotated epidemic-related datasets with labels of information types and rationales, which are short phrases from the original tweets, to support the assigned labels. Next, we introduce a trustworthy approach for the automatic classification of tweets posted during epidemics. Our classification model is able to extract short explanations/rationales for output decisions on unseen data. Moreover, we propose a simple graph-based ranking method to generate short summaries of tweets. Experiments on two epidemic-related datasets show the following: 1) our classification model obtains an average of 82% Macro-F1 and better interpretability scores in terms of Token-F1 (20% improvement) than baselines; 2) the extracted rationales capture essential disease-related information in the tweets; 3) our graph-based method with rationales is simple, yet efficient for generating concise situational updates.

Organisationseinheit(en)
Forschungszentrum L3S
Externe Organisation(en)
Indian Institute of Technology Kharagpur (IITKGP)
Typ
Artikel
Journal
IEEE Transactions on Computational Social Systems
Band
11
Seiten
6229-6241
Anzahl der Seiten
13
Publikationsdatum
10.2024
Publikationsstatus
Veröffentlicht
Peer-reviewed
Ja
ASJC Scopus Sachgebiete
Modellierung und Simulation, Sozialwissenschaften (sonstige), Mensch-Maschine-Interaktion
Ziele für nachhaltige Entwicklung
SDG 3 – Gute Gesundheit und Wohlergehen
Elektronische Version(en)
https://doi.org/10.1109/TCSS.2024.3391395 (Zugang: Geschlossen)