Improving Health Mention Classification of Social Media Content Using Contrastive Adversarial Training

Khan, PI; Siddiqui, SA; Razzak, Imran; Dengel, A; Ahmed, S

File(s) not publicly available

journal contribution

posted on 2023-02-14, 04:15 authored by PI Khan, SA Siddiqui, Imran Razzak, A Dengel, S Ahmed

Health mention classification (HMC) is the task of classifying an input text as a health mention or not. Figurative and non-health mentions of disease words make the task challenging, so learning the context of the input text is key. The idea is to learn each word's representation from its surrounding words and to use the emojis in the text to help improve classification. In this paper, we improve the word representations of the input text using adversarial training, which acts as a regularizer during fine-tuning of the model. We generate adversarial examples by perturbing the model's word embeddings and then train the model on pairs of clean and adversarial examples. Additionally, we use a contrastive loss that encourages the model to learn similar representations for a clean example and its perturbed version. We train and evaluate the method on three public datasets. Experiments show that contrastive adversarial training significantly improves F1-score over the BERT-Large and RoBERTa-Large baselines on all three datasets. Furthermore, we provide a brief analysis of the results using explainable AI.
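The training objective described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not name the perturbation rule or the exact contrastive loss, so the sketch assumes a sign-gradient (FGSM-style) perturbation and a softmax-over-cosine-similarity contrastive term. A toy logistic classifier over a single embedding vector stands in for BERT or RoBERTa. The names `fgsm_perturb`, `contrastive_loss`, `combined_loss`, `epsilon`, `lam`, and `temperature` are all hypothetical.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def bce(p, y):
    # Binary cross-entropy for one example; the small constant avoids log(0).
    tiny = 1e-12
    return -(y * math.log(p + tiny) + (1 - y) * math.log(1 - p + tiny))

def predict(emb, w, b):
    # Toy stand-in for the fine-tuned model: logistic regression on an embedding.
    return sigmoid(dot(w, emb) + b)

def fgsm_perturb(emb, w, b, y, epsilon):
    # ASSUMPTION: sign-gradient step of size epsilon on the embedding.
    # For the logistic model, d(BCE)/d(emb) = (p - y) * w.
    p = predict(emb, w, b)
    grad = [(p - y) * wi for wi in w]
    return [e + epsilon * ((g > 0) - (g < 0)) for e, g in zip(emb, grad)]

def cosine(a, b):
    return dot(a, b) / (math.sqrt(dot(a, a)) * math.sqrt(dot(b, b)) + 1e-12)

def contrastive_loss(clean, adv, temperature=0.5):
    # ASSUMPTION: each clean example should match its own perturbed version
    # (positive pair) better than the perturbed versions of other examples.
    # A real model would compare encoder outputs, not raw embeddings.
    total = 0.0
    for i in range(len(clean)):
        logits = [cosine(clean[i], a) / temperature for a in adv]
        m = max(logits)
        log_denom = m + math.log(sum(math.exp(l - m) for l in logits))
        total += log_denom - logits[i]
    return total / len(clean)

def combined_loss(batch, labels, w, b, epsilon=0.1, lam=0.5):
    # Weighted sum of classification loss on clean and adversarial examples
    # plus the contrastive term; the weighting scheme is an assumption.
    adv = [fgsm_perturb(e, w, b, y, epsilon) for e, y in zip(batch, labels)]
    n = len(batch)
    ce_clean = sum(bce(predict(e, w, b), y) for e, y in zip(batch, labels)) / n
    ce_adv = sum(bce(predict(a, w, b), y) for a, y in zip(adv, labels)) / n
    con = contrastive_loss(batch, adv)
    return (1 - lam) * 0.5 * (ce_clean + ce_adv) + lam * con, adv

# Hypothetical toy data: two 3-dimensional "embeddings" with binary labels.
w, b = [0.5, -1.0, 0.25], 0.0
batch = [[1.0, 0.2, -0.3], [-0.4, 0.9, 0.1]]
labels = [1, 0]
loss, adv_batch = combined_loss(batch, labels, w, b)
```

In this toy setting the perturbation always raises the classification loss of the example it is applied to, which is what lets the adversarial term act as a regularizer. In a transformer, the gradient with respect to the word embeddings would come from backpropagation rather than a closed form.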

History

Journal

IEEE Access

Volume

10

Pagination

87900-87910

Publisher DOI

https://doi.org/10.1109/ACCESS.2022.3200159

ISSN

2169-3536

eISSN

2169-3536

Language

English

Author URL

https://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=PARTNER_APP&SrcAuth=LinksAMR&KeyUT=WOS:000848200600001&DestLinkType=FullRecord&DestApp=ALL_WOS&UsrCustomerID=a045e4b2bb1f2b747c68c720ec8913b7

Publication classification

C1.1 Refereed article in a scholarly journal

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Keywords

Science & Technology; Technology; Computer Science, Information Systems; Engineering, Electrical & Electronic; Telecommunications; Computer Science; Engineering; Training data; Adversarial machine learning; Social networking (online); Bit error rate; Transformers; Perturbation methods; Health mention classification; contrastive adversarial training; tweet classification; Information and Computing Sciences
