CYBERBULLYING DETECTION IN ONLINE SOCIAL MEDIA USING PRE-TRAINED LANGUAGE MODELS

Kasturi Dewi Varathan; Jasmeen Kah Ying  Bong; Teoh Hwai Teng

Published: Jan 30, 2025

Keywords:

Cyberbullying Detection Transfer Learning Pre-trained Language Models AMiCA Dataset Text Classification

Kasturi Dewi Varathan

Universiti Malaya

Jasmeen Kah Ying Bong

Department of Information Systems, Faculty of Computer Science & Information Technology, Universiti Malaya, 50603 Kuala Lumpur, Malaysia

Teoh Hwai Teng

Department of Information Systems, Faculty of Computer Science & Information Technology, Universiti Malaya, 50603 Kuala Lumpur, Malaysia

Abstract

The rapid integration of Information and Communication Technologies (ICT) has revolutionized online communication, yet it has also led to the emergence of cyberbullying, a harmful digital behaviour. This study addresses the urgency of combating cyberbullying and its negative impacts by using advanced pre-trained language models (PLMs) through transfer learning in detecting cyberbullying in social media. The goal is to enhance cyberbullying detection's effectiveness to create safer online spaces. Cyberbullying detection model using transfer learning, using DistilBERT, DistilELECTRA, and MiniLM PLMs were explored. The PLMs' evaluation using the AMiCA dataset, MiniLM achieves the highest performance in detecting cyberbullying, with an accuracy of 97.84% in cross-validation and 98.57% in hold-out testing, while DistilBERT and DistilELECTRA also perform well, achieving accuracies of 97.34% and 98.03%, and 97.58% and 92.97%, respectively. MiniLM consistently maintains competitive F-measures, addressing class imbalance. Overall, MiniLM stands out with high accuracy and micro F1-scores, outperforming other models. Comparative analysis reaffirms MiniLM's excellence in binary classes and overall evaluation showcasing the effectiveness of transfer learning compared to previous studies. In conclusion, this study demonstrates the capabilities of PLMs for cyberbullying detection and suggests future research directions.

Downloads

Download data is not yet available.

How to Cite

Varathan, K. D., Bong, J. K. Y. ., & Teoh Hwai Teng. (2025). CYBERBULLYING DETECTION IN ONLINE SOCIAL MEDIA USING PRE-TRAINED LANGUAGE MODELS. Malaysian Journal of Computer Science, 38(1), 29–54. Retrieved from https://mjcs.um.edu.my/index.php/MJCS/article/view/53959

Issue

Vol. 38 No. 1 (2025): Malaysian Journal of Computer Science

Section

Articles

Author Biography

CYBERBULLYING DETECTION IN ONLINE SOCIAL MEDIA USING PRE-TRAINED LANGUAGE MODELS

Abstract

Downloads

Kasturi Dewi Varathan, Universiti Malaya

Most read articles by the same author(s)

Article Sidebar

Main Article Content

Abstract

Downloads

Article Details

Kasturi Dewi Varathan, Universiti Malaya

Most read articles by the same author(s)