All Works

Arabic spam tweets classification using deep learning

Sanaa Kaddoura, Zayed University
Suja A. Alex, St. Xavier's Catholic College of Engineering
Maher Itani, Academic Development Division
Safaa Henno, Zayed University
Asma AlNashash, Princess Sumaya University
D. Jude Hemanth, Karunya Institute of Technology and Sciences

Document Type

Article

Source of Publication

Neural Computing and Applications

Publication Date

1-1-2023

Abstract

With the increased use of social network sites such as Twitter, attackers exploit these platforms to spread counterfeit content, such as fake advertisements or illegal material. Classifying such content is a challenging task, especially in Arabic, whose complex structure makes classification more difficult. This paper presents an approach to classifying Arabic tweets using classical (non-deep) machine learning and deep learning techniques. A corpus of tweets was collected through the Twitter API and labelled manually to obtain a reliable dataset. For an efficient classifier, feature extraction is applied to the corpus. Then, two learning techniques are used for each feature extraction technique on the created dataset using N-gram models (uni-gram, bi-gram, and char-gram). The classical machine learning algorithms applied are support vector machines, neural networks, logistic regression, and naïve Bayes. Global Vectors (GloVe) and fastText learning models are utilised for the deep learning approaches. Precision, recall, and F1-score are the performance measures calculated in this paper. Afterwards, the dataset is enlarged using the synthetic minority oversampling technique (SMOTE) to create a balanced dataset. Among the classical machine learning models, the experimental results show that the neural network algorithm outperforms the other algorithms. Moreover, GloVe outperforms fastText among the deep learning approaches.
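The building blocks named in the abstract (N-gram features, SMOTE-style oversampling, and precision/recall/F1) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the whitespace tokenisation, and the sample tweet are all assumptions made for illustration, and the abstract does not specify the preprocessing or the libraries used.

```python
# Illustrative sketch only; every name below is hypothetical and the
# tokenisation is an assumption (plain whitespace splitting).

def word_ngrams(text, n):
    """Return word-level n-grams (n=1 uni-gram, n=2 bi-gram)."""
    tokens = text.split()
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def char_ngrams(text, n):
    """Return character-level n-grams, spaces included."""
    return [text[i:i + n] for i in range(len(text) - n + 1)]


def smote_sample(sample, neighbour, gap):
    """Create one synthetic minority point, SMOTE-style.

    The new point lies on the segment between a minority-class sample and
    one of its minority-class neighbours; gap is a value in [0, 1].
    Neighbour search and random choice of gap are omitted here.
    """
    return [a + gap * (b - a) for a, b in zip(sample, neighbour)]


def precision_recall_f1(y_true, y_pred, positive=1):
    """Compute precision, recall and F1 for the class labelled `positive`."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


if __name__ == "__main__":
    tweet = "اربح جائزة الآن"  # made-up example text, not from the dataset
    print(word_ngrams(tweet, 1))
    print(word_ngrams(tweet, 2))
    print(char_ngrams(tweet, 3))
    print(smote_sample([0.0, 2.0], [2.0, 4.0], 0.5))
    print(precision_recall_f1([1, 1, 1, 0, 0], [1, 1, 0, 1, 0]))
```

In practice, the N-gram counts would be turned into feature vectors before being passed to a classifier, and oversampling would be applied to those vectors for the minority class only.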

DOI Link

10.1007/s00521-023-08614-w

ISSN

0941-0643

Publisher

Springer Science and Business Media LLC

Disciplines

Computer Sciences

Keywords

Classification, Deep learning, Machine learning, Spam, Tweets

Scopus ID

85153946153

Recommended Citation

Kaddoura, Sanaa; Alex, Suja A.; Itani, Maher; Henno, Safaa; AlNashash, Asma; and Hemanth, D. Jude, "Arabic spam tweets classification using deep learning" (2023). All Works. 5826.
https://zuscholars.zu.ac.ae/works/5826

Indexed in Scopus

yes

Open Access

Link to Full Text
