RobustEncoder: Leveraging K-Means clustering technique to defend NLP models against backdoor attacks
Document Type
Conference Proceeding
Source of Publication
2024 6th International Conference on Blockchain Computing and Applications (BCCA)
Publication Date
11-29-2024
Abstract
As machine learning (ML) systems become increasingly integrated into real-world applications for sensitive tasks, ensuring the security and privacy of these models becomes paramount. Deep Neural Networks (DNNs), in particular, are susceptible to backdoor attacks, where adversaries manipulate training data by inserting specially crafted samples. While the NLP community has extensively studied backdoor attacks, there remains a gap in effective defense mechanisms. To address this, we propose RobustEncoder, a novel approach leveraging K-Means clustering to detect and mitigate backdoor attacks in text-based models. Our method demonstrates significant efficacy in identifying and neutralizing backdoor triggers, as evidenced by extensive empirical evaluations. Additionally, we propose potential applications of blockchain technology to further enhance the security and integrity of the defense mechanisms in future implementations.
DOI Link
ISBN
979-8-3503-5153-8
Publisher
IEEE
Volume
00
First Page
179
Last Page
188
Disciplines
Computer Sciences
Keywords
Backdoor attacks, K-Means clustering, NLP models, Defense mechanisms, Security
Recommended Citation
Albtosh, Luay; Omar, Marwan; Al-Karaki, Jamal N.; Mohammed, Derek; and Zangana, Hewa Majeed, "RobustEncoder: Leveraging K-Means clustering technique to defend NLP models against backdoor attacks" (2024). All Works. 7222.
https://zuscholars.zu.ac.ae/works/7222
Indexed in Scopus
no
Open Access
no