Countering Malicious URLs in Internet of Things Using a Knowledge-Based Approach and a Simulated Expert

Document Type

Article

Source of Publication

IEEE Internet of Things Journal

Publication Date

5-1-2020

Abstract

© 2014 IEEE. This article proposes a novel methodology to detect malicious uniform resource locators (URLs) using simulated expert (SE) and knowledge-base system (KBS). The proposed study not only efficiently detects known malicious URLs but also adapts countermeasure against the newly generated malicious URLs. Moreover, this article also explored which lexical features are contributing more in final decision using a factor analysis method, and thus help in avoiding the involvement of human experts. Furthermore, we apply the following state-of-the-art machine learning (ML) algorithms, i.e., naïve Bayes (NB), decision tree (DT), gradient boosted trees (GBT), generalized linear model (GLM), logistic regression (LR), deep learning (DL), and random rest (RF), and evaluate the performance of these algorithms on a large-scale real data set of data-driven Web applications. The experimental results clearly demonstrate the efficiency of NB in the proposed model as NB outperforms when compared to the rest of the aforementioned algorithms in terms of average minimum execution time (i.e., 3 s) and is able to accurately classify the 107 586 URLs with 0.2% error rate and 99.8% accuracy rate.

ISSN

2327-4662

Publisher

Institute of Electrical and Electronics Engineers Inc.

Volume

7

Issue

5

First Page

4497

Last Page

4504

Disciplines

Computer Sciences

Keywords

Feature extraction, malicious URLs, naïve Bayes (NB), simulated experts (SEs), URL classification

Scopus ID

85084928260

Indexed in Scopus

yes

Open Access

no

Share

COinS