Classification of malicious and benign websites by network features using supervised machine learning algorithms

Author First name, Last name, Institution

Sanaa Kaddoura, Zayed University

Document Type

Conference Proceeding

Source of Publication

2021 5th Cyber Security in Networking Conference (CSNet)

Publication Date

10-14-2021

Abstract

Due to the increase in Internet usage through the past years, cyber-attacks have rapidly increased, leading to high personal information and financial loss. Cyberattacks can include phishing, spamming, and malware. Because websites, the most common element of the Internet, are widely used, hackers find their targets to attack. Therefore, the detection of malicious websites is critical for organizations and individuals to increase security. The earlier a malicious website is detected, the faster it is defended. In this paper, a dataset is analyzed and applied to multiple supervised machine learning models such as Random Forest, Artificial Neural Network, K-nearest neighbors, and Support Vector Machine. The dataset attributes are extracted based on the application layer and different network characteristics. The experimental studies with many benign and malicious websites obtained from real-life Internet resources show a high prediction performance. Due to the imbalanced dataset studied in this paper, the F1-score was measured instead of the accuracy. The support vector machine algorithm showed the highest performance over all the other algorithms studied, with a value of 92%.

ISBN

9781665407229

Publisher

Institute of Electrical and Electronics Engineers (IEEE)

Volume

00

Disciplines

Computer Sciences

Keywords

Support vector machines, Machine learning algorithms, Unsolicited e-mail, Phishing, Organizations, Prediction algorithms, Classification algorithms

Indexed in Scopus

no

Open Access

no

Share

COinS