Machine Learning in Cyber Security Analytics using NSL-KDD Dataset

Classification is the procedure to recognize, understand, as well as group ideas and objects into given categories. Classification techniques adopt training data patterns to predict the likelihood that subsequent data will classify into one of the given categories. Classification techniques utilize...

Full description

Saved in:
Bibliographic Details
Published inConference on Technologies and Applications of Artificial Intelligence (Online) pp. 260 - 265
Main Authors Hong, Rui-Fong, Horng, Shih-Cheng, Lin, Shieh-Shing
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.11.2021
Subjects
Online AccessGet full text
ISSN2376-6824
DOI10.1109/TAAI54685.2021.00057

Cover

More Information
Summary:Classification is the procedure to recognize, understand, as well as group ideas and objects into given categories. Classification techniques adopt training data patterns to predict the likelihood that subsequent data will classify into one of the given categories. Classification techniques utilize a variety of algorithms to classify future datasets through training data patterns. In current society, many network attacks continue to carry out various types of attacks. This work performs data pre-processing and uses Python with machine learning algorithms to analyze the NSL-KDD data set. We use various machine learning methods, such as decision trees, random forests, Naïve Bayes, KNN, Gradient Boosted Trees, and SVM to analyze the confusion matrix and predict the accuracy. We also draw the ROC curve and the AUC area. We calculate the ACC accuracy and make a simple assessment of the quality of different algorithms. Test results show that through data pre-processing, machine learning algorithms can be performed with extremely high accuracy.
ISSN:2376-6824
DOI:10.1109/TAAI54685.2021.00057