Technique for preparation of large data for machine learning algorithms to generate intrusion detection system

Machine Learning has become the norm for creating models that predict and detect security breaches in computer systems. Modelling of machine learning systems requires a huge amount of data for training and testing the machine learning algorithm. The present research is concerned with preparing the d...

Full description

Saved in:
Bibliographic Details
Published inARPN journal of engineering and applied sciences pp. 1539 - 1546
Format Journal Article
LanguageEnglish
Published 13.09.2023
Online AccessGet full text
ISSN2409-5656
1819-6608
DOI10.59018/0723193

Cover

More Information
Summary:Machine Learning has become the norm for creating models that predict and detect security breaches in computer systems. Modelling of machine learning systems requires a huge amount of data for training and testing the machine learning algorithm. The present research is concerned with preparing the data prior to feeding it to machine learning algorithms. The necessity for such preparation occurs due to the physical limitations of spreadsheet applications (with regard to the number of records they can handle). This research describes a simple shell scripting approach to combine, extract, and weed out unwanted records and finally prepare the data for feeding to machine learning algorithms.
ISSN:2409-5656
1819-6608
DOI:10.59018/0723193