Preprocessing dataset in IDS

This code transfer nominal features into numeric and then normalize the whole dataset using min-max
1.2K Downloads
Updated 4 Apr 2013

View License

Most of classifiers in IDS area, especially artificial intelligence such as SOM, handle only numeric dataset and ignore the symbolic features. Therefore, in this section I present a simple version of matlab code that transfers nominal features in KDD dataset into numeric. In addition, after transferring, it normalizes the dataset using minimum maximum normalization to scale all features into [0,1] to avoid dominance and feature Impact.
The principle idea is considering all KDDcup features as random variables. Hence, this code use the probability mass function to transfer all nominal (symbolic) features, e.g. protocol_type, into numeric ones.

Unfortunately, you need to define the input excel file manually at the beginning, then you need to determine which column is nominal ...
The code is well commented and I think it will be easy to understand

Cite As

Maher Salem (2024). Preprocessing dataset in IDS (https://www.mathworks.com/matlabcentral/fileexchange/41129-preprocessing-dataset-in-ids), MATLAB Central File Exchange. Retrieved .

MATLAB Release Compatibility
Created with R2011a
Compatible with any release
Platform Compatibility
Windows macOS Linux

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!
Version Published Release Notes
1.3.0.0

upload in ZIP format. it contains M file, sample dataset for evaluation, and features header file

1.0.0.0