Preprocessing dataset in IDS
Most of classifiers in IDS area, especially artificial intelligence such as SOM, handle only numeric dataset and ignore the symbolic features. Therefore, in this section I present a simple version of matlab code that transfers nominal features in KDD dataset into numeric. In addition, after transferring, it normalizes the dataset using minimum maximum normalization to scale all features into [0,1] to avoid dominance and feature Impact.
The principle idea is considering all KDDcup features as random variables. Hence, this code use the probability mass function to transfer all nominal (symbolic) features, e.g. protocol_type, into numeric ones.
Unfortunately, you need to define the input excel file manually at the beginning, then you need to determine which column is nominal ...
The code is well commented and I think it will be easy to understand
Cite As
Maher Salem (2026). Preprocessing dataset in IDS (https://www.mathworks.com/matlabcentral/fileexchange/41129-preprocessing-dataset-in-ids), MATLAB Central File Exchange. Retrieved .
MATLAB Release Compatibility
Platform Compatibility
Windows macOS LinuxCategories
Tags
Discover Live Editor
Create scripts with code, output, and formatted text in a single executable document.
