Preprocessing dataset in IDS

This code transfer nominal features into numeric and then normalize the whole dataset using min-max

You are now following this Submission

Most of classifiers in IDS area, especially artificial intelligence such as SOM, handle only numeric dataset and ignore the symbolic features. Therefore, in this section I present a simple version of matlab code that transfers nominal features in KDD dataset into numeric. In addition, after transferring, it normalizes the dataset using minimum maximum normalization to scale all features into [0,1] to avoid dominance and feature Impact.
The principle idea is considering all KDDcup features as random variables. Hence, this code use the probability mass function to transfer all nominal (symbolic) features, e.g. protocol_type, into numeric ones.

Unfortunately, you need to define the input excel file manually at the beginning, then you need to determine which column is nominal ...
The code is well commented and I think it will be easy to understand

Cite As

Maher Salem (2026). Preprocessing dataset in IDS (https://www.mathworks.com/matlabcentral/fileexchange/41129-preprocessing-dataset-in-ids), MATLAB Central File Exchange. Retrieved .

General Information

MATLAB Release Compatibility

  • Compatible with any release

Platform Compatibility

  • Windows
  • macOS
  • Linux
Version Published Release Notes Action
1.3.0.0

upload in ZIP format. it contains M file, sample dataset for evaluation, and features header file

1.0.0.0