The algorithm is implemented following the book "Introduction to Data Mining (Second Edition)" by Pang-Ning Tan et al.
- The target (dependent) column name must end with the suffix "_T". For example, if the target column in your dataset is named "gender", rename it to "gender_T". You must make this change to the dataset yourself before using it with the script.
- Columns with categorical values must end with the suffix "_C".
- After renaming the columns, update "file_name" inside the script to match your dataset.
- After completing the previous steps, the script can be run as a typical Python 3 script, and it should output the accuracy of the given dataset.
- The dataset is collated from here.
- Using the attached dataset, the measured accuracy is 76.6.
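
The column-renaming steps above can be done by hand, but a small helper may save time on wider datasets. The following is a minimal sketch, not part of the script itself: the function names `rename_columns` and `rename_csv_header` are hypothetical, and it assumes the dataset is a CSV file whose first row is the header. It also assumes that the "_T" suffix takes precedence when the target column is itself categorical.

```python
import csv


def rename_columns(header, target, categorical):
    """Return a copy of header with the "_T" / "_C" suffixes applied.

    header      -- list of original column names
    target      -- name of the target (dependent) column
    categorical -- collection of categorical column names
    """
    renamed = []
    for name in header:
        if name == target:
            # Assumption: the target suffix wins even if the column is categorical.
            renamed.append(name + "_T")
        elif name in categorical:
            renamed.append(name + "_C")
        else:
            renamed.append(name)
    return renamed


def rename_csv_header(src_path, dst_path, target, categorical):
    """Copy a CSV file, rewriting only its header row (assumed to be row 0)."""
    with open(src_path, newline="") as src:
        rows = list(csv.reader(src))
    rows[0] = rename_columns(rows[0], target, set(categorical))
    with open(dst_path, "w", newline="") as dst:
        csv.writer(dst).writerows(rows)
```

For instance, `rename_csv_header("data.csv", "data_renamed.csv", "gender", ["city"])` would write a copy of a hypothetical "data.csv" with "gender" renamed to "gender_T" and "city" renamed to "city_C"; "file_name" in the script could then be set to the new file.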