Multi-class imbalanced data classification in supervised learning is one of the most challenging research issues in machine learning for data mining applications. Although several data sampling methods have been introduced by computational intelligence researchers in the past decades for handling imbalanced data, still learning from imbalanced data is a challenging task and played as a significant focused research interest as well. Traditional machine learning algorithms usually biased to the majority class instances whereas ignored the minority class instances. As a result, ignoring minority class instances may affect the prediction accuracy of classifiers. Generally, under-sampling and over-sampling methods are commonly used in single model classifiers or ensemble learning for dealing with imbalanced data. In this paper, we have introduced an under-sampling method with support vectors for classifying imbalanced data. The proposed approach selects the most informative majority class instances based on the support vectors that help to engender decision boundary. We have tested the performance of the proposed method with single classifiers (C4.5 Decision Tree classifier and naïve Bayes classifier) and ensemble classifiers (Random Forest and AdaBoost) on 13 benchmark imbalanced datasets. It is explicitly shown by the experimental result that the proposed method produces high accuracy when classifying both the minority and majority class instances compared to other existing methods.
History
Publication title
Proceedings of the 13th International Conference on Software, Knowledge, Information Management and Applications (SKIMA 2019)
Pagination
1-6
ISBN
978-1-7281-2741-5
Department/School
School of Information and Communication Technology
Publisher
Institute of Electrical and Electronics Engineers
Place of publication
United States
Event title
13th International Conference on Software, Knowledge, Information Management and Applications (SKIMA 2019)
Event Venue
Ukulhas, Maldives
Date of Event (Start Date)
2019-08-26
Date of Event (End Date)
2019-08-28
Rights statement
Copyright 2019 IEEE
Repository Status
Restricted
Socio-economic Objectives
Information systems, technologies and services not elsewhere classified