Enhancing Classification Performance through Rough Set Theory Feature Selection: A Comparative Study across Multiple Datasets

Ashika T; Hannah Grace G

doi:10.29020/nybg.ejpam.v18i2.5934

Authors

Ashika T, Department of Mathematics, School of Advanced Sciences, Vellore Institute of Technology, Chennai, India
Hannah Grace G, Department of Mathematics, School of Advanced Sciences, Vellore Institute of Technology, Chennai, India (ORCID: https://orcid.org/0000-0001-9923-3709)

DOI:

https://doi.org/10.29020/nybg.ejpam.v18i2.5934

Keywords:

Machine learning, rough set theory, Quickreduct, feature selection

Abstract

In Machine Learning (ML), handling high-dimensional data with redundant or irrelevant features presents significant challenges. Effective feature selection is essential for enhancing model performance, reducing computational complexity, and improving interpretability. Rough Set Theory (RST) provides a powerful mathematical framework for managing uncertainty, making it a valuable tool for feature selection. This study applies RST-based feature selection to five diverse datasets, aiming to eliminate insignificant attributes. We evaluate the performance of various ML models, including Logistic Regression (LR), K-Nearest Neighbor (KNN), Support Vector Machine (SVM), Kernel SVM, Naïve Bayes (NB), Decision Tree (DT) and Random Forest (RF), on both the original and RST-selected datasets. Standard metrics such as accuracy, precision, recall, F1-score and Mean Absolute Error (MAE) are used for evaluation. Our results demonstrate that RST effectively selects relevant features without significant information loss. Models trained on RST-selected datasets exhibit comparable or improved performance, with RF and SVM models showing notable gains in accuracy and efficiency. These findings highlight the potential of RST-based feature selection to enhance ML model performance while reducing computational complexity, making it a valuable approach for various ML applications.
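The keywords name QuickReduct, but this page does not spell out the procedure. As a rough illustration only, the sketch below follows the textbook greedy form of QuickReduct: it measures the rough-set dependency degree (the fraction of records whose equivalence class under the chosen attributes has a single decision value) and keeps adding the attribute that raises it most until it matches the dependency of the full attribute set. The function names, the dictionary-based record format, and the toy data are assumptions made for this example and are not taken from the paper; the authors' implementation may differ.

```python
from collections import defaultdict


def dependency(rows, attrs, decision):
    """Dependency degree of `decision` on `attrs`: |positive region| / |universe|."""
    if not rows:
        return 0.0
    # Group records into equivalence classes by their values on `attrs`.
    classes = defaultdict(list)
    for row in rows:
        classes[tuple(row[a] for a in attrs)].append(row[decision])
    # A class belongs to the positive region when all its records agree on the decision.
    consistent = sum(len(labels) for labels in classes.values()
                     if len(set(labels)) == 1)
    return consistent / len(rows)


def quickreduct(rows, conditional, decision):
    """Greedy QuickReduct-style selection (illustrative sketch, not the paper's code)."""
    full = dependency(rows, conditional, decision)
    reduct = []
    best = dependency(rows, reduct, decision)
    while best < full:
        candidates = [a for a in conditional if a not in reduct]
        scores = {a: dependency(rows, reduct + [a], decision) for a in candidates}
        # Add the attribute with the highest resulting dependency (first one on ties).
        top = max(candidates, key=lambda a: scores[a])
        reduct.append(top)
        best = scores[top]
    return reduct


# Hypothetical toy table: the decision "d" is fully determined by attribute "b".
rows = [
    {"a": 0, "b": 0, "c": 1, "d": "no"},
    {"a": 0, "b": 1, "c": 1, "d": "yes"},
    {"a": 1, "b": 0, "c": 0, "d": "no"},
    {"a": 1, "b": 1, "c": 0, "d": "yes"},
]
selected = quickreduct(rows, ["a", "b", "c"], "d")
print(selected)
```

On this toy table the selection keeps only "b", since "a" and "c" on their own leave every equivalence class with mixed decisions. Being greedy, the procedure returns a subset that preserves the full dependency degree but is not guaranteed to be a minimal one.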

Published

2025-05-01

Issue

Vol. 18 No. 2 (April 2025)

Section

Computer Science

License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

Upon acceptance of an article by the European Journal of Pure and Applied Mathematics, the author(s) retain the copyright to the article. However, by submitting your work, you agree that the article will be published under the Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). This license allows others to copy, distribute, and adapt your work, provided proper attribution is given to the original author(s) and source. However, the work cannot be used for commercial purposes.

How to Cite

Enhancing Classification Performance through Rough Set Theory Feature Selection: A Comparative Study across Multiple Datasets. (2025). European Journal of Pure and Applied Mathematics, 18(2), 5934. https://doi.org/10.29020/nybg.ejpam.v18i2.5934
