Regularized SVM Classification with a new Complexity-Driven Stochastic Optimizer

J. Andrew Howe; Hamparsum Bozdgoan

Regularized SVM Classification with a new Complexity-Driven Stochastic Optimizer

Authors

J. Andrew Howe European Journal of Pure and Applied Mathematics Risk Dynamics Consultancy http://orcid.org/0000-0002-3553-1990 (unauthenticated)
Hamparsum Bozdgoan University of Tennessee, Knoxville

Keywords:

Supervised classification, Discriminant analysis, Support vectors, Information criteria, Feature selection, Stochastic optimization, Reproducing kernel Hilbert space

Abstract

Given a multivariate dataset composed of data from different known sources or processes,Â how can we create a rule to separate the data, and classify any future data? Kernel discriminant analysisÂ is one of many supervised learning techniques that handle this problem. Recently, in this and otherÂ knowledge discovery problems, kernel methods have gained popularity. This is somewhat ironic asÂ another common theme is variable reduction, and kernel methods actually inflate dimensionality. DueÂ to the substantial benefits of processing "kernelized" data, this is excusable - kernel methods frequentlyÂ outperform traditional classification techniques for real data when the classes are not easily separable.Â In performing kernel discriminant analysis, there are two main issues that we address in this article.Â The first is that, in the literature, the question of which kernel function to use is often subjectivelyÂ selected a prior, or determined by cross-validation with the sole objective of maximizing classificationÂ performance. Secondly, after obtaining discriminant functions or support vectors to classify a dataset,Â how do we know which of our variables are most responsible for, and important to, the classification?Â In this research, we develop a new regularized algorithmthat simultaneously selects the kernel functionÂ and subset of original variables. Our algorithm, a hybrid of cross-validation and the genetic algorithm,Â does this by optimizing a function that rewards correct classification while penalizing model complexityÂ and misclassification.

We report results on three real datasets, including data from a medical imaging study. For the latter,Â we obtained an impressively low misclassification rate of 0.3%, while reducing the number of featuresÂ from p = 20 to pâˆ— = 6.

References

Downloads

Published

2016-04-30

Issue

Vol. 9 No. 2: (April 2016)

Section

Econometrics and Statistics

License

Upon acceptance of an article by the European Journal of Pure and Applied Mathematics, the author(s) retain the copyright to the article. However, by submitting your work, you agree that the article will be published under the Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). This license allows others to copy, distribute, and adapt your work, provided proper attribution is given to the original author(s) and source. However, the work cannot be used for commercial purposes.

By agreeing to this statement, you acknowledge that:

You retain full copyright over your work.
The European Journal of Pure and Applied Mathematics will publish your work under the Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0).
This license allows others to use and share your work for non-commercial purposes, provided they give appropriate credit to the original author(s) and source.

How to Cite

Regularized SVM Classification with a new Complexity-Driven Stochastic Optimizer. (2016). European Journal of Pure and Applied Mathematics, 9(2), 216-230. https://www.ejpam.com/ejpam/article/view/2688

Download Citation

Regularized SVM Classification with a new Complexity-Driven Stochastic Optimizer

Authors

Keywords:

Abstract

References

Downloads

Published

Issue

Section

License

How to Cite

submit a manuscript

Information

right_block_image

affiliated_journal_block

formatting_package