Newest 'kernel-trick+classification' Questions

7 votes

5 answers

4k views

Would a machine learning classifier algorithm be able to determine whether a number is odd or even?

I was testing out some classifier algorithms in scikit but wasn't able to find a classifier (linear or non-linear) that managed to provide good prediction on whether an input number is odd or even. ...

thiagoh

189

asked Dec 29, 2022 at 2:04

1 vote

0 answers

92 views

Implementing kernel alignment for SVM algorithms

I am trying to understand and re-implement the results from Table 2 in the first Kernel-Target Alignment paper. The task that is being done is a simple classification task using an SVM with RBF ...

sheesymcdeezy

61

asked May 20, 2022 at 22:19

1 vote

0 answers

153 views

Why are odd-degreed polynomial kernels slower than those with even degrees for SVM?

I have been using one-class support vector classifiers to extract features for multinomial classification. I noticed that fitting time is much longer when the degree of the polynomial kernel is odd. ...

michen00

111

asked Dec 3, 2021 at 2:08

6 votes

1 answer

3k views

What does representer theorem in machine learning tell us?

In reference to the Representer Theorem in machine learning, Why this is so important? Somehow, this theorem justifies the importance of Kernels in machine learning, i.e. the Kernel trick - a more ...

stats_noob

1

asked Sep 17, 2021 at 6:53

5 votes

2 answers

421 views

Why use RBF kernel if less is needed?

I have seen online theorem's such as Cover's theorem Wikipedia which prove how given $p$ points in $\mathbb{R}^N$ the linear separability is almost certain as the fraction $\dfrac{p}{N}$ is kept close ...

simonegiancola09

51

asked Apr 25, 2021 at 22:51

1 vote

0 answers

69 views

How to extend kernel-based classifier to non-euclidean space like SO3

What is the proper way to extend kernel-based classifier to non-euclidean space like SO3? This kind of situation happens a lot in robotics, where the data points all live in a specific manifold. (Note:...

orematasaburo

211

asked Feb 8, 2021 at 9:51

3 votes

1 answer

86 views

In this example, which of these vectors are support vectors?

The hyperplane of hard margin SVM with $\phi$ kernel is calculated as following that input space using $\phi$ to map to higher dimension space. $$f(\phi(x))=4\phi_1(x)+9\phi_2(x)+4\phi_3(x)$$ $$ \phi(...

batra11

31

asked Jan 12, 2021 at 17:40

2 votes

1 answer

1k views

What is the intuition behind changing the dot product for another inner product in SVM?

I understand that, when classifying with a SVM using a non-linear kernel, we are basically changing the dot product for a "custom" inner product. Is there some reason for working with a different ...

Bananin

728

asked Mar 19, 2019 at 4:04

1 vote

1 answer

2k views

What is the difference between explicit and implicit mapping in SVM?

1) What is the difference between explicit and implicit mapping 2) What is the difference between mapping and kernel trick?

dinesh12

asked Nov 22, 2017 at 5:57

2 votes

2 answers

728 views

Is alpha*RBF a valid kernel, where alpha >= 0 is a parameter?

I wonder if K = alpha*RBF can be a valid kernel satisfying Mercer's condition, where ...

Hello World

173

asked Sep 9, 2017 at 8:39

3 votes

1 answer

689 views

A bunch of questions about Kernels in Machine Learning

i've read many topics on this platform about this topic but i still have some questions, mainly theoretical. We are dealing with ML, so if we are here means that we have to classify with linear ...

rollotommasi

131

asked Sep 7, 2017 at 13:25

1 vote

1 answer

685 views

Does SVM get biased towards majority class in case of imbalanced class proportion?

After reading many posts, I thought of asking: Why should a SVM be biased towards majority class like other classifiers, since an SVM never used the whole data of the training data set—it only uses ...

Argho Chatterjee

111

asked Jun 30, 2017 at 12:53

1 vote

1 answer

775 views

How to understand the predicted negative values by Kernel Regularized Least Squares (KRLS)?

I am learning the prediction algorithm, Kernel Regularized Least Squares (KRLS). The predicted values are listed in the follows: $$\hat{y} = K((K + 1 \times I)^{-1}y)$$ For example, I have 100 ...

Kevin

221

asked May 14, 2017 at 2:26

1 vote

1 answer

573 views

Does SVM prediction accuracy depend on a positive scaling of the kernel function?

Support vector machine (SVM) is a supervised learning algorithm. It draws hyperplanes to separate data points of different classes. The objective function involves inner products of pairs of feature ...

Machine

133

asked Oct 27, 2016 at 19:52

8 votes

1 answer

1k views

Should we account for the intercept term when kernelizing algorithms?

When a learning algorithm (e.g. classification, regression, clustering or dimension reduction) uses only the dot product between data points $\mathbf {x x^T}$ we can implicitly use a higher ...

Firebug

19.5k

asked Aug 30, 2016 at 21:33

9 votes

3 answers

7k views

Projecting to lower/higher-dimensional space for classification: dimensionality reduction vs kernel trick

Whilst learning about classification, I have seen two different arguments. One is that projecting the data to a lower-dimensional space, such as with PCA, makes the data more easily separable. The ...

Karnivaurus

7,129

asked Mar 5, 2016 at 3:36

6 votes

2 answers

1k views

Understanding Kernel Functions for SVMs

I am learning about Support Vector Machines, and in particular, those with kernels for non-linear decision boundaries. I understand the concept of projecting the original data to a higher-dimensional ...

Karnivaurus

7,129

asked Mar 4, 2016 at 20:50

3 votes

1 answer

459 views

How are Hyperplane Heatmaps created and how should they be interpreted?

For nonlinear data, when we are using Support Vector Machines, we can use kernels such as Gaussian RBF, Polynomial, etc to achieve linearity in a different (potentially unknown to us) feature space ...

Ragnar

254

asked Feb 28, 2016 at 6:06

4 votes

1 answer

1k views

SVM Kernel confusion

Suppose that we have an array of 10x2 elements (features). Each of these features are two-dimensional. Something like this: ...

Modium

53

asked Oct 3, 2015 at 23:08

1 vote

1 answer

184 views

How to know which Kernel is better?

I am working on an Image recognition software - My first question is since I already explicitly turm my training images to features vector (and also my test images) what is the point of using ...

Nimrodshn

123

asked Aug 9, 2015 at 4:39

0 votes

1 answer

134 views

How to detect classifier curve in non-separable SVM problem

Suppose we want to classify two class of data that are non-separable with hyper-plane. So we use kernels to map data to high-dimensional space. See my codes: ...

SKMohammadi

121

asked Jul 13, 2015 at 14:15

4 votes

1 answer

287 views

Using kernels with Fisher's linear discriminant analysis

I am a bit stuck implementing the Kernel Fisher Discriminant. $$ J(\mathbf{w}) = \frac{\mathbf{w}^{\text{T}}\mathbf{S}_B^{\phi}\mathbf{w}}{\mathbf{w}^{\text{T}}\mathbf{S}_W^{\phi}\mathbf{w}} $$ $$ ...

crodriguezo

93

asked Apr 17, 2015 at 13:57

1 vote

0 answers

328 views

Probabilistic degree of confidence for the kernel SVM with RBF

Let $f\colon\Bbb{R}^n\to\Bbb{R}$ be the decision function of an SVM using the radial basis function (RBF), $$ k(\mathbf{x},\mathbf{x}')=\exp\Big(-\gamma\|\mathbf{x}-\mathbf{x}'\|^2\Big). $$ That is, $...

nullgeppetto

353

asked Apr 3, 2015 at 17:24

1 vote

2 answers

2k views

Kernel PCA and classification

I need to perform kernel PCA on the colon-‐cancer dataset and then I need to plot number of principal components vs classification accuracy with PCA data. For the first part I am ...

Vivek Aditya

135

asked Mar 19, 2015 at 12:26

2 votes

1 answer

177 views

Integrating length for input-space feature PC projections in kernel PCA

I read a paper detailing the algebraic process of kernel PCA. I have question though: the paper details the projection of new points onto the new eigenvectors in the feature space, but what if I want ...

Simon Kuang

2,121

asked Jan 2, 2015 at 20:03

1 vote

2 answers

3k views

Kernel SVM on sparse data

I have a sparse dataset where a lot of the columns (features) contain mostly zero values. Class labels are multiple discrete categories (10 classes to be precise). I'm wondering if this should trouble ...

Joe

403

asked Dec 4, 2014 at 5:09

3 votes

1 answer

1k views

Which PCA (or kernel PCA) basis better describes a single test sample?

I have two PCA bases obtained by decomposition of two groups of training data. I also have some samples of test data. How can I decide which PCA basis fits better each test sample? I tried to ...

yoki

1,526

asked Oct 13, 2014 at 9:51

3 votes

3 answers

1k views

Applying an RBF kernel first and then train using a Linear Classifier

I will start off by saying that I don't have a concrete understanding of what's under the hood of a SVM classifier. I am interested in using an SVM with the RBF kernel to train a two class ...

Sooshii

31

asked Jul 25, 2014 at 3:44

0 votes

1 answer

86 views

Evolution strategies in libsvm

I'm working on a protein multi-classification problem, using libsvm and the edit distance kernel. This kernel depends on a parameter $\gamma$. I'm able to get the best parameters ($\gamma$ and $C$) ...

Mattia

11

asked Jun 20, 2014 at 15:48

2 votes

1 answer

444 views

Improving SVM classification

I have a classification problem (bioinformatics domain) where I have around 333 features. Currently, I am first selecting features (using importance feature of random forest) and then pushing the same ...

priyanka

325

asked Jun 15, 2014 at 15:52

1 vote

0 answers

51 views

construct/load dataset that performs better with diffusion kernel than other kernel

I'm looking for a dataset on which a diffusion kernel (also called heat kernel), used via SVM, would get better accuracy than other kernels for the classification task. I want to use such a dataset to ...

Jess

11

asked Jun 11, 2014 at 18:39

3 votes

1 answer

1k views

Binary classification using radial basis kernel SVM with a single feature

Is there any interpretation (graphical or otherwise) of a radial basis kernel SVM being trained with a single feature? I can visualize the effect in 2 dimensions (the result being a separation ...

user2422566

459

asked Feb 13, 2014 at 15:33

0 votes

1 answer

561 views

SVM cost, kernel and dimension

Why is it SVM computation cost does not depend on kernel value, dimensions (when separating hyperplane )? Is it because all it does is just classifying and not much calculation involved?

Siga

69

asked Jan 6, 2014 at 12:29

0 votes

1 answer

485 views

Difference between Kernel classifier and linear classifier [duplicate]

I would just like to know what are the differences between kernel classifier and linear classifier? In what kind of problems the first is used and in what kind the second? What could be the ...

Jim Blum

644

asked Nov 24, 2013 at 23:05

59 votes

2 answers

89k views

Linear kernel and non-linear kernel for support vector machine?

When using support vector machine, are there any guidelines on choosing linear kernel vs. nonlinear kernel, like RBF? I once heard that non-linear kernel tends not to perform well once the number of ...

user3269

5,282

asked Oct 17, 2013 at 2:21

5 votes

1 answer

916 views

SVM classification step on embedded system with RBF kernel

I am about to implement the classification step of a trained SVM model. I would like to ask, how the actual classification step is carried out (assuming I would like to port that step to some low-...

puzzled_rhino

51

asked Mar 4, 2013 at 15:05

10 votes

2 answers

20k views

Which SVM kernel to use for a binary classification problem?

I'm a beginner when it comes to support vector machines. Are there some guidelines that say which kernel (e.g. linear, polynomial) is best suited for a specific problem? In my case, I have to classify ...

pemistahl

445

asked Feb 3, 2013 at 13:56

6 votes

1 answer

2k views

Regarding redundant training data in building SVM-based classifier

To build a SVM-based classifier, I have a training data set consisting of N data points. Some of them are redundant. For instance, there have 50 data points which are exactly the same, and there have ...

bit-question

2,827

asked Dec 3, 2012 at 22:14

4 votes

1 answer

10k views

About SVM cost and gamma parameters tuning

I am using R and e1071 package to tune a C-classification SVM. My question is: regardless of the kernel type (linear, ...

Lisa Ann

637

asked Nov 21, 2012 at 10:03

42 votes

2 answers

64k views

Which search range for determining SVM optimal C and gamma parameters?

I am using SVM for classification and I am trying to determine the optimal parameters for linear and RBF kernels. For the linear kernel I use cross-validated parameter selection to determine C and for ...

Kywia

421

asked Nov 19, 2012 at 16:33

9 votes

1 answer

18k views

Non-linear SVM classification with RBF kernel

I'm implementing a non-linear SVM classifier with RBF kernel. I was told that the only difference from a normal SVM was that I had to simply replace the dot product with a kernel function: $$ K(x_i,...

Jan Hadáček

115

asked Nov 16, 2012 at 20:15

2 votes

1 answer

798 views

Possible reason for failing to build a support vector machine

I was trying to build a classifier for a set of documents using a support vector machine. I choose to build the feature space using term occurrence. While experimenting, I found the following scenario:...

user785099

1,307

asked Aug 23, 2012 at 20:26

12 votes

1 answer

3k views

The relationship between the number of support vectors and the number of features

I ran an SVM against a given data set, and made the following observation: If I change the number of features for building the classifier, the number of resulting support vectors will also be changed. ...

user3269

5,282

asked Aug 8, 2012 at 2:37

7 votes

4 answers

8k views

Train a SVM-based classifier while taking into account the weight information

Currently I have a data set which are known to belong to two classes, and would like to build a classifier using SVM. However, there exist different confidence levels for this data set. For example, ...

user3125

3,089

asked Jan 25, 2012 at 3:32

5 votes

1 answer

249 views

Constrain decision boundary to fall on grid lines in multiple class logistic regression

I would like to use multiple class logistic regression to learn the decision boundaries separating the different classes (denoted by color) in the image below. Kernel logistic regression with a RBF ...

fgregg

1,200

asked Nov 5, 2011 at 16:22

3 votes

1 answer

918 views

Linear discriminant analysis and the "kernel trick"?

This is problem 12.10 in "The Elements of Statistical Learning": Suppose you wish to carry out a linear discriminant analysis (two classes) using a vector of transformations of the input ...

Belmont

1,393

asked Mar 8, 2011 at 4:08

7 votes

3 answers

7k views

VC dimension of SVM with polynomial kernel in $\mathbb{R^{2}}$

What is the VC dimension of SVM with the polynomial kernel $k(x,x')=(1+<x,x'>_{\mathbb{R^{2}}})^{2}$ for binary classification in $\mathbb{R^{2}}$? It would be equal or more than v iff ...

Wok

1,105

asked Nov 24, 2010 at 18:47

All Questions

Related Tags