I'd just like to note that instead of creating additional animosity between SVMs and deep nets, you could use both together. SVMs with hinge loss can be Yet-another-layer (tm) in your deep net, to be used when it provides better performance.
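
Concretely, here's a minimal PyTorch sketch of that idea (layer sizes and hyperparameters are just illustrative): the final linear layer plus a multi-class hinge loss behaves like a linear SVM trained jointly with the rest of the net, with weight decay standing in for the SVM's L2 margin penalty.

  import torch
  import torch.nn as nn

  # Hypothetical layer sizes, just for illustration.
  net = nn.Sequential(
      nn.Linear(784, 256), nn.ReLU(),
      nn.Linear(256, 64), nn.ReLU(),
      nn.Linear(64, 10),           # the "SVM layer": raw scores, no softmax
  )

  # Multi-class hinge loss; weight_decay plays the SVM's L2 penalty.
  criterion = nn.MultiMarginLoss(margin=1.0)
  optimizer = torch.optim.SGD(net.parameters(), lr=1e-2, weight_decay=1e-4)

  x, y = torch.randn(32, 784), torch.randint(0, 10, (32,))
  optimizer.zero_grad()
  loss = criterion(net(x), y)      # replaces softmax + cross-entropy
  loss.backward()
  optimizer.step()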

That's a great point. Fundamentally, if you look at something like a CNN, what it's really doing is producing a feature descriptor of the input image. One can easily use that feature descriptor in a classic SVM, alongside (or instead of) a softmax classifier.
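
For example, a sketch of that recipe using a pretrained torchvision ResNet as the feature extractor and scikit-learn's LinearSVC as the classifier (the dummy batch and labels are stand-ins for real data):

  import torch
  import torch.nn as nn
  from torchvision import models
  from sklearn.svm import LinearSVC

  # Pretrained CNN as a fixed feature extractor (512-d for resnet18).
  cnn = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)
  cnn.fc = nn.Identity()           # drop the softmax head, keep the features
  cnn.eval()

  images = torch.randn(16, 3, 224, 224)   # stand-in for real images
  labels = [0, 1] * 8

  with torch.no_grad():
      feats = cnn(images).numpy()  # (16, 512) feature descriptors

  # Classic linear SVM on top of the CNN features.
  svm = LinearSVC().fit(feats, labels)
  preds = svm.predict(feats)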

Yup, in fact, that universal feature extraction is what allows ImageNet pretraining to transfer so well to domains like lung cancer images.

One nitpick though: ConvNets can absolutely be used to do the "thinking" too, not just feature extraction. For example, fully convolutional networks can be extremely competitive with nets built on FC layers.
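
A toy sketch of the fully convolutional idea (sizes are made up): 1x1 convolutions play the role of the FC layers, and global average pooling collapses the spatial dimensions, so the same net accepts inputs of any resolution.

  import torch
  import torch.nn as nn

  # Fully convolutional classifier: 1x1 convs stand in for FC layers,
  # so the same net accepts inputs of any spatial size.
  fcn = nn.Sequential(
      nn.Conv2d(3, 32, 3, padding=1), nn.ReLU(),
      nn.Conv2d(32, 64, 3, padding=1), nn.ReLU(),
      nn.Conv2d(64, 10, 1),        # 1x1 conv: a per-location FC layer
      nn.AdaptiveAvgPool2d(1),     # global average pooling over space
      nn.Flatten(),
  )

  for size in (32, 64):            # two different input resolutions
      x = torch.randn(1, 3, size, size)
      print(fcn(x).shape)          # torch.Size([1, 10]) either way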

Could you explain in a bit more detail how you would integrate an SVM layer into a DNN? The kernel matrix depends on all samples, while at training time you would only have access to those in the minibatch.

The simplest approach is to pop it on top. Run your DNN to reduce your input down to a cleaner, lower-dimensional representation, then plop an SVM on top for classification.

Seems like in that case you would train both models separately on different cost functions. By phrasing it as a layer I was expecting both the SVM and the DNN could be trained simultaneously.

Unless things have changed, one of the key benefits of DNNs was that you could train them layer by layer (greedy layer-wise pretraining).

You also want to be able to train the DNN on your unlabelled data and the SVM on your much smaller labelled set.
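
As a sketch of that split (a single autoencoder here, as a simplified stand-in for classic layer-wise pretraining): the encoder is trained on unlabelled data by reconstruction, then a linear SVM is fit on the much smaller labelled set using the learned features. All sizes and data are placeholders.

  import torch
  import torch.nn as nn
  from sklearn.svm import LinearSVC

  # Encoder pretrained on unlabelled data via reconstruction.
  encoder = nn.Sequential(nn.Linear(784, 64), nn.ReLU())
  decoder = nn.Linear(64, 784)
  opt = torch.optim.Adam(
      list(encoder.parameters()) + list(decoder.parameters()), lr=1e-3)

  unlabelled = torch.randn(256, 784)       # big unlabelled set (placeholder)
  for _ in range(10):
      opt.zero_grad()
      loss = nn.functional.mse_loss(decoder(encoder(unlabelled)), unlabelled)
      loss.backward()
      opt.step()

  # SVM fit on the much smaller labelled set, using the learned features.
  labelled_x, labelled_y = torch.randn(20, 784), [0, 1] * 10
  with torch.no_grad():
      feats = encoder(labelled_x).numpy()
  svm = LinearSVC().fit(feats, labelled_y)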
