Convolutional neural networks with low-rank regularization

Tai, Cheng; Xiao, Tong; Zhang, Yi; Wang, Xiaogang; E, Weinan

arXiv:1511.06067 (cs)

[Submitted on 19 Nov 2015 (v1), last revised 14 Feb 2016 (this version, v3)]

Title: Convolutional neural networks with low-rank regularization

Authors: Cheng Tai, Tong Xiao, Yi Zhang, Xiaogang Wang, Weinan E

Abstract: Large CNNs have delivered impressive performance in various computer vision applications, but their storage and computation requirements make them difficult to deploy on mobile devices. Recently, tensor decompositions have been used to speed up CNNs. In this paper, we further develop the tensor decomposition technique. We propose a new algorithm for computing the low-rank tensor decomposition that removes the redundancy in the convolution kernels. The algorithm finds the exact global optimizer of the decomposition and is more effective than iterative methods. Based on the decomposition, we further propose a new method for training low-rank constrained CNNs from scratch. Interestingly, while achieving a significant speedup, the low-rank constrained CNNs sometimes deliver significantly better performance than their non-constrained counterparts. On the CIFAR-10 dataset, the proposed low-rank NIN model achieves $91.31\%$ accuracy (without data augmentation), which also improves upon the state-of-the-art result. We evaluated the proposed method on the CIFAR-10 and ILSVRC12 datasets for a variety of modern CNNs, including AlexNet, NIN, VGG and GoogleNet, with success. For example, the forward time of VGG-16 is reduced by half while the performance remains comparable. This empirical success suggests that low-rank tensor decompositions can be a very useful tool for speeding up large CNNs.
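
To make the idea of a closed-form low-rank kernel decomposition concrete, the following is a minimal NumPy sketch, not the paper's implementation. It assumes one particular factorization (a d x d kernel split into a vertical d x 1 stage and a horizontal 1 x d stage joined by K intermediate channels) and a particular matricization of the kernel; both choices, along with the names `factor_kernel` and `reconstruct`, are illustrative assumptions that the abstract does not spell out. Under those assumptions, the best rank-K fit in Frobenius norm follows from a truncated SVD (the Eckart-Young theorem), so no iterative optimization is needed.

```python
import numpy as np


def factor_kernel(W, K):
    """Factor a conv kernel W of shape (N, C, d, d) into a vertical part
    V of shape (K, C, d) and a horizontal part H of shape (N, K, d) so that
    W[n, c, i, j] is approximated by sum_k V[k, c, i] * H[n, k, j].

    The matricization below is an assumed layout chosen for illustration.
    """
    N, C, d, _ = W.shape
    # Rows index (input channel, vertical offset); columns index
    # (horizontal offset, output channel).
    M = W.transpose(1, 2, 3, 0).reshape(C * d, d * N)
    U, s, Vt = np.linalg.svd(M, full_matrices=False)
    root = np.sqrt(s[:K])
    A = U[:, :K] * root            # shape (C*d, K)
    B = root[:, None] * Vt[:K]     # shape (K, d*N)
    V = A.reshape(C, d, K).transpose(2, 0, 1)   # (K, C, d)
    H = B.reshape(K, d, N).transpose(2, 0, 1)   # (N, K, d)
    return V, H, s


def reconstruct(V, H):
    """Rebuild the (N, C, d, d) kernel implied by the two factors."""
    return np.einsum("kci,nkj->ncij", V, H)


# Toy example with random weights (not a trained network).
rng = np.random.default_rng(0)
N, C, d, K = 8, 4, 3, 4
W = rng.standard_normal((N, C, d, d))

V, H, s = factor_kernel(W, K)
W_hat = reconstruct(V, H)

# Eckart-Young: the approximation error equals the norm of the
# discarded singular values.
err = np.linalg.norm(W - W_hat)
expected_err = np.sqrt(np.sum(s[K:] ** 2))

params_full = W.size               # N * C * d * d
params_low_rank = V.size + H.size  # K * d * (C + N)
```

In this toy setting the factored form stores K * d * (C + N) weights instead of N * C * d * d, which is where the storage and compute savings would come from when K is small; how K is chosen per layer, and how the factored layers are trained, is described in the paper itself rather than in this sketch.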

Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
Cite as:	arXiv:1511.06067 [cs.LG]
	(or arXiv:1511.06067v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1511.06067

Submission history

From: Cheng Tai
[v1] Thu, 19 Nov 2015 06:13:55 UTC (612 KB)
[v2] Thu, 10 Dec 2015 23:46:17 UTC (636 KB)
[v3] Sun, 14 Feb 2016 03:46:09 UTC (781 KB)
