Simple Deep Learning Network via Tensor-Train Haar Wavelet Decomposition Without Retraining
Journal
IEEE International Workshop on Machine Learning for Signal Processing, MLSP
Journal Volume
2018-September
Pages
1522-1527
Date Issued
2018
Author(s)
Abstract
Deep neural network has revolutionized machine learning recently. However, it suffers from both high computation and memory cost such that deploying it on a hardware with limited resources (e.g., mobile devices) becomes a challenge. To address this problem, we propose a new technique, called Tensor-Train Haar-wavelet decomposition, that decomposes a large weight tensor from a fully-connected layer into a sequence of partial Haar-wavelet matrices without retraining. The novelty originates from the deterministic partial Haar-wavelet matrices such that we only need to store row indices instead of the whole matrix. Empirical results demonstrate that our method achieves efficient model compression while maintaining limited accuracy loss, even without retraining. © 2018 IEEE.
Subjects
Artificial intelligence; Deep neural networks; Signal processing; Tensors; Accuracy loss; Fully-connected layers; Haar wavelet decomposition; Haar wavelets; Learning network; Memory cost; Model compression; Tensor trains; Wavelet decomposition
Other Subjects
Artificial intelligence; Deep neural networks; Signal processing; Tensors; Accuracy loss; Fully-connected layers; Haar wavelet decomposition; Haar wavelets; Learning network; Memory cost; Model compression; Tensor trains; Wavelet decomposition
Type
conference paper
