Our NeurIPS 2022 paper "Wavelet Feature Maps Compression for Image-to-Image CNNs" is now available.

In this paper, we propose a novel approach to compress CNNs using a modified wavelet compression technique.

Abstract:

>Convolutional Neural Networks (CNNs) are known for requiring extensive computational resources, and quantization is among the best and most common methods for compressing them. While aggressive quantization (i.e., less than 4-bits) performs well for classification, it may cause severe performance degradation in image-to-image tasks such as semantic segmentation and depth estimation. In this paper, we propose Wavelet Compressed Convolution (WCC) -- a novel approach for high-resolution activation maps compression integrated with point-wise convolutions, which are the main computational cost of modern architectures. To this end, we use an efficient and hardware-friendly Haar-wavelet transform, known for its effectiveness in image compression, and define the convolution on the compressed activation map. We experiment with various tasks that benefit from high-resolution input. By combining WCC with light quantization, we achieve compression rates equivalent to 1-4bit activation quantization with relatively small and much more graceful degradation in performance.

Cityscapes semantic segmentation with different compressions.

KITTI depth prediction with different compressions.

Comments

regandeRR t1_is0jd4a wrote on October 12, 2022 at 12:47 PM

#85,355

Great Work!

londons_explorer t1_is0rn7l wrote on October 12, 2022 at 1:52 PM

#85,760

This is the kind of research that makes companies with hardware accelerators (google, nvidia, tesla, etc.) suddenly have to redesign and re-buy their very expensive hardware accelerators...

shahaff32 OP t1_is0ths5 wrote on October 12, 2022 at 2:06 PM

#85,856

Replying to londons_explorer (#85,760)

This is aimed mostly at edge devices, where an accelerator is not available (e.g. mobile phones), or you want to design a cheaper chip for a product that requires running such networks (e.g. autonomous vehicles)

This work was, in fact, partially supported by AVATAR consortium, aimed at smart vehicles. https://avatar.org.il/

londons_explorer t1_is11x8p wrote on October 12, 2022 at 3:05 PM

#86,302

Replying to shahaff32 (#85,856)

Sure this work was aimed at that, but these same techniques can be used to make a datacenter-scale inference machine into an even more powerful one.

And presumably if a way can be found to do backpropagation in 'wavelet domain', then training could be done like this too.

shahaff32 OP t1_is13c2c wrote on October 12, 2022 at 3:15 PM

#86,360

Replying to londons_explorer (#86,302)

We are in fact doing the backpropagation in the wavelet domain :)

The gradient simply goes through the inverse wavelet transform

See WCC/util/wavelet.py in our GitHub repo, lines 52-83 define the forward/backward of WT and IWT.

pm_me_your_ensembles t1_is1busc wrote on October 12, 2022 at 4:11 PM

#86,810

Could this work with 1d convolutions?

shahaff32 OP t1_is1cvgx wrote on October 12, 2022 at 4:18 PM

#86,866

Replying to pm_me_your_ensembles (#86,810)

With some modifications to the code, I believe it can :)

pm_me_your_ensembles t1_is1d3un wrote on October 12, 2022 at 4:20 PM

#86,881

Replying to shahaff32 (#86,866)

Very cool, will take a look, thanks! :D

shahaff32 OP t1_is1dlvu wrote on October 12, 2022 at 4:23 PM

#86,906

Replying to pm_me_your_ensembles (#86,881)

Thank you for your interest in our paper :)

hughperman t1_is1qyob wrote on October 12, 2022 at 5:50 PM

#87,580

So. Since wavelets here are just filter banks, equivalent to fixed/non-varying convolution+downsampling blocks. Could you learn an improved set of wavelet filters to improve this result?

shahaff32 OP t1_is1tuuf wrote on October 12, 2022 at 6:08 PM

#87,707

Replying to hughperman (#87,580)

That is indeed possible, though at a computational cost. The Haar wavelet can be implemented very efficiently because of its simplicity.

Please see Appendix F, where we shortly discuss other wavelets and their added computational costs.

NeverCast t1_is2f1jt wrote on October 12, 2022 at 8:24 PM

#88,628

Replying to shahaff32 (#85,856)

The immediate use case for me was on autonomous flight vehicles where weight and battery usage matters

shahaff32 OP t1_is2o2yz wrote on October 12, 2022 at 9:21 PM

#89,053

Replying to Ecclestoned (#88,990)

Thank you for your interest in our work :)

We were not aware of these recent works. Thanks for sharing :) we will definitely check those out.

davidrodord92 t1_is2q6d3 wrote on October 12, 2022 at 9:35 PM

#89,120

I love wavelets

[deleted] t1_is3370c wrote on October 12, 2022 at 11:08 PM

#89,675

[deleted]

[deleted] t1_is3nly9 wrote on October 13, 2022 at 1:39 AM

#90,567

[deleted]

SearchAtlantis t1_is3pnyq wrote on October 13, 2022 at 1:54 AM

#90,655

Hey my favorite wavelet! It's what I use to explain wavelets before getting into more complex things like daubechies or others.

The compression and depending on task dimension reduction you can get with wavelets is pretty impressive.

shahaff32 OP t1_is4bbd7 wrote on October 13, 2022 at 5:05 AM

#91,516

Replying to SearchAtlantis (#90,655)

Haar wavelet is also very efficient, as it can be implemented using additions and subtractions (and maybe a few bit manipulations) :)

You can also see Appendix F where we tested several others :)

danny_fel t1_is4bv5g wrote on October 13, 2022 at 5:12 AM

#91,539

This sounds great! I'd like to try your method on a small nvidia jetson setup. Do I still need to convert the "minimized" model to TFlite? Or it should be good as it is?

shahaff32 OP t1_is4jcv4 wrote on October 13, 2022 at 6:44 AM

#91,789

Replying to danny_fel (#91,539)

Thanks :)

In the current state the implementation is using only standard Pytorch operations, therefore it is not as optimal as it can be, and the overhead of the wavelet transforms can outweighs the speedup of the convolution.

We are currently working on a CUDA implementation to overcome that :) see Appendix H for more details

danny_fel t1_is8eaky wrote on October 14, 2022 at 1:20 AM

#98,751

Replying to shahaff32 (#91,789)

Oh thanks! Probably will play around with it! This sounds exciting from a maker/hobbyist perspective wanting to do edge applications.

[R] Wavelet Feature Maps Compression for Image-to-Image CNNs