STuning-DL: Model-Driven Autotuning of Sparse GPU Kernels for Deep Learning
The relentless growth of modern Machine Learning models has spurred the adoption of sparsification techniques to simplify their architectures and reduce the computational demands.Network pruning has demonstrated success in maintaining original network accuracy while shedding significant portions of the original weights.However, leveraging this spar