Patent for Compressing and Accelerating Mobile AI Models
ECE Professor Yanzhi Wang was awarded a patent for “Computer-implemented methods and systems for compressing recurrent neural network (RNN) models and accelerating RNN execution in mobile devices to achieve real-time inference.”
Abstract Source: USPTO
A recurrent neural network (RNN) acceleration framework leverages both a block-based pruning approach and compiler optimizations to accelerate RNN inference on mobile devices.
