Patent for Compressing and Accelerating Mobile AI Models

Yanzhi Wang, a professor of electrical and computer engineering (ECE), was awarded a patent for “Computer-implemented methods and systems for compressing recurrent neural network (RNN) models and accelerating RNN execution in mobile devices to achieve real-time inference.”


Abstract Source: USPTO

A recurrent neural network (RNN) acceleration framework leverages both a block-based pruning approach and compiler optimizations to accelerate RNN inference on mobile devices.
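To give a rough sense of what block-based pruning can look like, here is a minimal sketch in Python with NumPy. It is an illustration only and is not taken from the patent: the function name `block_prune`, the parameters `block_rows` and `keep_ratio`, and the choice to rank columns by L2 norm within each block of rows are all assumptions made for this example.

```python
import numpy as np


def block_prune(weights, block_rows, keep_ratio):
    """Hypothetical block-based pruning sketch (not the patented method).

    The weight matrix is split into blocks of `block_rows` rows. Inside each
    block, the columns with the smallest L2 norm are set to zero so that
    roughly `keep_ratio` of the columns remain in that block.
    """
    pruned = weights.copy()                      # leave the input untouched
    n_rows, n_cols = pruned.shape
    n_keep = max(1, int(round(keep_ratio * n_cols)))
    for start in range(0, n_rows, block_rows):
        block = pruned[start:start + block_rows]  # view into `pruned`
        norms = np.linalg.norm(block, axis=0)     # one norm per column
        drop = np.argsort(norms)[: n_cols - n_keep]
        block[:, drop] = 0.0                      # zero whole columns in this block
    return pruned


# Example: an 8x6 matrix pruned in two 4-row blocks, keeping half the columns.
rng = np.random.default_rng(0)
w = rng.standard_normal((8, 6))
sparse_w = block_prune(w, block_rows=4, keep_ratio=0.5)
```

Because each block zeroes out entire columns, the surviving weights follow a regular pattern. Regular sparsity of this kind is generally easier for a compiler to turn into compact loops than scattered individual zeros, which is one plausible reason to pair pruning with compiler optimizations.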

Related Faculty: Yanzhi Wang

Related Departments: Electrical & Computer Engineering