International Business Machines Corporation
AUTOMATIC HYBRID QUANTIZATION FOR DEEP NEURAL NETWORK

Last updated:

Abstract:

Methods, computer program products, and/or systems are provided that perform the following operations: obtaining a target neural network structure and constraints for a target neural network; generating a meta learning network having an associated quantization function based, at least in part, on the target neural network structure; training the meta learning network based, at least in part, on providing a hybrid quantization vector as input to the meta learning network and providing a training dataset to the target neural network; obtaining a plurality of hybrid quantization vectors; determining a new hybrid quantization vector from the plurality of hybrid quantization vectors; and retraining the trained meta learning network based, at least in part, on providing the new hybrid quantization vector as input to the trained meta learning network.

Status:
Application
Type:

Utility

Filling date:

11 Dec 2020

Issue date:

16 Jun 2022