Intel Corporation
Per kernel Kmeans compression for neural networks

Last updated: 27 Jul 2021

Abstract:

Methods and apparatus relating to techniques for incremental network quantization. In an example, an apparatus comprises logic, at least partially comprising hardware logic, to determine a plurality of weights for a layer of a convolutional neural network (CNN) comprising a plurality of kernels; organize the plurality of weights into a plurality of clusters for the plurality of kernels; and apply a K-means compression algorithm to each of the plurality of clusters. Other embodiments are also disclosed and claimed.
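
As an illustration of the idea in the abstract, the following is a minimal sketch and not the patented implementation. It assumes that the weights of each kernel form one group and that each group is quantized independently with a scalar K-means codebook. The function names (kmeans_1d, compress_per_kernel, decompress), the evenly spaced centroid initialization, and the default of four centroids per kernel are all assumptions made for this example; the abstract does not specify them.

```python
import numpy as np

def kmeans_1d(values, k, iters=20):
    """Plain Lloyd's K-means on a 1-D array. Returns (centroids, labels)."""
    # Assumed initialization: spread centroids evenly over the value range.
    centroids = np.linspace(values.min(), values.max(), k)
    for _ in range(iters):
        # Assign each weight to its nearest centroid.
        labels = np.argmin(np.abs(values[:, None] - centroids[None, :]), axis=1)
        # Move each non-empty centroid to the mean of its members.
        for j in range(k):
            members = values[labels == j]
            if members.size:
                centroids[j] = members.mean()
    labels = np.argmin(np.abs(values[:, None] - centroids[None, :]), axis=1)
    return centroids, labels

def compress_per_kernel(weights, k=4):
    """weights has shape (num_kernels, ...). Each kernel gets its own codebook."""
    compressed = []
    for kernel in weights:
        codebook, idx = kmeans_1d(kernel.ravel(), k)
        compressed.append((codebook, idx.reshape(kernel.shape)))
    return compressed

def decompress(compressed):
    """Rebuild an approximate weight tensor from per-kernel codebooks and indices."""
    return np.stack([codebook[idx] for codebook, idx in compressed])

# Usage: 8 kernels of shape 3x3x3, each reduced to at most 4 distinct values.
layer = np.random.default_rng(0).normal(size=(8, 3, 3, 3))
restored = decompress(compress_per_kernel(layer, k=4))
```

Under these assumptions, each kernel stores k floating-point centroids plus one small integer index per weight, so a kernel with a narrow value range does not share a codebook with a kernel that has a wide one.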

Status:

Grant

Type:

Utility

Filing date:

12 Sep 2017

Issue date:

6 Jul 2021
