Intel Corporation
A CONTENT ADAPTIVE ATTENTION MODEL FOR NEURAL NETWORK-BASED IMAGE AND VIDEO ENCODERS
Last updated:
Abstract:
Various embodiments are generally directed to using attention models in neural network-based image and video encoders and/or decoders. A first feature map of a first image may be generated by a first layer of a neural network, the neural network executing on a computer processor to encode the first image. An attention layer of the neural network may compute an adaptive spatial saliency map for the first feature map of the first image based on the first feature map of the first image. The neural network may then perform an element-wise multiplication of the first feature map and the adaptive spatial saliency map for the first feature map to generate a modulated feature map to encode the first image.
Status:
Application
Type:
Utility
Filling date:
3 Dec 2018
Issue date:
25 Nov 2021