SAP SE
Document Information Extraction Without Additional Annotations

Last updated: 18 May 2022

Abstract:

Disclosed herein are system, method, and computer program product embodiments for document information extraction without additional annotations. An embodiment operates by receiving an input representing a document and a key. The embodiment processes the input using a convolutional neural network to obtain a feature map. The embodiment combines the feature map with positional information to obtain a spatial-aware feature map. The embodiment then repeatedly performs the following decoding process: generate attention weights, generate a context vector based on the spatial-aware feature map and the generated attention weights using an attention layer, process the context vector, the key, and an input vector using a recurrent neural network (RNN) to obtain a RNN state, and generate an output vector based on the RNN state and the context vector using a projection layer. The embodiment then extracts a field based on the result of the decoding process.

Status:

Application

Type:

Utility

Filling date:

22 Oct 2020

Issue date:

28 Apr 2022

Full patent description

Patent application document

SAP SE Document Information Extraction Without Additional Annotations

Abstract:

SAP SE
Document Information Extraction Without Additional Annotations