Coupa Software Incorporated
TEXT-BASED MACHINE LEARNING EXTRACTION OF TABLE DATA FROM A READ-ONLY DOCUMENT

Last updated:

Abstract:

Embodiments of the disclosed technologies provide solutions for automatically reading digital electronic documents that contain tables and correctly extracting table data, rows and columns from the documents with high accuracy and high throughput. Embodiments are capable of converting a table portion of a read-only document to a searchable, editable data record using text rectangle (TR)-level numerical data that indicates probabilities of TRs belonging to canonicals and at least one convolutional neural network (CNN) that processes the TR-level numerical data to produce table-level numerical data.

Status:
Application
Type:

Utility

Filling date:

20 Oct 2020

Issue date:

3 Mar 2022