Coupa Software Incorporated
TEXT-BASED MACHINE LEARNING EXTRACTION OF TABLE DATA FROM A READ-ONLY DOCUMENT
Last updated:
Abstract:
Embodiments of the disclosed technologies provide solutions for automatically reading digital electronic documents that contain tables and correctly extracting table data, rows and columns from the documents with high accuracy and high throughput. Embodiments are capable of converting a table portion of a read-only document to a searchable, editable data record using text rectangle (TR)-level numerical data that indicates probabilities of TRs belonging to canonicals and at least one convolutional neural network (CNN) that processes the TR-level numerical data to produce table-level numerical data.
Status:
Application
Type:
Utility
Filling date:
20 Oct 2020
Issue date:
3 Mar 2022