Intuit Inc.
Relative positional parsing of documents using trees

Last updated:

Abstract:

Certain aspects of the present disclosure provide techniques for improved retrieval of data from documents. Embodiments include receiving, from a user, a definition of a document region, wherein the definition comprises coordinates relative to a location on a document page. Embodiments include receiving, from the user, an identifier associated with the document region. Embodiments include receiving a document comprising one or more elements. The document may not support queries for the one or more elements. Embodiments include building a tree based on the document, the tree including one or more elements with element coordinates. Embodiments include retrieving an item of data associated with the identifier by determining that the element coordinates of an element in the tree are within the document region associated with the identifier and retrieving the element as the item of data. Embodiments include using the item of data to perform an action.

Status:
Grant
Type:

Utility

Filling date:

1 Aug 2018

Issue date:

15 Jun 2021