Amazon.com, Inc.
Systems, apparatuses, and method for document ingestion

Abstract:

Techniques for intaking one or more documents are described. An exemplary method includes receiving an ingestion request to ingest a document; extracting text from the document; pre-processing the extracted text to generate pre-processed text that is predictable and analyzable; generating an index entry for the extracted text, the index entry to map the extracted text to a reserved field of a plurality of reserved fields; and storing the extracted text, index entry, and pre-processed text in at least one data storage location.
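
The steps named in the abstract (receive a request, extract text, pre-process it, build an index entry that maps to a reserved field, store all three artifacts) can be illustrated with a minimal sketch. Everything below is an assumption for illustration, not the patented implementation: the names `IngestionRequest`, `DocumentStore`, `RESERVED_FIELDS`, `extract_text`, `preprocess`, and `ingest` are hypothetical, the reserved fields "title" and "body" are invented, text extraction is reduced to UTF-8 decoding, and pre-processing is reduced to lowercasing and whitespace collapsing.

```python
import re
from dataclasses import dataclass, field

# Hypothetical reserved fields; the abstract does not name any.
RESERVED_FIELDS = ("title", "body")


@dataclass
class IngestionRequest:
    """Hypothetical request shape: a document id, raw bytes, and a target field."""
    doc_id: str
    content: bytes
    target_field: str = "body"


@dataclass
class DocumentStore:
    """In-memory stand-in for the 'at least one data storage location'."""
    texts: dict = field(default_factory=dict)
    index: dict = field(default_factory=dict)
    preprocessed: dict = field(default_factory=dict)


def extract_text(content: bytes) -> str:
    # Stand-in for real extraction (PDF, HTML, and so on): decode as UTF-8.
    return content.decode("utf-8", errors="replace")


def preprocess(text: str) -> str:
    # Assumed normalization: collapse whitespace runs, trim, and lowercase.
    return re.sub(r"\s+", " ", text).strip().lower()


def ingest(request: IngestionRequest, store: DocumentStore) -> dict:
    """Run the abstract's steps in order and return the index entry."""
    if request.target_field not in RESERVED_FIELDS:
        raise ValueError(f"unknown reserved field: {request.target_field!r}")

    text = extract_text(request.content)
    cleaned = preprocess(text)

    # The index entry maps the extracted text (by document id) to a reserved field.
    entry = {"doc_id": request.doc_id, "field": request.target_field}

    store.texts[request.doc_id] = text
    store.index[request.doc_id] = entry
    store.preprocessed[request.doc_id] = cleaned
    return entry
```

In this sketch the extracted text and the pre-processed text are stored separately, mirroring the abstract's statement that both are kept alongside the index entry.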

Status:

Grant

Type:

Utility

Filing date:

27 Nov 2019

Issue date:

26 Apr 2022