Amazon.com, Inc.
Systems, apparatuses, and method for document ingestion
Last updated:
Abstract:
Techniques for intaking one or more documents are described. An exemplary method includes receiving an ingestion request to ingest a document; extracting text from the document; pre-processing the extracted text to generate pre-processed text that is predictable and analyzable; generating an index entry for the extracted text, the index entry to map the extracted text to a reserved field of a plurality of reserved fields; and storing the extracted text, index entry, and pre-processed text in at least one data storage location.
Status:
Grant
Type:
Utility
Filling date:
27 Nov 2019
Issue date:
26 Apr 2022