NetApp, Inc.
MULTI-MODAL ELECTRONIC DOCUMENT CLASSIFICATION

Last updated:

Abstract:

A method comprising operating at least one hardware processor for: receiving, as input, a plurality of electronic documents, training a machine learning classifier based, at least on part, on a training set comprising: (i) labels associated with the electronic documents, (ii) raw text from each of said plurality of electronic documents, and (iii) a rasterized version of each of said plurality of electronic documents, and applying said machine learning classifier to classify one or more new electronic documents.

Status:
Application
Type:

Utility

Filling date:

10 Feb 2019

Issue date:

16 Jan 2020