NetApp, Inc.
MULTI-MODAL ELECTRONIC DOCUMENT CLASSIFICATION
Last updated:
Abstract:
A method comprising operating at least one hardware processor for: receiving, as input, a plurality of electronic documents; training a machine learning classifier based, at least in part, on a training set comprising: (i) labels associated with the electronic documents, (ii) raw text from each of said plurality of electronic documents, and (iii) a rasterized version of each of said plurality of electronic documents; and applying said machine learning classifier to classify one or more new electronic documents.
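The three training inputs named in the abstract (labels, raw text, and a rasterized page) can be illustrated with a toy sketch. This is not the claimed implementation: the abstract does not specify a model, so a nearest-centroid classifier over concatenated features stands in for it. All names here (text_features, raster_features, fuse, train, classify, VOCAB, and the tiny 4x4 "pages") are hypothetical and chosen only for illustration.

```python
from collections import Counter
from math import sqrt


def text_features(raw_text, vocabulary):
    """Bag-of-words counts over a fixed vocabulary (assumed text encoder)."""
    counts = Counter(raw_text.lower().split())
    return [float(counts[word]) for word in vocabulary]


def raster_features(pixels, grid=2):
    """Mean intensity (scaled to 0..1) per cell of a grid x grid partition.

    `pixels` is a list of equal-length rows of grayscale values in 0..255,
    standing in for a rasterized page (assumed image encoder).
    """
    height, width = len(pixels), len(pixels[0])
    features = []
    for gy in range(grid):
        for gx in range(grid):
            rows = range(gy * height // grid, (gy + 1) * height // grid)
            cols = range(gx * width // grid, (gx + 1) * width // grid)
            cell = [pixels[y][x] for y in rows for x in cols]
            features.append(sum(cell) / (255.0 * len(cell)))
    return features


def fuse(raw_text, pixels, vocabulary):
    """Concatenate the text and raster feature vectors into one vector."""
    return text_features(raw_text, vocabulary) + raster_features(pixels)


def train(documents, vocabulary):
    """Build one centroid per label from (label, raw_text, pixels) triples."""
    sums, counts = {}, Counter()
    for label, raw_text, pixels in documents:
        vec = fuse(raw_text, pixels, vocabulary)
        if label not in sums:
            sums[label] = [0.0] * len(vec)
        sums[label] = [s + v for s, v in zip(sums[label], vec)]
        counts[label] += 1
    return {lab: [s / counts[lab] for s in total] for lab, total in sums.items()}


def classify(model, raw_text, pixels, vocabulary):
    """Return the label whose centroid is closest in Euclidean distance."""
    vec = fuse(raw_text, pixels, vocabulary)

    def distance(centroid):
        return sqrt(sum((a - b) ** 2 for a, b in zip(vec, centroid)))

    return min(model, key=lambda label: distance(model[label]))


# Hypothetical toy data: a page with a dark header band versus a blank page.
VOCAB = ["invoice", "total", "dear", "regards"]
DARK_TOP = [[0] * 4, [0] * 4, [255] * 4, [255] * 4]
BLANK = [[255] * 4 for _ in range(4)]

TRAINING_SET = [
    ("invoice", "Invoice total due", DARK_TOP),
    ("letter", "Dear customer regards", BLANK),
]
MODEL = train(TRAINING_SET, VOCAB)
```

Calling classify(MODEL, some_text, some_pixels, VOCAB) on a new document fuses both modalities the same way as in training. A production system would likely replace both encoders and the centroid rule with learned models; the sketch only shows how text and raster inputs can feed a single classifier.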
Status:
Application
Type:
Utility
Filing date:
10 Feb 2019
Issue date:
16 Jan 2020