International Business Machines Corporation
INTERPRETING TEXT CLASSIFICATION PREDICTIONS THROUGH DETERMINISTIC EXTRACTION OF PROMINENT N-GRAMS

Abstract:

Provided are a computer program product, system, and method for interpreting text classification predictions through deterministic extraction of prominent n-grams. A determination is made of n-gram vectors comprising word embeddings of n-grams in a document and of a document vector comprising word embeddings of the document. A label comprising a text classification of the document is received from a text classifier program. A determination is made of a label vector comprising word embeddings of the label. The n-gram vectors, the document vector, and the label vector are used to determine the n-grams that explain the text classification produced by the text classifier program.
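
The flow described in the abstract can be illustrated with a minimal sketch. The abstract does not state how the three kinds of vectors are combined, so everything below is an assumption made for illustration: the toy embedding table, the averaging of word embeddings into a single vector, the scoring rule (product of cosine similarities to the label vector and to the document vector), and the helper names mean_vector, cosine, ngrams, and prominent_ngrams are all hypothetical and are not taken from the application.

```python
import math

# Hypothetical 3-dimensional word embeddings, invented for this example.
EMBEDDINGS = {
    "great": [0.9, 0.1, 0.0],
    "battery": [0.1, 0.8, 0.1],
    "life": [0.2, 0.7, 0.1],
    "positive": [1.0, 0.0, 0.0],
}
DIM = 3


def mean_vector(words):
    """Average the embeddings of the known words (assumed pooling rule)."""
    vecs = [EMBEDDINGS[w] for w in words if w in EMBEDDINGS]
    if not vecs:
        return [0.0] * DIM
    return [sum(component) / len(vecs) for component in zip(*vecs)]


def cosine(a, b):
    """Cosine similarity; returns 0.0 if either vector has zero length."""
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return sum(x * y for x, y in zip(a, b)) / (norm_a * norm_b)


def ngrams(tokens, n):
    """All contiguous n-grams of the token list, as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def prominent_ngrams(tokens, label_words, n=2, top_k=3):
    """Rank n-grams by closeness to both the label and the document."""
    doc_vec = mean_vector(tokens)         # document vector
    label_vec = mean_vector(label_words)  # label vector
    scored = []
    for gram in ngrams(tokens, n):
        gram_vec = mean_vector(gram)      # n-gram vector
        # Assumed scoring rule: product of the two cosine similarities.
        score = cosine(gram_vec, label_vec) * cosine(gram_vec, doc_vec)
        scored.append((gram, score))
    # Sort by score, then by the n-gram itself, so ties break the same way
    # on every run and the extraction stays deterministic.
    scored.sort(key=lambda item: (-item[1], item[0]))
    return scored[:top_k]


if __name__ == "__main__":
    document = ["great", "battery", "life"]
    for gram, score in prominent_ngrams(document, ["positive"]):
        print(" ".join(gram), round(score, 3))
```

Under these assumptions, the n-gram closest to the label "positive" ranks first, and because no sampling or random initialization is involved, repeated runs on the same input return the same ranking.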

Status:

Application

Type:

Utility

Filing date:

10 Jan 2020

Issue date:

15 Jul 2021