International Business Machines Corporation
LIGHTWEIGHT TAGGING FOR DISJOINT ENTITIES

Last updated:

Abstract:

Text data including at least named entities can be received. From the named entities, continuous entities, overlapping entities and disjoint entities can be identified. The overlapping entities can be transformed into continuous entities. The continuous entities, the transformed entities and the disjoint entities can be encoded. The encoded entities can be input to a machine learning language model to train the machine learning model to predict candidate entities. The predicted entities can be decoded to reconstruct the predicted entities.

Status:
Application
Type:

Utility

Filling date:

30 Jan 2020

Issue date:

5 Aug 2021