International Business Machines Corporation
EXTRACTING RELEVANT SENTENCES FROM TEXT CORPUS

Last updated:

Abstract:

Managing a computer database having a plurality of data entries, in a talent framework system. Generate a description text field of a data entry of the computer database by classifying sentences of one or more text streams stored in a text stream corpus. The classifying is performed by identifying, for a given sentence, whether the given sentence is relevant or irrelevant to a title text field of the data entry. K-means clustering can be used, where experimental data show that k=2 produces desirable classification outcomes.

Status:
Application
Type:

Utility

Filling date:

10 Feb 2020

Issue date:

12 Aug 2021