International Business Machines Corporation
LABELING DATA USING AUTOMATED WEAK SUPERVISION

Last updated:

Abstract:

A computer-implemented method includes: receiving, by a computing device, data comprising a labeled dataset and an unlabeled dataset; generating, by the computing device, a set of heuristics using the labeled dataset; generating, by the computing device, a vector of initial labels by labeling each point in the unlabeled dataset using the set of heuristics; generating, by the computing device, a refined set of heuristics using data-driven active learning; generating, by the computing device, a vector of training labels by automatically labeling each point in the unlabeled dataset using the refined set of heuristics; and outputting, by the computing device, the vector of training labels to a client device or a data repository.

Status:
Application
Type:

Utility

Filling date:

2 Jan 2020

Issue date:

8 Jul 2021