Mastercard Incorporated
METHODS AND SYSTEMS FOR PROCESSING UNSTRUCTURED AND UNLABELLED DATA
Abstract:
Embodiments provide methods and systems for processing unstructured and unlabelled data. A method includes generating, by a processor, a structured and unlabelled training dataset from an unstructured and unlabelled dataset. The method includes categorizing the structured and unlabelled training dataset into a plurality of clusters by executing an unsupervised algorithm. Each cluster of a selected set of clusters from the plurality of clusters is labelled with an applicable label from a set of labels. The method includes executing a supervised algorithm to generate a trained supervised model using a labelled training dataset that includes the set of labels and an input dataset generated from a plurality of datapoints present in each cluster of the selected set of clusters. The method includes generating Labelled Data1 (LD1) by executing the trained supervised model, which is configured to assign an applicable label from the set of labels to each datapoint of the structured and unlabelled training dataset.
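The sequence described in the abstract (cluster, label a selected set of clusters, train a supervised model, then label every datapoint) can be illustrated with a minimal sketch. The abstract does not name specific algorithms, so everything concrete here is an assumption made for illustration: k-means stands in for the unsupervised algorithm, a nearest-centroid classifier stands in for the supervised algorithm, clusters are selected by a hypothetical minimum size, and the hypothetical `label_rule` function stands in for however labels are actually applied to clusters. The toy datapoints are assumed to be already structured.

```python
import math

def mean(rows):
    """Component-wise mean of a list of equal-length tuples."""
    return tuple(sum(col) / len(rows) for col in zip(*rows))

def nearest(point, centroids):
    """Index of the centroid closest to point (ties go to the lowest index)."""
    return min(range(len(centroids)), key=lambda i: math.dist(point, centroids[i]))

def kmeans(points, k, iters=10):
    """Assumed unsupervised step: k-means with deterministic farthest-point init."""
    centroids = [points[0]]
    while len(centroids) < k:
        centroids.append(max(points, key=lambda p: min(math.dist(p, c) for c in centroids)))
    for _ in range(iters):
        assignment = [nearest(p, centroids) for p in points]
        for c in range(k):
            members = [p for p, a in zip(points, assignment) if a == c]
            if members:
                centroids[c] = mean(members)
    return [nearest(p, centroids) for p in points]

def train_nearest_centroid(samples, labels):
    """Assumed supervised step: store one centroid per label."""
    return {lab: mean([s for s, l in zip(samples, labels) if l == lab])
            for lab in set(labels)}

def predict(model, point):
    """Assign the label whose centroid is closest to point."""
    return min(model, key=lambda lab: math.dist(point, model[lab]))

def build_ld1(points, k, label_rule, min_cluster_size=2):
    """Cluster, label the selected clusters, train, then label every datapoint."""
    assignment = kmeans(points, k)
    clusters = {}
    for p, c in zip(points, assignment):
        clusters.setdefault(c, []).append(p)
    # Hypothetical selection criterion: keep clusters with enough members.
    selected = {c: m for c, m in clusters.items() if len(m) >= min_cluster_size}
    samples, labels = [], []
    for members in selected.values():
        samples += members
        labels += [label_rule(members)] * len(members)
    model = train_nearest_centroid(samples, labels)
    # LD1: the trained model labels all datapoints, including unselected clusters.
    return [predict(model, p) for p in points], selected

# Toy structured, unlabelled dataset (illustrative values only).
points = [(1.0, 1.0), (1.2, 0.8), (0.9, 1.1),
          (8.0, 8.0), (8.2, 7.9), (7.8, 8.1),
          (3.0, 4.0)]

# Hypothetical stand-in for applying a label from the set {"low", "high"}.
def label_rule(members):
    return "low" if mean(members)[0] < 4.5 else "high"

ld1, selected = build_ld1(points, k=3, label_rule=label_rule)
print(ld1)
```

In this sketch the isolated datapoint forms its own cluster, which is not selected for labelling, yet it still receives a label in LD1 because the trained model is applied to the whole structured dataset rather than only to the labelled clusters.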
Utility
7 Oct 2021
14 Apr 2022