International Business Machines Corporation
Mixup image captioning

Last updated: 18 May 2022

Abstract:

In an approach to augmenting caption datasets, one or more computer processors sample a ratio lambda from a probability distribution based on a pair of datapoints contained in a dataset, wherein each datapoint in the pair of datapoints comprises an image and an associated caption; extend the dataset by generating one or more new datapoints based on the sampled ratio lambda for each pair of datapoints in the dataset, wherein the sampled ratio lambda incorporates an interpolation of features associated with the pair of datapoints into the generated one or more new datapoints; identify one or more objects contained within a subsequent image utilizing an image model trained utilizing the extended dataset; generate a subsequent caption for one or more identified objects contained within the subsequent image utilizing a language generating model trained utilizing the extended dataset.

Status:

Grant

Type:

Utility

Filling date:

7 Jul 2020

Issue date:

17 May 2022

Full patent description

Patent application document

International Business Machines Corporation Mixup image captioning

Abstract:

International Business Machines Corporation
Mixup image captioning