International Business Machines Corporation
VISUAL QUESTION GENERATION WITH ANSWER-AWARENESS AND REGION-REFERENCE

Last updated:

Abstract:

A computer-implemented method for visual question generation includes training an alignment module to analyze an image, an answer hint, and a visual hint with respect to the image. A k-nearest neighbors (KNN) graph is constructed by performing an aligned embedding for each region of the image. A node embedding component is generated by using a graph embedding component of the KNN graph. A visual question is generated by sequence decoding each image and graph of the image.

Status:
Application
Type:

Utility

Filling date:

29 Jan 2021

Issue date:

4 Aug 2022