International Business Machines Corporation
BOOTSTRAPPING RELATION TRAINING DATA
Last updated:
Abstract:
Aspects of the invention include a computer-implemented method for bootstrapping relation training data. The method includes traversing a corpus to detect a first passage having a first set of co-occurring entities and intervening tokens associated with a relation type. Identifying a first predicate frame of the first passage based on the co-occurring entities and intervening tokens. Traversing the corpus again to detect a second passage having a second predicate frame with a same semantic structure as the first predicate frame, wherein the passage contains a second set of co-occurring entities associated with the relation during first instance that the processor did not detect during the first time. Detecting a second set of co-occurring entities in the second passage based on the second predicate frame. Annotating the second set of co-occurring entities to have a same relation as the first set of co-occurring entities.
Utility
31 Jul 2020
3 Feb 2022