International Business Machines Corporation
AUTOMATED NATURAL LANGUAGE SPLITTING FOR GENERATION OF KNOWLEDGE GRAPHS
Last updated:
Abstract:
Splitting a natural language sentence into primitive phrases retaining relations of terms includes receiving a natural language sentence, building a parse tree from the natural language sentence using a natural language parser, and recursively identifying discourse markers in subtrees of the parse tree, starting with the highest ranking discourse marker in the parse tree, thereby separating each of the respective subtrees at the respective discourse marker using a set of predefined rules until a set of basic subtrees remains. The recursive identification includes looking-ahead for identifying long ranging discourse markers before identifying local discourse markers.
Status:
Application
Type:
Utility
Filling date:
16 Dec 2020
Issue date:
16 Jun 2022