International Business Machines Corporation
Structured term recognition
Abstract:
A method, system and computer program product for recognizing terms in a specified corpus. In one embodiment, the method comprises providing a set of known terms t ∈ T, each of the known terms t belonging to a set of types Γ(t) = {γ₁, ...}, wherein each of the terms is comprised of a list of words, t = w₁, w₂, ..., wₙ, and the union of all the words for all the terms is a word set W. The method further comprises using the set of terms T and the set of types to determine a set of pattern-to-type mappings p → γ; and using the set of pattern-to-type mappings to recognize terms in the specified corpus and, for each of the recognized terms in the specified corpus, to recognize one or more of the types γ for said each recognized term.
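The two steps described in the abstract (deriving pattern-to-type mappings from known terms, then applying them to a corpus) can be illustrated with a minimal sketch. The abstract does not say how patterns are formed, so everything below is an assumption made for illustration only: a word's class is taken to be the set of types of the known terms that contain it, a pattern is the sequence of word classes of a term, and matching is greedy and longest-first. The names `learn_mappings` and `recognize` and the sample terms are hypothetical and are not taken from the patent.

```python
from collections import defaultdict


def learn_mappings(terms):
    """terms: dict mapping each known term t (a tuple of words) to its type set.

    Returns (word_class, mappings). The word-class abstraction used here is
    an illustrative assumption, not the method claimed in the patent.
    """
    # Assumed abstraction: class of a word = types of all known terms containing it.
    word_types = defaultdict(set)
    for words, types in terms.items():
        for w in words:
            word_types[w] |= types
    word_class = {w: frozenset(ts) for w, ts in word_types.items()}

    # A pattern p is the tuple of word classes of a term; map p to the types seen with it.
    mappings = defaultdict(set)
    for words, types in terms.items():
        pattern = tuple(word_class[w] for w in words)
        mappings[pattern] |= types
    return word_class, dict(mappings)


def recognize(tokens, word_class, mappings):
    """Scan tokens left to right, taking the longest span whose pattern is known."""
    max_len = max(len(p) for p in mappings)
    found = []
    i = 0
    while i < len(tokens):
        matched = 0
        for n in range(max_len, 0, -1):
            span = tokens[i:i + n]
            # Skip spans that run past the end or contain words outside W.
            if len(span) < n or any(w not in word_class for w in span):
                continue
            pattern = tuple(word_class[w] for w in span)
            if pattern in mappings:
                found.append((" ".join(span), mappings[pattern]))
                matched = n
                break
        i += matched if matched else 1
    return found


# Hypothetical known terms T with their types.
known_terms = {
    ("lung", "cancer"): {"disease"},
    ("breast", "carcinoma"): {"disease"},
    ("aspirin", "tablet"): {"drug"},
}
word_class, mappings = learn_mappings(known_terms)
tokens = "patient with breast cancer took an aspirin tablet".split()
found = recognize(tokens, word_class, mappings)
```

In this sketch "breast cancer" is not among the known terms, yet it is recognized and typed as a disease because its word-class pattern matches one learned from "lung cancer" and "breast carcinoma". That generalization from words to patterns is the point of mapping patterns, rather than literal terms, to types.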
Utility
24 May 2019
11 Jan 2022