International Business Machines Corporation
Linked data seeded multi-lingual lexicon extraction
Abstract:
One embodiment provides a method for relevant language-independent terminology extraction from content, the method including extracting lexicon items from the content based on context extraction patterns using statistical processing. Feedback on the extracted lexicon items is received to automatically tune scores and thresholds for the context extraction patterns. Available Linked Data is leveraged for a bootstrap source. The relevant language-independent terminology extraction is bootstrapped using the bootstrap source.
Status:
Grant
Type:
Utility
Filing date:
11 Jul 2018
Issue date:
2 Nov 2021
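
Illustration of the abstract:
The loop described in the abstract (seed a lexicon, score context patterns statistically, extract new items, tune on feedback) can be sketched roughly as follows. This is a minimal toy sketch, not the patented method. Every function name, the (left, right) neighbour-token pattern shape, the precision-style score, and the 0.5 penalty are assumptions made for illustration. The seed set is hard-coded here; in the setting the abstract describes it would come from available Linked Data. Only pattern scores are tuned in this sketch, while the abstract also mentions tuning thresholds.

```python
from collections import defaultdict

def contexts(tokens):
    """Yield ((left, right), term) for every token, using sentence-boundary padding."""
    padded = ["<s>"] + tokens + ["</s>"]
    for i in range(1, len(padded) - 1):
        yield (padded[i - 1], padded[i + 1]), padded[i]

def score_patterns(sentences, lexicon):
    """Score a pattern as the fraction of its matches that are known lexicon items."""
    hits, total = defaultdict(int), defaultdict(int)
    for sent in sentences:
        for pattern, term in contexts(sent.split()):
            total[pattern] += 1
            hits[pattern] += term in lexicon  # bool counts as 0 or 1
    # Keep only patterns that matched at least one known item.
    return {p: hits[p] / total[p] for p in total if hits[p] > 0}

def extract(sentences, lexicon, scores, threshold):
    """Return unseen terms that occur in a pattern scoring at or above threshold."""
    found = set()
    for sent in sentences:
        for pattern, term in contexts(sent.split()):
            if scores.get(pattern, 0.0) >= threshold and term not in lexicon:
                found.add(term)
    return found

def apply_feedback(sentences, scores, rejected, penalty=0.5):
    """Multiply by `penalty`, once, the score of each pattern that matched a rejected term."""
    to_penalise = set()
    for sent in sentences:
        for pattern, term in contexts(sent.split()):
            if term in rejected and pattern in scores:
                to_penalise.add(pattern)
    return {p: s * penalty if p in to_penalise else s for p, s in scores.items()}

# Hypothetical seed terms standing in for labels pulled from a Linked Data source.
seeds = {"aspirin", "ibuprofen"}
sentences = [
    "patients received aspirin daily",
    "patients received ibuprofen daily",
    "patients received warfarin daily",
    "doctors prescribed aspirin today",
    "doctors prescribed nothing today",
]

scores = score_patterns(sentences, seeds)
first_pass = extract(sentences, seeds, scores, threshold=0.5)
print(sorted(first_pass))   # ['nothing', 'warfarin']

# A reviewer rejects "nothing"; the pattern that produced it drops below threshold.
tuned = apply_feedback(sentences, scores, rejected={"nothing"})
second_pass = extract(sentences, seeds, tuned, threshold=0.5)
print(sorted(second_pass))  # ['warfarin']
```

The sketch relies only on whitespace tokenisation and co-occurrence counts, which is one simple way to stay language-independent; a real system would need multi-word terms and a richer pattern representation.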