International Business Machines Corporation
Automated document filtration and prioritization for document searching and access

Last updated:

Abstract:

Computer based methods, systems, and computer readable media for classifying documents within a content repository or documents within the document subsets are provided. Documents may be pre-processed to render document sections visible to machine readers. Document subsets may be generated based on user-defined terms. The machine readable documents may be classified within the content repository into one of a group of categories, based-upon the number of times classification terms appear in a specific document section of the document. Documents may be ranked based upon the frequency of classification terms in the specific section. Documents may be associated with specific diseases such as cancer, genes, gene variants, and drugs or synonyms thereof by comparing relevant search terms to specific sections of the documents.

Status:
Grant
Type:

Utility

Filling date:

30 Nov 2018

Issue date:

27 Jul 2021