Microsoft Corporation
Parsing an Ink Document using Object-Level and Stroke-Level Processing
Last updated:
Abstract:
Technology is descried herein for parsing an ink document having a plurality of ink strokes. The technology performs stroke-level processing on the plurality of ink strokes to produce stroke-level information, the stroke-level information identifying at least one characteristic associated with each ink stroke. The technology also performs object-level processing on individual objects within the ink document to produce object-level information, the object-level information identifying one or more groupings of ink strokes in the ink document. The technology then parses the ink document into constituent parts based on the stroke-level information and the object-level information. In some implementations, the technology converts the ink stroke data into an ink image. The stroke-level processing and/or the object-level processing may operate on the ink image using one or more neural networks. More specifically the stroke-level processing can classify pixels in the input image, while the object-level processing can identify bounding boxes containing possible objects.
Utility
10 Dec 2020
16 Jun 2022