Microsoft Corporation
CLASSIFICATION OF AUDITORY AND VISUAL MEETING DATA TO INFER IMPORTANCE OF USER UTTERANCES
Abstract:
In non-limiting examples of the present disclosure, systems, methods and devices for generating summary content are presented. Voice audio data and video data for an electronic meeting may be received. A language processing model may be applied to a transcript of the audio data and textual importance scores may be calculated. A video/image model may be applied to the video data and visual importance scores may be calculated. A combined importance score may be calculated for sections of the electronic meeting based on the textual importance scores and the visual importance scores. A meeting summary that includes summary content from sections for which combined importance scores exceed a threshold value may be generated.
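The scoring and thresholding steps described in the abstract can be illustrated with a minimal sketch. The abstract does not say how the textual and visual scores are combined, so the weighted average used here is an assumption, as are the names `Section`, `combined_score`, `summarize`, the `text_weight` parameter, and the example scores and threshold. The per-section scores are taken as given; in the disclosure they would come from the language processing model and the video/image model.

```python
from dataclasses import dataclass


@dataclass
class Section:
    """One section of an electronic meeting with its two modality scores (hypothetical type)."""
    text: str
    textual_score: float  # assumed output of a language model applied to the transcript
    visual_score: float   # assumed output of a video/image model applied to the video


def combined_score(section: Section, text_weight: float = 0.5) -> float:
    """Blend the two scores with a weighted average.

    The weighted average is an assumption; the abstract only says the
    combined score is based on both scores.
    """
    return (text_weight * section.textual_score
            + (1.0 - text_weight) * section.visual_score)


def summarize(sections, threshold=0.6, text_weight=0.5):
    """Return the text of sections whose combined score exceeds the threshold."""
    return [s.text for s in sections
            if combined_score(s, text_weight) > threshold]


# Illustrative data only; these scores are made up for the example.
sections = [
    Section("Ship date moves to Friday.", 0.9, 0.7),
    Section("Small talk about the weather.", 0.2, 0.1),
    Section("Alice will own the migration.", 0.6, 0.8),
]
summary = summarize(sections)
print(summary)
```

With these made-up scores, the first and third sections pass the 0.6 threshold and the small-talk section is dropped. Raising `text_weight` would let the transcript dominate the decision, and lowering it would favor visual cues.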
Utility
4 Jun 2020
9 Dec 2021