Biodesix, Inc.
INTERPRETATION OF MACHINE LEARNING CLASSIFICATIONS IN CLINICAL DIAGNOSTICS USING SHAPELY VALUES AND USES THEREOF
Abstract:
Shapley values (SVs) have become an important tool for making machine learning (ML) models explainable. However, the computational cost of exact SV calculation grows exponentially with the number of attributes, so computing SVs for models that incorporate large numbers of interpretable attributes is problematic. Molecular diagnostic tests typically seek to leverage information from hundreds or thousands of attributes, often using training sets with fewer instances than attributes. Methods are described for evaluating SVs using Monte Carlo sampling or exact calculation in polynomial time (i.e., reasonably quickly and efficiently) using the architecture of an ML model designed for robust molecular test generation, and without requiring classifier retraining.
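As an illustration of the cost trade-off described in the abstract, the following minimal Python sketch contrasts exact SV calculation by subset enumeration (cost growing as 2^n in the number of attributes n) with a generic Monte Carlo estimate based on random attribute orderings. It is not the method claimed in the application, and it does not exploit any particular model architecture. The names `exact_shapley`, `monte_carlo_shapley`, and `toy_value`, and the toy value function itself, are assumptions introduced only for this example.

```python
import itertools
import math
import random


def exact_shapley(value, n):
    """Exact SVs by enumerating every subset of the other attributes.

    `value` maps a frozenset of attribute indices to a number.
    The number of value evaluations grows as 2**n.
    """
    phi = [0.0] * n
    for i in range(n):
        others = [j for j in range(n) if j != i]
        for k in range(n):
            # Shapley weight for coalitions of size k that exclude attribute i
            weight = (math.factorial(k) * math.factorial(n - k - 1)
                      / math.factorial(n))
            for subset in itertools.combinations(others, k):
                s = frozenset(subset)
                phi[i] += weight * (value(s | {i}) - value(s))
    return phi


def monte_carlo_shapley(value, n, num_permutations=2000, seed=0):
    """Estimate SVs by averaging marginal contributions over random orderings."""
    rng = random.Random(seed)
    phi = [0.0] * n
    order = list(range(n))
    for _ in range(num_permutations):
        rng.shuffle(order)
        coalition = set()
        previous = value(frozenset(coalition))
        for i in order:
            coalition.add(i)
            current = value(frozenset(coalition))
            phi[i] += current - previous  # marginal contribution of attribute i
            previous = current
    return [total / num_permutations for total in phi]


def toy_value(coalition):
    """Hypothetical value function: additive weights plus one interaction term."""
    weights = [1.0, 2.0, 3.0, 0.5]
    total = sum(weights[j] for j in coalition)
    if 0 in coalition and 1 in coalition:
        total += 1.0  # attributes 0 and 1 interact
    return total


exact = exact_shapley(toy_value, 4)
estimate = monte_carlo_shapley(toy_value, 4)
print("exact:   ", exact)
print("estimate:", estimate)
```

In this sketch the sampled estimate needs `num_permutations * (n + 1)` evaluations of the value function rather than a number that grows exponentially in n, at the price of sampling error in attributes that interact with others.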
Utility
28 Jun 2021
16 Jun 2022