Efficient KernelSHAP Explanations for Patch-based 3D Medical Image Segmentation

KernelSHAP, vital for model explainability, is currently too slow for patch-based 3D medical image segmentation due to high computational demands. This research proposes an efficient variant to make practical explanations feasible for critical medical AI applications.

intermediate20 min5 steps

The play

Install SHAP Library
Set up your Python environment by installing the SHAP (SHapley Additive exPlanations) library, which provides the KernelSHAP algorithm.
Run Basic KernelSHAP on Tabular Data
Execute the provided starter code to see KernelSHAP in action on a simple, small-scale dataset. This demonstrates how feature contributions are calculated for a single prediction.
Observe Computational Cost
Note the execution time for even this small example. Realize that KernelSHAP evaluates numerous 'coalitions' of features, leading to significant computation for each explanation.
Understand 3D Medical Imaging Scale
Consider applying this process to 3D medical images. Segmentation models often use patch-based inference with sliding windows, meaning a single 3D image requires explanations for hundreds or thousands of overlapping patches, each with many 'features' (voxels or superpixels).
Recognize the Efficiency Gap
Conclude that the current KernelSHAP's computational burden (many coalition evaluations per patch, multiplied by many patches per image) makes it impractical for 3D medical imaging, highlighting the critical need for the efficient variant described in the research.

Starter code

import shap
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.ensemble import RandomForestClassifier
from sklearn.datasets import load_iris
import time

# Load a simple dataset
iris = load_iris()
X, y = iris.data, iris.target
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Train a simple model
model = RandomForestClassifier(random_state=42)
model.fit(X_train, y_train)

# Select a single instance to explain
instance_to_explain = X_test[0, :]

# Initialize KernelSHAP explainer
explainer = shap.KernelExplainer(model.predict_proba, X_train)

print(f"Explaining instance: {instance_to_explain}")
start_time = time.time()
# Calculate SHAP values for the instance
shap_values = explainer.shap_values(instance_to_explain)
end_time = time.time()

print(f"\nSHAP values for instance (per class): {shap_values}")
print(f"Time taken for explanation: {end_time - start_time:.4f} seconds")

# Optional: Visualize the explanation
# shap.initjs()
# shap.force_plot(explainer.expected_value[0], shap_values[0], instance_to_explain, feature_names=iris.feature_names)

Source

Paperarxiv.org