
Add explain class and shap #74

Open
npanczyk wants to merge 42 commits into develop from explain_stash

Conversation

@npanczyk (Collaborator) commented Nov 4, 2024

PR Description

This PR adds an explain class and the SHAP package to pyMAISE.

Closes: #67

What changes were made?

Will ultimately include:

  • new explain class
  • updated CHF benchmark
  • unit tests (not done yet)

Reviewers: @myerspat @mradaideh

Important

Please do not review the SHAP package files (i.e., explain/shap)!

@npanczyk npanczyk added the documentation (Improvements or additions to documentation) and enhancement (New feature or request) labels Nov 4, 2024
@npanczyk npanczyk self-assigned this Nov 4, 2024
@myerspat myerspat closed this Nov 5, 2024
@myerspat myerspat reopened this Nov 5, 2024
@npanczyk npanczyk marked this pull request as ready for review November 5, 2024 23:11
@npanczyk npanczyk requested review from mradaideh and myerspat and removed request for mradaideh November 5, 2024 23:11
@mradaideh (Collaborator)

Hi @npanczyk, thanks for the great work. I tested your branch in a clean venv and it worked like a charm for both short (small search) and long (full search) runs. Here are some things to consider before merging:

1- After installing pyMAISE in a fresh venv, I had to install three more dependencies for shap. As we discussed, these should be added to pyMAISE's default dependencies, pinned to these specific versions:

slicer-0.0.8
numba-0.60.0
cloudpickle-3.0.0
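For reference, pinning these in pip requirements syntax would look like the fragment below; whether they belong in setup.py, pyproject.toml, or a requirements file depends on how pyMAISE declares its dependencies, so treat this as a sketch:

```
# Hypothetical pins for the extra SHAP dependencies
slicer==0.0.8
numba==0.60.0
cloudpickle==3.0.0
```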

2- Find a way to pass verbose=0 to all TensorFlow prediction calls associated with the SHAP methods. These come from model.predict calls. There might be a way to silence them before or while passing them to DeepLIFT, KernelSHAP, and IG.
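One way to silence those progress bars is to wrap the model in a small prediction function that hardcodes verbose=0 and hand that wrapper to the explainer instead of the raw model. A minimal sketch, with the wrapper name and usage being illustrative rather than pyMAISE's actual implementation:

```python
def silent_predict(model, X):
    """Forward to model.predict with verbose=0 so Keras suppresses the
    per-batch progress bar during the many evaluations SHAP performs."""
    return model.predict(X, verbose=0)

# Hypothetical usage: pass a bound wrapper to an explainer instead of
# the raw model, e.g.
#   explainer = shap.KernelExplainer(lambda X: silent_predict(model, X),
#                                    background)
```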

3- Let @myerspat know to add a clear acknowledgment of the SHAP package, on both the GitHub page and the docs page, stating that we are using their implementation.

4- For the example CHF notebook, since we do not yet have detailed documentation for this capability, it would be helpful to discuss the classes. For example: why you use nsamples=None (what it means and what would happen if you used nsamples=20 instead), what the background samples are for KernelSHAP, why the background needs to be small from a computing-cost perspective, and anything else you feel is important to describe for users of these methods.
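To make the background-sample point concrete: KernelSHAP evaluates the model roughly once per background row per explained sample, so the background set must stay small. A NumPy-only sketch of drawing such a set (the function name and default k are illustrative, not pyMAISE API):

```python
import numpy as np

def choose_background(X, k=50, seed=0):
    """Subsample k rows of X to use as a KernelSHAP background set.

    KernelSHAP's cost scales with (background rows) x (explained
    samples) x (coalitions), so a small, representative background
    keeps the computing cost tractable."""
    rng = np.random.default_rng(seed)
    idx = rng.choice(len(X), size=min(k, len(X)), replace=False)
    return X[idx]
```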


@myerspat myerspat left a comment


@npanczyk, all the code looks great. I didn't review any of the SHAP-package-specific files, but I'll leave those to you if you have any issues with them. Now we just need to update the documentation. There are four places that need to be updated with the new capabilities:

  • In the landing page given by docs/source/index.rst please include a blurb somewhere that briefly explains the new explainability features. Copy this over to README.md.
  • In the installation guide at docs/source/installation.rst please include the new dependencies.
  • In the user guide under docs/source/user_guide.rst include a section that fits within the order of things that outlines the new capabilities with any examples.
  • In the pyMAISE API reference at docs/source/pymaise_api.rst include a section for explainability that links the methods/classes you want the user to be aware of. This will link to the methods/classes docstring so make sure those are up to snuff too.

Feel free to go through the docs and add blurbs on explainability wherever you see fit. Also, please address the other comments I've left.

Comment on lines +73 to +79
"""
This function fits a DeepLIFT explainer to evaluate SHAP coefficients (only for
neural networks).

:param nsamples: (int less than total samples in test set or None, default=None)
Number of samples used to estimate the DeepLIFT importances if it is
different than using all samples in X
Collaborator

All functions from SHAP that the user will call directly should have their docstrings changed to the format pyMAISE uses.

Collaborator Author

The user shouldn't call any functions directly from SHAP. Let me know if I'm missing something here.


@myerspat myerspat left a comment


Thanks @npanczyk, some last changes here.

Comment on lines +195 to +202
.. rubric:: Classes

.. autosummary::
:toctree: stubs
:nosignatures:
:template: class.rst

pyMAISE.ShapExplainers
Collaborator

At least for me, pyMAISE.ShapExplainers does not work when generating documentation. You can test this locally by moving to docs/ and running make html, assuming you installed pyMAISE using pip install -e ".[dev]". This builds the HTML files that will be generated on readthedocs; you can open them from docs/build/html/. Also, if ShapExplainers is the only class/function the user will need from the explain module, consider just importing it in pyMAISE/__init__.py:

from pyMAISE.explain import ShapExplainers

The user should then be able to import ShapExplainers directly via from pyMAISE import ShapExplainers (test this to make sure it works), and the above autosummary should work.

Collaborator Author

Pretty sure I got this, but I'm having issues getting Sphinx to generate an autosummary for me. It's no longer throwing errors about ShapExplainers, though; I added it to __init__.py as you suggested. Let me know if this still breaks for you.

Collaborator

I would add something at the top of this file referring to the explainability features.

Collaborator Author

I thought explainability kind of fell under the post-processing category here. If you think I should make it separate, I can add a "Step 6," but that changes more of the document structure.

Comment on lines +400 to +406
- :meth:`pyMAISE.ShapExplainers.DeepLIFT`: fits a DeepLIFT explainer to evaluate SHAP coefficients,
- :meth:`pyMAISE.ShapExplainers.IntGradients`: fits an Integrated Gradient explainer to evaluate SHAP coefficients,
- :meth:`pyMAISE.ShapExplainers.KernelSHAP`: fits a KernelSHAP explainer to evaluate SHAP coefficients,
- :meth:`pyMAISE.ShapExplainers.Exact_SHAP`: fits an Exact SHAP explainer to evaluate SHAP coefficients,
- :meth:`pyMAISE.ShapExplainers.postprocess_results`: generates SHAP mean values for plotting functions,
- :meth:`pyMAISE.ShapExplainers.plot`: makes a beeswarm plot and a bar plot for each SHAP method or for a particular method, and
- :meth:`pyMAISE.ShapExplainers.plot_bar_only`: makes a bar plot for each or a particular SHAP method.
Collaborator

Check that these links work in the generated documentation.

Collaborator

The tuning and postprocessing results do not look correct here. I think you ran one iteration/epoch for all models, which is not what we want to show in the final results. You should do a complete run of this benchmark with your explainability features. Feel free to use the parallelization capabilities to make the models tune faster (30 minutes tuning and 75 total, before explain, on AIMS01). Also, nowhere in the landing pages of pyMAISE do you point users to this benchmark as an example of how to use the new explainability features; I would add something to docs/source/index.rst and README.md. Finally, in your documentation under Explainability Metrics, any specific references to function arguments, functions, and methods should be surrounded by `` so they render as code rather than standard text. Refer to the MIT reactor benchmark for examples.
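For example, a sentence in the benchmark prose would mark argument and method names as inline literals like this (the sentence itself is illustrative):

```rst
Setting ``nsamples=None`` tells ``ShapExplainers.KernelSHAP`` to use every
test sample, while ``nsamples=20`` estimates the SHAP coefficients from
only 20 samples.
```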

return ax


class ShapExplainers:
Collaborator

Given that you're pointing users to this class, you should add a docstring discussing it and its parameters (refer to Tuner or PostProcessor for examples). You may also consider writing usage examples for it (refer to Tuner for an example). Also, if you look at the second-to-last code block in your CHF benchmark, you'll see there are warnings. I know you're having trouble getting these to go away, but for Jupyter notebooks we can use the pyMAISE.utils._try_clear() function to clear unwanted output, which is what I'm using pretty much everywhere in the tuner. This should clean things up, as we shouldn't see any warnings with verbosity=0.



Development

Successfully merging this pull request may close these issues.

Explainability Analysis
