- @mayer79 fixed the use of the deprecated
sizeargument inggplot2version 4, which is now calledlinewidth(#576)
- fixed NOTE during
devtools::document()such asS3 method plot.... needs @export or @exportS3method tag.
- adding the support for calculating kernel SHAP values via
predict_parts()function
- breaking change: change the name of
loss_yardstick()toget_loss_yardstick()andloss_default()toget_loss_default() - add
loss_one_minus_accuracy()andget_loss_one_minus_accuracy()(#535)
- added implementation of aSHAP (aggregated SHAP) and waterfall plot (#519)
- adding a new system for default color schemes (#541)
- added
cross_entropyas model performance measure to multilabel settings #542
- removed the
yardstickdependency - new vignette added 'How to use DALEX with the yardstick package?'
- new datasets from World Happiness Report:
happiness_testandhappiness_train(#513) - new datasets from COVID morality:
covid_summerandcovid_spring(#513)
- changed URLs in the DESCRIPTION as requested in (#484)
- Fix model_info documentation (#498)
- Support for yardstic metrics (#495)
- Changed default in
explain(colorize=)according to (#473) - Added explain/yhat support for
partykit(#438) explain()warns if target has more than two values for classification (#418)
- The
plot.model_performance_roc,loss_one_minus_aucandmodel_performance_aucfunctions are rewritten to handle repeated predictions (#442)
- The
plotfunction works for list of explanations (if possible) (#424)
- Order of explainer labels in different plots is the same. To get to this point, orders in
plot.model_performance(..., geom = "histogram" & "boxplot")are reversed (#400) - Fixed multiclass explainer when data has one column (#405)
- Now explainer handles R functions (#396)
predict_partsfunction handles theNargument natively (#394)
- All encouters of
nieghbour(s)(EN-spelling) were replaced withneighbor(s)(US-spelling) for the consistency and backword compatibility. - Fixed bug when
predict_diagnosticsraised error ifneighborvalue was higer thannrow(explainer$data).
- Added new parameter (
predict_function_target_column) toexplainfunction that allows specifying positive class in binary classification tasks (#250). - Fixed
model_diagnostics()returning an error whendataismatrix(#355)
- Fixed R package not working with Python Explainer (#318)
- Fixed
model_diagnostics()returning an error wheny_hatorresidualsis ofarrayclass (#319) - Fixed grid lines in
theme_drwhyon Windows - Fixed logical values in y rising unnecessery warnings for classification task (#336)
plot.predict_diagnosticsnow passess ellipsis toplot.ceteris_paribus_explainer- This version requires
iBreakDown v1.3.1andingredients v1.3.1 - Fixed
plot.predict_partsandplot.model_profile(#277). - Fixed
plot.model_profilefor multiple profiles (#237). - External tests for not suggested packages added to gh-actions (#237).
- Extended and refreshed documentation (#237).
- All dontrun statements changed to donttest according to CRAN policy.
- Added value for
sparameter inyhat.glmnetandyhat.cv.glmnet. - Fixed
model_diagnosticspassing wrong arguments to residual_function. - Fixed aesthetic for
histgeometry inplot.model_performanceusing wrong arugments. model_performancewill not work ifmodel_info$typeisNULL.- Corrected description of
Ninmodel_parts(#287). - New warning messages for
yparameter inexplainfunction. - Solved bug in
yhat.rangercausingpredicts_partsnot to plot correctly when task is multiclass. variable_effectis now deprecated
- fixed typo in
predict_parts_oscillations_emp - rewrite tests
- added
predict_partsclass to objects andplot.predict_partsfunction - added
model_partsclass to objects andplot.model_partsfunction - plot parameters added to the documentation
- Now in the
predict_profilefunction one can specify how grid points shall be calculated, seevariable_splits_type(#267). - The
predict_partfunction has two new options for type:oscillations_uniandoscillations_emp(#267). - The
plot.model_performancefunction has a newgeom="prc"for Precision Recall curve (#273).
DALEXnow fully supports multiclass classification.explain()will use new residual function (1 - true class probability) if multiclass classification is detected.model_performance()now support measures for multiclass classification.- Remove
ggpubrfrom suggests. lossFunctionargument is now deprecated inplot.model_performance(). Use theloss_functionargument.model_profilecolor changed tocolors_discrete_drwhy(1)which impacts the color of the line inplot.model_profileloss_nameattribute added to loss functions. It will be passed to plot function for objects created withmodel_parts.
- fixed tests and WARNINGs on CRAN
model_profilefor Accumulated Local rofiles by default use centering (center = TRUE)- deprecate
n_sampleargument inmodel_parts(now it'sN) (#175)
ingredientsandiBreakDownare now imported by DALEX
- updated title for
plot.model_performance(#160). - in
explainremoved check related to duplicated target variable (#164).
variable_profilecallsingredients::ceteris_paribus(#131).variable_responseandfeature_responsemoved tovariable_effectand now it callsingredients::partial_dependency(#131).prediction_breakdownmoved tovariable_attributionand now it callsiBreakDown::break_down(#131).- updated
variable_importance, not it calls theingredients::variable_importance(#131). - updated
model_performance(#130). - added
yhatforlrmmodels fromrmspackage theme_drwhyhas now left aligned title and subtitle.residuals_distributioncalculates now diagnostic plots based on residuals (#143).model_performancecalculates several metrics for classification and regression models (#146).plot.model_performancenow supports ROC charts, LIFT charts, Cummulative Gain charts, histograms, boxplots and ecdfresiduals_distributonis nowindividual_diagnosticsand produces objects of the classindividual_diagnostics_explainersplot.individual_diagnostics_explainersnow plots objects of the classindividual_diagnostics_explainersyhatfor caret models now returns matrix instead of data.framemodel_diagnosticsnew function that plots residuals againes selected variable- names of functions are changed to be compliant with latest version of the XAI pyramide
- updated
titanic_imputed(#113). - added
weightsto the explainer. Note that not all explanations know how to handle weights (#118). yhat()andmodel_info()now support models created withgbmpackage.
- new argument
colorizein theexplain()as requested in (#112). - new generic function
model_info(). It will extract basic irnformation like model package nam version and task type. (#109, #110) - new functions
update_data()andupdate_label(). (#114))
- new dataset
titanic_imputedas requested in (#104). - the
explain()function now detects if target variableyis present in thedataas requested in (#103). - the DALEX GitHub repository is transfered from
pbiecek/DALEXto ModelOriented/DALEX.
- Examples updated. Now they use only datasets available from DALEX.
- yhat.H2ORegressionModel and yhat.H2OBinomialModel moved to (DALEXtra) and merged into explain_h2o() function.
- yhat.WrappedModelmoved to (DALEXtra) and merged as explain_mlr() function.
- Wrapper for scikit-learn models restored in (DALEXtra) package.
- loss_one_minus_auc function added to loss_functions.R. It uses 1-auc to compute loss. Function created by Alicja Gosiewska.
- Extension for DALEX avaiable at (DALEXtra)
- the
explain()function is more verbose. Withverbose = TRUE(default) it prints detailed information about elements of an explainer (#95).
- new color schemes:
colors_breakdown_drwhy(),colors_discrete_drwhy()andcolors_diverging_drwhy(). - in this version the
scikitlearn_model()is removed as it is not working with python 2.7
- New support for scikit-learn models via
scikitlearn_model()
- New
yhatfunctions formlr,h2oandcaretpackages (added by Szymon).
plot.variable_importance_explainer()has nowdesc_sortingargument. If FALSE then variable importance will be sorted in an increasing order (#41).
ingredientsandiBreakDownare added to additional features (#72).feature_response()andvariable_response()are marked as Deprecated. It is suggested to useingredients::partial_dependency(),ingredients::accumulated_dependency()instead (#74).variable_importance()is marked as Deprecated. It is suggested to useingredients::feature_importance()instead (#75).prediction_breakdown()is marked as Deprecated. It is suggested to useiBreakDown::break_down()oriBreakDown::shap()instead (#76).
- updated filenames
pdp,factorMergerandALEPlotare going toSuggested. (#60). In next releases they will be deprecated.- added
predictfunction that calls thepredict_functionhidden in theexplainerobject. (#58).
- the
titanicdataset is copied fromstablelearnerpackage. Some features are transformed (someNAreplaced with0, more numeric features).
DALEXis being prepared for tighter integration withiBreakDownandingredients.- temporally there is a duplicated
single_variableandsingle_feature - Added new
theme_drwhy(). - New arguments in the
plot.variable_importance_explainer(). Namelybar_widthwith widths of bars andshow_baselineif baseline shall be included in these plots. - New skin in the
plot.variable_response_explainer(). - New skin in the
plot.prediction_breakdown_explainer().
- Test datasets are now named
apartments_testandHR_test - For binary classification we return just a second column. NOTE: this may cause some unexpected problems with code dependend on defaults for DALEX 0.2.6.
- New versions of
yhatforrangerandsvmmodels.
- Residual distribution plots for model performance are now more legible when multiple models are plotted. The styling of plot and axis titles have also been improved (@kevinykuo).
- The defaults of
single_prediction()are now consistent withbreakDown::broken(). Specifically,baselineis now0by default instead of"Intercept". The user can also specify thebaselineand other arguments by passing them tosingle_prediction(@kevinykuo, #39). WARNING: Change in the default value ofbaseline. - New
yhat.*functions help to handle additional parameters to differentpredict()functions. - Updated
CITATIONinfo
- New dataset
HRandHRTest. Target variable is a factor with three levels. Is used in examples for classification. - The
plot.model_performance()has nowshow_outliersparameter. Set it to anything >0 and observations with largest residuals will be presented in the plot. (#34)
- Small fixes in
variable_response()to better support ofgbmmodels (c8393120ffb05e2f3c70b0143c4e92dc91f6c823). - Better title for
plot_model_performance()(e5e61d0398459b78ea38ccc980c4040fd853f449). - Tested with
breakDownv 0.1.6.
- The
single_variable() / variable_response()function usespredict_functionfromexplainer(#17)
- The
explain()function convertstibblestodata.framewhen specified asdataargument (#15) - The default generic
explain.default()should help whenexplain()fromdplyris loaded afterDALEX(#16)
- New names for some functions:
model_performance(),variable_importance(),variable_response(),outlier_detection(),prediction_breakdown(). Old names are now deprecated but still working. (#12) - A new dataset
apartments- will be used in examples variable_importance()allows work on full dataset ifn_sampleis negativeplot_model_performance()uses ecdf or boxplots (depending ongeomparameter).
- Function
single_variable()supports factor variables as well (with the use offactorMergerpackage). Remember to usetype='factor'when playing with factors. (#10) - Change in the function
explain(). Old version has an argumentpredict.function, now it'spredict_function. New name is more consistent with other arguments. (#7) - New vigniette for
xgboostmodel (#11)
- Support for global model structure explainers with
variable_dropout()function
- DALEX package is now public
explain()function implementedsingle_prediction()function implementedsingle_variable()function implemented