
allow to pass multiple documents to explain_prediction #225

Open
kmike opened this issue Jun 23, 2017 · 1 comment
@kmike
Contributor

kmike commented Jun 23, 2017

Currently we allow passing only a single document to explain_prediction. I think we should allow passing multiple documents as well.

There are two ways to handle multiple documents; both of them look useful:

  1. Just show explanations for all instances - similar to what is discussed in "show_prediction() doesn't work inside a loop" #205.

  2. Aggregate the resulting explanations in some way. For example, https://github.com/andosa/treeinterpreter has this feature - you can check which features are important for predicting labels on a subset of the dataset. I'm not sure we should show all individual documents in this case; probably there should be an option to turn that off (or maybe there is one already - can we turn it off using the show argument?).
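A minimal sketch of option (2), assuming per-document explanations have already been reduced to `{feature: weight}` dicts. The function and data here are hypothetical illustrations, not existing eli5 API:

```python
from collections import defaultdict

def aggregate_explanations(per_doc_weights):
    """Average feature weights across documents.

    per_doc_weights: list of {feature: weight} dicts, one per document
    (a hypothetical intermediate format, not the eli5 Explanation object).
    A feature absent from a document contributes zero to its average.
    """
    totals = defaultdict(float)
    for weights in per_doc_weights:
        for feature, weight in weights.items():
            totals[feature] += weight
    n = len(per_doc_weights)
    return {feature: total / n for feature, total in totals.items()}

docs = [
    {"good": 1.5, "bad": -0.5},
    {"good": 0.5, "price": 1.0},
]
print(aggregate_explanations(docs))
# {'good': 1.0, 'bad': -0.25, 'price': 0.5}
```

Whether to divide by the total number of documents (as above) or only by the number of documents where a feature actually appears is one of the design questions an aggregation API would have to settle.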

It may also help with #213 (comment).

@lopuhin
Contributor

lopuhin commented Aug 11, 2017

I wrote a PoC for (2) outside of eli5, targeting only XGBoost. It worked much faster than calling explain_prediction multiple times and then aggregating the results. The only caveat is that I needed special handling of missing values: xgboost might use some positive feature, and if that feature is missing, it will lower the score. The weight for this missing positive feature would then be negative, so I excluded such weights from aggregation, and after that the feature weights started to make sense. But this handling might not always be desired. This approach also requires custom code in each classifier we want to support, although it's not a huge change, and one document can be seen as a special case of multiple documents here.
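The missing-value handling described above could be sketched roughly like this: before aggregating, drop the weight of any feature that was missing in a given document. All names here are illustrative, not eli5 or xgboost API:

```python
def filter_missing(feature_weights, feature_values, missing=None):
    """Drop weights for features whose value is missing in a document.

    As noted above, XGBoost can assign a negative weight to a positive
    feature that is absent from a document; excluding those weights keeps
    the aggregated importances meaningful. Hypothetical sketch only.
    """
    return {
        feature: weight
        for feature, weight in feature_weights.items()
        if feature_values.get(feature, missing) is not missing
    }

weights = {"good": 0.8, "price": -0.3}
values = {"good": 1.0}  # "price" is missing in this document
print(filter_missing(weights, values))
# {'good': 0.8}
```

Making the exclusion optional (e.g. via a flag) would address the point that this handling is not always desired.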
