Probabilistic Sufficient Explanations (bibtex)

by Eric Wang, Pasha Khosravi and Guy Van den Broeck
Abstract:
Understanding the behavior of learned classifiers is an important task, and various black-box explanations, logical reasoning approaches, and model-specific methods have been proposed. In this paper, we introduce probabilistic sufficient explanations, which formulate explaining an instance of classification as choosing the "simplest" subset of features such that only observing those features is "sufficient" to explain the classification. That is, sufficient to give us strong probabilistic guarantees that the model will behave similarly when all features are observed under the data distribution. In addition, we leverage tractable probabilistic reasoning tools such as probabilistic circuits and expected predictions to design a scalable algorithm for finding the desired explanations while keeping the guarantees intact. Our experiments demonstrate the effectiveness of our algorithm in finding sufficient explanations, and showcase its advantages compared to Anchors and logical explanations.
Reference:
Eric Wang, Pasha Khosravi and Guy Van den Broeck. Probabilistic Sufficient Explanations, In Proceedings of the UAI Workshop on Tractable Probabilistic Modeling (TPM), 2021.
Bibtex Entry:
@inproceedings{WangTPM21,
  author    = {Wang, Eric and Khosravi, Pasha and Van den Broeck, Guy},
  title     = {Probabilistic Sufficient Explanations},
  booktitle = {Proceedings of the UAI Workshop on Tractable Probabilistic Modeling (TPM)},
  month     = 7,
  year      = {2021},
  url       = "http://starai.cs.ucla.edu/papers/WangTPM21.pdf",
  keywords  = {workshop,duplicate}
}
PDF Preview:
(PDF preview not available, download PDF instead)
Powered by bibtexbrowser