Explanation in artificial intelligence: insights from the social sciences
From MaRDI portal
Publication:2321252
Abstract: There has been a recent resurgence in the area of explainable artificial intelligence as researchers and practitioners seek to make their algorithms more understandable. Much of this research is focused on explicitly explaining decisions or actions to a human observer, and it should not be controversial to say that looking at how humans explain to each other can serve as a useful starting point for explanation in artificial intelligence. However, it is fair to say that most work in explainable artificial intelligence uses only the researchers' intuition of what constitutes a 'good' explanation. There exist vast and valuable bodies of research in philosophy, psychology, and cognitive science of how people define, generate, select, evaluate, and present explanations, which argue that people employ certain cognitive biases and social expectations towards the explanation process. This paper argues that the field of explainable artificial intelligence should build on this existing research, and reviews relevant papers from philosophy, cognitive psychology/science, and social psychology, which study these topics. It draws out some important findings, and discusses ways that these can be infused with work on explainable artificial intelligence.
Recommendations
- Explanation in AI and law: past, present and future
- Artificial explanations: The epistemological interpretation of explanation in AI
- Explainable deep learning: a field guide for the uninitiated
- A survey on the explainability of supervised machine learning
- Using state abstractions to compute personalized contrastive explanations for AI agent behavior
Cites work
- scientific article; zbMATH DE number 4174350
- scientific article; zbMATH DE number 1467490
- scientific article; zbMATH DE number 1869489
- scientific article; zbMATH DE number 795590
- scientific article; zbMATH DE number 2201583
- scientific article; zbMATH DE number 3285169
- scientific article; zbMATH DE number 2243367
- scientific article; zbMATH DE number 4185074
- A Short Introduction to Computational Social Choice
- A theory of diagnosis from first principles
- Abductive Inference
- Causes and Explanations: A Structural-Model Approach. Part I: Causes
- Causes and Explanations: A Structural-Model Approach. Part II: Explanations
- Causes and explanations in the structural-model approach: Tractable cases
- Complexity results for structure-based causality.
- Conditional logic of actions and causation
- How to explain individual classification decisions
- The Oxford handbook of causal reasoning
- The book of why. The new science of cause and effect
Cited in
(only showing first 100 items)
- Modeling and generating user-centered contrastive explanations for the workforce scheduling and routing problem
- Common abductive explanations in first order logic
- scientific article; zbMATH DE number 7626729
- SpICE: an interpretable method for spatial data
- A machine learning approach to differentiate between COVID-19 and influenza infection using synthetic infection and immune response data
- Formal Methods in FCA and Big Data
- Detecting correlations and triangular arbitrage opportunities in the Forex by means of multifractal detrended cross-correlations analysis
- Causal CSSE: integrating counterfactuals and causality in the explanation of machine learning models
- A logic of "black box" classifier systems
- Is there a role for statistics in artificial intelligence?
- The explanation game: a formal framework for interpretable machine learning
- Explainability requirements as hyperproperties
- A new model for counterfactual analysis for functional data
- Tailoring explanations through conversation
- Story embedding: learning distributed representations of stories based on character networks
- Causal explanations for sequential decision making
- Forks over knives: predictive inconsistency in criminal justice algorithmic risk assessment tools
- A maximum-margin multisphere approach for binary multiple instance learning
- Exploiting Game Theory for Analysing Justifications
- On the (complete) reasons behind decisions
- Certified logic-based explainable AI -- the case of monotonic classifiers
- Feature necessity & relevancy in ML classifier explanations
- Explanation-friendly query answering under uncertainty
- A Neural Phillips Curve and a Deep Output Gap
- A new class of explanations for classifiers with non-binary features
- Contrastive explanations for answer-set programs
- Declarative reasoning on explanations using constraint logic programming
- Minimality, necessity and sufficiency for argumentation and explanation
- Counterfactual state explanations for reinforcement learning agents via generative deep learning
- Triadic patterns for explainable artificial intelligence
- Mathematical optimization in classification and regression trees
- The effects of explanations on automation bias
- On computing probabilistic abductive explanations
- scientific article; zbMATH DE number 7450036
- Counterfactuals as modal conditionals, and their probability
- A tutorial in proof-theoretic approaches to logical argumentation
- Critical observations in model-based diagnosis
- A Comprehensive Framework for Learning Declarative Action Models
- Using analogical proportions for explanations
- Probabilistic causes in Markov chains
- Defining formal explanation in classical logic by substructural derivability
- Mathematical optimization modelling for group counterfactual explanations
- Interpretable generalized additive neural networks
- What makes accidents severe! Explainable analytics framework with parameter optimization
- Explaining answers generated by knowledge graph embeddings
- Learning decision catalogues for situated decision making: the case of scoring systems
- On the failings of Shapley values for explainability
- Questionable stepwise explanations for a robust additive preference model
- Synergies between machine learning and reasoning -- an introduction by the Kay R. Amel group
- A model for intelligible interaction between agents that predict and explain
- Explaining black-box classifiers: properties and functions
- A local method for identifying causal relations under Markov equivalence
- Relation between prognostics predictor evaluation metrics and local interpretability SHAP values
- Synthesizing explainable counterfactual policies for algorithmic recourse with program synthesis
- ASP and subset minimality: enumeration, cautious reasoning and MUSes
- Learning Optimal Decision Sets and Lists with SAT
- Argumentative explanations for interactive recommendations
- Paracoherent answer set computation
- Why bad coffee? Explaining BDI agent behaviour with valuings
- Modifications of the Miller definition of contrastive (counterfactual) explanations
- The spherical \(k\)-means++ algorithm via local search
- A fast iterative PDE-based algorithm for feedback controls of nonsmooth mean-field control problems
- Editable machine learning models? A rule-based framework for user studies of explainability
- The role of political risk, uncertainty, and crude oil in predicting stock markets: evidence from the UAE economy
- Argumentative review aggregation and dialogical explanations
- Explain it as simple as possible, but no simpler -- explanation via model simplification for addressing inferential gap
- Explaining commonalities of clusters of RDF resources in natural language
- A framework for inherently interpretable optimization models
- Comments on "Data science, big data and statistics"
- On Tackling Explanation Redundancy in Decision Trees
- Interval abstractions for robust counterfactual explanations
- An abstract and structured account of dialectical argument strength
- ASQ-IT: interactive explanations for reinforcement-learning agents
- Planning with mental models -- balancing explanations and explicability
- Efficient search for relevance explanations using MAP-independence in Bayesian networks
- Providing personalized explanations: a conversational approach
- Objective-based counterfactual explanations for linear discrete optimization
- Proof theory and decision procedures for deontic STIT logics
- Witnesses for Answer Sets of Logic Programs
- Explainable acceptance in probabilistic and incomplete abstract argumentation frameworks
- Non-monotonic explanation functions
- Necessary and sufficient explanations for argumentation-based conclusions
- Persuasive contrastive explanations for Bayesian networks
- Tractability of explaining classifier decisions
- On the local coordination of fuzzy valuations
- L. A. Zadeh, the visionary in explainable artificial intelligence
- Predictable artificial intelligence
- Some models are useful, but how do we know which ones? Towards a unified Bayesian model taxonomy
- A k-additive Choquet integral-based approach to approximate the SHAP values for local interpretability in machine learning
- Centroid cross-efficiency approach for clustering
- Feature necessity and relevancy in machine learning explanations
- A general framework for personalising post hoc explanations through user knowledge integration
- Towards a trade-off of interpretability, accuracy and scalability: enhanced formulations in linear classification models
- Explanation in AI and law: past, present and future
- The Bateson game: a model of strategic ambiguity, frame uncertainty, and pathological learning
- Epistemic injustice as a philosophical conception for considering fairness and diversity in human-centered AI principles
- Model Uncertainty and Correctability for Directed Graphical Models
- Relative sparsity for medical decision problems
- Generating contrastive explanations for inductive logic programming based on a near miss approach
- scientific article; zbMATH DE number 7626724