From Explanations to Interpretability and Back

Räz, Tim (2024). From Explanations to Interpretability and Back (Submitted). In: Philosophy of Science for Machine Learning: Core Issues, New Perspectives. Synthese Library. Springer

raez_ml_expl_philsci.pdf - Submitted Version
Restricted to registered users only
Available under License Publisher holds Copyright.

This chapter discusses connections between the interpretability of machine learning (ML) and (scientific and mathematical) explanations, provides novel perspectives on interpretability, and highlights under-explored issues. Interpretability types are proposed: kinds of interpretability should be distinguished using both the parts of ML we want to explain and the parts of ML we use to explain. It is argued that not all explanations are contrastive, and that we should also consider contrasts with respect to models and data, not only with respect to inputs. Theoretical explanations are highlighted; they include issues like generalization, optimization, and expressivity. It is proposed that there are two threats to the objectivity of explanations: one from radical subject-dependence, the other from a lack of factivity. Finally, pluralism is advocated: there are different notions of interpretability and different notions of (scientific and mathematical) explanations. However, the heterogeneity of one area does not transfer to the other in a straightforward manner.

Item Type:	Book Section (Book Chapter)
Division/Institute:	06 Faculty of Humanities > Department of Art and Cultural Studies > Institute of Philosophy
UniBE Contributor:	Räz, Tim
Subjects:	100 Philosophy; 600 Technology > 620 Engineering
Series:	Synthese Library
Publisher:	Springer
Language:	English
Submitter:	Tim Räz
Date Deposited:	08 Mar 2024 15:05
Last Modified:	08 Mar 2024 15:05
Related URLs:	https://philsci-archive.pitt.edu/23169/
BORIS DOI:	10.48350/193896
URI:	https://boris.unibe.ch/id/eprint/193896
