Räz, Tim (2024). From Explanations to Interpretability and Back (Submitted). In: Philosophy of Science for Machine Learning: Core Issues, New Perspectives. Synthese Library. Springer
Text
raez_ml_expl_philsci.pdf - Submitted Version Restricted to registered users only Available under License Publisher holds Copyright. Download (329kB) |
This chapter discusses connections between interpretability of machine learning and (scientific and mathematical) explanations, provides novel perspectives on interpretability, and highlights under-explored issues. Interpretability types are proposed: kinds of interpretability should be distinguished using both the parts of ML we want to explain and the parts of ML we use to explain. It is argued that not all explanations are contrastive, and that we should also consider contrasts with respect to models and data, not only with respect to inputs. Theoretical explanations are highlighted; they include issues like generalization, optimization, and expressivity. It is proposed that there are two threats to the objectivity of explanations: One from radical subject-dependence, the other from a lack of factivity. Finally, pluralism is advocated: There are different notions of interpretability and different notions of (scientific and mathematical) explanations. However, the heterogeneity of one area does not transfer to the other in a straightforward manner.
Item Type: |
Book Section (Book Chapter) |
---|---|
Division/Institute: |
06 Faculty of Humanities > Department of Art and Cultural Studies > Institute of Philosophy |
UniBE Contributor: |
Räz, Tim |
Subjects: |
100 Philosophy 600 Technology > 620 Engineering |
Series: |
Synthese Library |
Publisher: |
Springer |
Language: |
English |
Submitter: |
Tim Räz |
Date Deposited: |
08 Mar 2024 15:05 |
Last Modified: |
08 Mar 2024 15:05 |
Related URLs: |
|
BORIS DOI: |
10.48350/193896 |
URI: |
https://boris.unibe.ch/id/eprint/193896 |