Opening the Black Box: Explanation and Transparency in Machine Learning

2021 
Machine learning algorithms often predict and classify without offering human-cognizable reasons for their evaluations. When confronted with the opacity of machine learning in science, what is our epistemic situation, and what ought we to do to resolve it? To answer this question, I first outline a framework for increasing transparency in complex computational systems such as climate simulations and machine learning on big scientific data. I identify three different ways of attaining knowledge about these opaque systems and argue that each fulfills a different explanatory purpose. Second, I argue that an analogy with the renormalization group helps us choose the better of two philosophically suggestive explanatory strategies, which rely on different diagnoses of the success of deep learning. The coarse-graining strategy suggests that highlighting the parts of the input that contributed most to the output will be misleading without two things: an explanation of why the irrelevant parts are themselves irrelevant, and an explanation of the stability of the output under minor perturbations of the input. Armed with a framework for understanding transparency and an analysis of explanatory strategies appropriate for deep learning, I turn to their application to automated science. Automated science is the use of machine learning to automate hypothesis generation, experimental design, the performance of experiments, and the evaluation of results. If automated science is to find patterns on its own, it must be able to solve the Molyneux problem for science, namely, recognizing identity across modalities or data streams without the aid of causation or correlation.
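The kind of attribution method at issue in the coarse-graining worry can be illustrated with a minimal sketch. The toy network, its random weights, and the function names (forward, saliency, stability) below are hypothetical and not from the paper; under those assumptions, the sketch shows how an input-gradient saliency map highlights the features that most move the output while, by itself, saying nothing about why the remaining features are irrelevant or whether the output is stable under small perturbations of the input.

```python
import numpy as np

# Toy two-layer network; the weights are random placeholders, purely illustrative.
rng = np.random.default_rng(0)
W1 = rng.normal(size=(8, 4))   # 8 input features -> 4 hidden units
W2 = rng.normal(size=(4, 1))   # 4 hidden units -> scalar output

def forward(x):
    """Scalar output of a tiny tanh network."""
    h = np.tanh(x @ W1)
    return (h @ W2).item()

def saliency(x):
    """Input-gradient attribution, computed analytically via the chain rule.
    Large absolute values mark the input features that most move the output."""
    h = np.tanh(x @ W1)
    dh = 1.0 - h ** 2              # derivative of tanh at the hidden activations
    return ((W1 * dh) @ W2).ravel()

def stability(x, eps=0.01, trials=100):
    """Standard deviation of the output under small random input perturbations.
    A saliency map by itself says nothing about this quantity."""
    outputs = [forward(x + eps * rng.normal(size=x.shape)) for _ in range(trials)]
    return float(np.std(outputs))

x = rng.normal(size=8)
print("per-feature saliency:", np.round(saliency(x), 3))
print("output std under perturbation:", round(stability(x), 4))
```

Running the sketch prints a per-feature attribution vector and the spread of the output under perturbation; the coarse-graining point is that the first quantity alone explains neither the second nor the irrelevance of the low-attribution features.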