Interpreting Hierarchical Linguistic Interactions in DNNs

Die Zhang, Huilin Zhou, Xiaoyi Bao, Da Huo, Ruizhao Chen, Xu Cheng, Hao Zhang, Mengyue Wu, Quanshi Zhang

2020

This paper proposes a method to disentangle and quantify the interactions among words that a DNN encodes for natural language processing. We construct a tree to encode the salient interactions extracted by the DNN, and propose six metrics to analyze the properties of interactions between constituents in a sentence. The interaction is defined in terms of the Shapley values of words, which are regarded as unbiased estimates of word contributions to the network prediction. We use the method to quantify word interactions encoded inside BERT, ELMo, LSTM, CNN, and Transformer networks. The experimental results provide a new perspective for understanding these DNNs and demonstrate the effectiveness of the method.
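
To make the idea of a Shapley-based interaction concrete, the following is a minimal sketch, not the paper's implementation. The abstract does not give the exact definition, so the sketch assumes one illustrative choice: the interaction of two words is the Shapley value of the pair treated as a single player, minus the sum of each word's Shapley value computed with the other word absent. The names `shapley_values`, `pair_interaction`, `toy_score`, and `WEIGHTS` are hypothetical, and `toy_score` stands in for a network's output on a masked sentence.

```python
from itertools import combinations
from math import factorial


def shapley_values(players, value):
    """Exact Shapley values of `players` under the set function `value`.

    `value` maps a frozenset of players to a scalar score.
    Enumerates all subsets, so it is only practical for a few players.
    """
    n = len(players)
    phi = {}
    for p in players:
        others = [q for q in players if q != p]
        total = 0.0
        for k in range(n):
            # Weight of a coalition of size k that does not contain p.
            weight = factorial(k) * factorial(n - k - 1) / factorial(n)
            for subset in combinations(others, k):
                s = frozenset(subset)
                total += weight * (value(s | {p}) - value(s))
        phi[p] = total
    return phi


def pair_interaction(players, value, a, b):
    """Assumed definition: phi([a, b]) - (phi(a without b) + phi(b without a))."""
    merged = (a, b)  # the pair acts as one player
    rest = [p for p in players if p not in merged]

    def expand(coalition):
        # Translate a coalition that may contain the merged player
        # back into a set of individual words.
        words = set()
        for p in coalition:
            if p == merged:
                words.update(merged)
            else:
                words.add(p)
        return frozenset(words)

    phi_merged = shapley_values(rest + [merged], lambda s: value(expand(s)))[merged]
    phi_a = shapley_values(rest + [a], value)[a]  # b is absent from this game
    phi_b = shapley_values(rest + [b], value)[b]  # a is absent from this game
    return phi_merged - (phi_a + phi_b)


# Hypothetical stand-in for a model score: additive word weights plus a
# penalty when "not" and "good" appear together.
WEIGHTS = {"not": 0.0, "good": 1.0, "movie": 0.25}


def toy_score(words):
    score = sum(WEIGHTS[w] for w in words)
    if "not" in words and "good" in words:
        score -= 2.0
    return score
```

With this toy score, `pair_interaction(["not", "good", "movie"], toy_score, "not", "good")` recovers the joint penalty, while the pair `"good"`, `"movie"` has zero interaction because their contributions are purely additive. Exact enumeration is exponential in the number of words, so a real sentence would need sampling-based approximation.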

Keywords: Salient, Unbiased estimation, Encode, Natural language processing, Sentence, Artificial intelligence, Computer science
