Capturing Pragmatic Knowledge in Article Usage Prediction using LSTMs

Jad Kabbara,Yulan Feng,Jackie Chi Kit Cheung

Capturing Pragmatic Knowledge in Article Usage Prediction using LSTMs

2016

Jad Kabbara
Yulan Feng
Jackie Chi Kit Cheung

We examine the potential of recurrent neural networks for handling pragmatic inferences involving complex contextual cues for the task of article usage prediction. We train and compare several variants of Long Short-Term Memory (LSTM) networks with an attention mechanism. Our model outperforms a previous state-of-the-art system, achieving up to 96.63 accuracy on the WSJ/PTB corpus. In addition, we perform a series of analyses to understand the impact of various model choices. We find that the gain in performance can be attributed to the ability of LSTMs to pick up on contextual cues, both local and further away in distance, and that the model is able to solve cases involving reasoning about coreference and synonymy. We also show how the attention mechanism contributes to the interpretability of the models effectiveness.

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations