Less is More: Filtering Abnormal Dimensions in GloVe

Yang-Yin Lee, Hao Ke, Hen-Hsen Huang, Hsin-Hsi Chen

2016

GloVe (Global Vectors for Word Representation) performs well on some word analogy and semantic relatedness tasks. However, we find that some dimensions of the trained word embeddings are abnormal. We verify this conjecture by removing these abnormal dimensions with the Kolmogorov-Smirnov test and experimenting on several benchmark datasets for semantic relatedness measurement. The experimental results confirm our finding. Interestingly, on some of the tasks the filtered embeddings outperform the state-of-the-art model SensEmbed simply by removing these abnormal dimensions. This novel rule-of-thumb technique, which leads to better performance, is expected to be useful in practice.
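
The filtering idea can be illustrated with a minimal sketch. The abstract does not say which reference distribution, significance level, or threshold the authors use, so everything below is an assumption made for illustration: each dimension is compared against a normal distribution fitted to its own values, and a dimension is dropped when its Kolmogorov-Smirnov statistic exceeds a hand-picked `threshold`. The function names (`ks_statistic`, `filter_dimensions`) and the toy data are hypothetical and are not taken from the paper.

```python
import math
import random


def normal_cdf(x, mean, std):
    """CDF of a normal distribution with the given mean and standard deviation."""
    return 0.5 * (1.0 + math.erf((x - mean) / (std * math.sqrt(2.0))))


def ks_statistic(values):
    """One-sample KS statistic of `values` against a normal fitted to them.

    Assumption: normality is the reference; the paper may use another one.
    """
    n = len(values)
    mean = sum(values) / n
    std = math.sqrt(sum((v - mean) ** 2 for v in values) / n)
    if std == 0.0:
        return 1.0  # a constant dimension is treated as maximally abnormal
    d = 0.0
    for i, v in enumerate(sorted(values)):
        cdf = normal_cdf(v, mean, std)
        # Compare the fitted CDF with the empirical CDF just before and at v.
        d = max(d, cdf - i / n, (i + 1) / n - cdf)
    return d


def filter_dimensions(embeddings, threshold):
    """Drop every dimension whose KS statistic exceeds `threshold`.

    `embeddings` maps each word to a list of floats of equal length.
    Returns the filtered embeddings and the indices of the kept dimensions.
    """
    dim = len(next(iter(embeddings.values())))
    kept = [
        j for j in range(dim)
        if ks_statistic([vec[j] for vec in embeddings.values()]) <= threshold
    ]
    filtered = {w: [vec[j] for j in kept] for w, vec in embeddings.items()}
    return filtered, kept


# Toy data (not GloVe): dimensions 0 and 1 are Gaussian, dimension 2 is
# heavily skewed and stands in for an "abnormal" dimension.
rng = random.Random(0)
toy = {
    f"w{i}": [rng.gauss(0, 1), rng.gauss(0, 1), rng.expovariate(1.0) ** 3]
    for i in range(500)
}
filtered, kept = filter_dimensions(toy, threshold=0.1)
print(kept)
```

With real GloVe vectors, the same function would be applied to the full vocabulary matrix, and relatedness scores (for example, cosine similarity) would then be computed on the filtered vectors. The threshold of 0.1 is arbitrary here and would need tuning.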

Keywords:

Computer science
Word embedding
Semantic similarity
Analogy
Filter (signal processing)
Conjecture
Rule of thumb
Machine learning
Artificial intelligence
Word representation
