DirectProbe: Studying Representations without Classifiers

Yichu Zhou, Vivek Srikumar

2021

Understanding how linguistic structure is encoded in contextualized embeddings could help explain their impressive performance across NLP. Existing approaches for probing them usually call for training classifiers and use the accuracy, mutual information, or complexity as a proxy for the representation’s goodness. In this work, we argue that doing so can be unreliable because different representations may need different classifiers. We develop a heuristic, DirectProbe, that directly studies the geometry of a representation by building upon the notion of a version space for a task. Experiments with several linguistic tasks and contextualized embeddings show that, even without training classifiers, DirectProbe can shed light on how an embedding space represents labels and can also anticipate classifier performance for the representation.

Keywords:

Computer science
Artificial intelligence
space
Mutual information
Machine learning
Embedding
task
Structure (mathematical logic)
Heuristic
Classifier (linguistics)
Representation (mathematics)
