Discriminating Homonymy from Polysemy in Wordnets: English, Spanish and Polish Nouns

Arkadiusz Janz,Marek Maziarz

Discriminating Homonymy from Polysemy in Wordnets: English, Spanish and Polish Nouns

2021

Arkadiusz Janz
Marek Maziarz

We propose a novel method of homonymy-polysemy discrimination for three Indo-European Languages (English, Spanish and Polish). Support vector machines and LASSO logistic regression were successfully used in this task, outperforming baselines. The feature set utilised lemma properties, gloss similarities, graph distances and polysemy patterns. The proposed ML models performed equally well for English and the other two languages (constituting testing data sets). The algorithms not only ruled out most cases of homonymy but also were efficacious in distinguishing between closer and indirect semantic relatedness.

Keywords:

feature set
Natural language processing
Lasso (statistics)
Graph (abstract data type)
Support vector machine
Lemma (mathematics)
Semantic similarity
Polysemy
Artificial intelligence
Computer science
Noun

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations