Cleora: A Simple, Strong and Scalable Graph Embedding Scheme

2021 
The area of graph embeddings is currently dominated by contrastive learning methods, which require formulating an explicit objective function and sampling positive and negative examples. One of the leading classes of models is graph convolutional networks (GCNs), which suffer from numerous performance issues. In this paper we present Cleora: a purely unsupervised and highly scalable graph embedding scheme. Cleora can be likened to a GCN stripped down to its most effective core operation: repeated neighborhood aggregation. Cleora does not require a GPU and can embed massive graphs on CPU alone, beating other state-of-the-art CPU algorithms in both speed and quality as measured on downstream tasks. Cleora has been applied in top machine learning competitions involving recommendations and graph processing, placing on the podium in KDD Cup 2021, WSDM Challenge 2021, and SIGIR eCom Challenge 2020. We open-source Cleora under the MIT license, which allows commercial use, at https://github.com/Synerise/cleora.
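To make the idea of repeated neighborhood aggregation concrete, here is a minimal sketch in plain Python. It is not the Cleora implementation. The abstract does not specify initialization, normalization, or iteration count, so the random starting vectors, the neighbor averaging, the per-node L2 normalization, and the function name `aggregate_embeddings` are all assumptions made for illustration.

```python
import math
import random


def aggregate_embeddings(edges, num_nodes, dim=4, iterations=3, seed=0):
    """Illustrative repeated neighborhood aggregation on an undirected graph.

    Hypothetical helper, not part of the Cleora codebase.
    """
    rng = random.Random(seed)

    # Build adjacency lists for an undirected graph.
    neighbors = [[] for _ in range(num_nodes)]
    for u, v in edges:
        neighbors[u].append(v)
        neighbors[v].append(u)

    # Assumption: start from random vectors with entries in [-1, 1].
    emb = [[rng.uniform(-1.0, 1.0) for _ in range(dim)] for _ in range(num_nodes)]

    for _ in range(iterations):
        new_emb = []
        for node in range(num_nodes):
            nbrs = neighbors[node]
            if not nbrs:
                # Isolated nodes keep their current vector unchanged.
                new_emb.append(list(emb[node]))
                continue
            # Aggregation step: average the neighbors' current vectors.
            avg = [sum(emb[n][d] for n in nbrs) / len(nbrs) for d in range(dim)]
            # Assumption: L2-normalize so vectors do not shrink across iterations.
            norm = math.sqrt(sum(x * x for x in avg))
            new_emb.append([x / norm for x in avg] if norm > 0 else avg)
        emb = new_emb
    return emb


# Toy graph: a triangle (0, 1, 2) plus a separate edge (3, 4).
edges = [(0, 1), (1, 2), (2, 0), (3, 4)]
embeddings = aggregate_embeddings(edges, num_nodes=5)
```

There is no objective function, no sampling of positive or negative examples, and no gradient step in this sketch: each iteration only averages and normalizes, which is why a scheme of this kind can run on CPU.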