On Linear Learning with Manycore Processors

Eliza Wszola,Celestine Mendler-Dünner,Martin Jäggi,Markus Püschel

On Linear Learning with Manycore Processors

2019

Eliza Wszola
Celestine Mendler-Dünner
Martin Jäggi
Markus Püschel

A new generation of manycore processors is on the rise that offers dozens and more cores on a chip and, in a sense, fuses host processor and accelerator. In this paper we target the efficient training of generalized linear models on these machines. We propose a novel approach for achieving parallelism which we call Heterogeneous Tasks on Homogeneous Cores (HTHC). It divides the problem into multiple fundamentally different tasks, which themselves are parallelized. For evaluation, we design a detailed, architecture-cognizant implementation of our scheme on a recent 72-core Knights Landing processor that is adaptive to the cache, memory, and core structure. Experiments for Lasso and SVM with different data sets show a speedup of typically an order of magnitude compared to straightforward parallel implementations in C++.

Keywords:

Fuse (electrical)
Parallel computing
Speedup
Support vector machine
Chip
Computer science
Lasso (statistics)
Implementation
Cache
Data set

Correction
Cite
Save
Machine Reading By IdeaReader

References

Citations