Optically connected and reconfigurable GPU architecture for optimized peer-to-peer access

Erik Anderson,Jorge González,Alexander Gazman,Rodolfo Azevedo,Keren Bergman

Optically connected and reconfigurable GPU architecture for optimized peer-to-peer access

2018

Erik Anderson
Jorge González
Alexander Gazman
Rodolfo Azevedo
Keren Bergman

Increasing industry interest in the optimization of inter-GPU communication has motivated this work to explore new ways to enable peer-to-peer access. Specifically, this paper investigates how reconfigurable optical links between GPUs in multi-GPU servers can allow for minimized memory transfer latencies for given machine learning applications. Silicon photonics (SiP) is proposed as the enabling technology for such a reconfigurable architecture due to the potential for scalable and cost-efficient production. We evaluated our architecture using traffic obtained from an NVLink-connected 8 GPU server executing a set of machine learning models including AlexNet, DenseNet, NASNet, ResNet, MobileNet, and VGG16. Our results show up to 24.91% reduction of the total relative transmission latency (RTL) between peers.

Keywords:

Computer architecture
Silicon photonics
Peer-to-peer
ResNet
Latency (engineering)
Scalability
Architecture
Computer science
Random forest
Server
residual neural network
transmission latency

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations