One weird trick for parallelizing convolutional neural networks

Alex Krizhevsky

   Papers with code   Abstract  PDF

I present a new way to parallelize the training of convolutional neural networks across multiple GPUs. The method scales significantly better than all alternatives when applied to modern convolutional neural networks...

Benchmarked Models

RANK
MODEL
REPO
CODE RESULT
PAPER RESULT
ε-REPRODUCED
BUILD
2
AlexNet-b
58.4%
--
3
AlexNet
(single)
56.1%
57.1%