A Domain-Specific Architecture for Deep Neural Networks

N. Jouppi, et al., Google

Communications of the ACM, September 2018

Google engineers apply new Tensor Processing Unit (TPU) chips to neural network applications. The chips run up to 30 times faster and are up to 80 times more energy efficient than competing chips.

These improvements have been applied to real-time data center applications of deep neural networks, such as image recognition, language translation, and search. The applications are designed using open-source TensorFlow software.