SLIDE 11 Performance is dependent on:
- 1. Hardware Architectures
– GPUs – Multi-/Many-core CPUs
– cuDNN (for GPUs) – MKL-DNN/MKL 2017 (for CPUs)
- 3. Hardware/Software co-design
– Software libraries optimized for
- ne platform will not help the
- ther!
Understanding the Impact of Execution Environments
DLApplications(Image R ecognition, S peech P rocessing, etc.) DLFrameworks(Caffe, T ensorFlow, etc.) BLASLibraries Hardware Many-core GPU (P ascal P100) Generic ConvolutionLayer MKL Optimized ConvolutionLayer MKL 2017 cuDNN/ cuBLAS Multi-/ Many-core (Xeon, XeonP hi) cuDNN Optimized ConvolutionLayer Other BLASLibraries OpenBLAS
11
A TLAS Other Processors