Posts by John Tran
Data Science
Jun 10, 2019
Tips for Optimizing GPU Performance Using Tensor Cores
Our most popular question is "What can I do to get great GPU performance for deep learning?" We’ve recently published a detailed Deep Learning Performance...
13 MIN READ
Simulation / Modeling / Design
Dec 05, 2017
CUTLASS: Fast Linear Algebra in CUDA C++
Update May 21, 2018: CUTLASS 1.0 is now available as Open Source software at the CUTLASS repository. CUTLASS 1.0 has changed substantially from our preview...
25 MIN READ