Optimized Transformer implementation in GPU

Krishna Panthi

School of Computing, Clemson University
kpanthi@clemson.edu
In this research, we attempt to reproduce the work done in the paper "E.T.: Re-Thinking Self-Attention for Transformer Models on GPUs". Full report is presented in the paper.