Optimized Transformer implementation in GPU
Krishna Panthi
School of Computing, Clemson University
kpanthi@clemson.eduIn this research, we attempt to reproduce the work done in the paper "E.T.: Re-Thinking Self-Attention for Transformer Models on GPUs".
Full report is presented in the paper.