Commit History

Author SHA1 Message Date
  Ramya Ramineni 7bcb4fabeb Enable CG headers on ROCm (#1821) 2 years ago
  Jeff Rasley c3c8d5dd93 AMD support (#1430) 2 years ago
  Reza Yazdani 8e891aa568 Transformer kernel/fix layer norm (#1587) 2 years ago
  Jeff Rasley a10e4811fe force set lf instead of crlf (https://github.com/pre-commit/pre-commit-hooks#mixed-line-ending) (#1598) 2 years ago
  Ivan Komarov bfe7f0db2a Fix cudaErrorInvalidConfiguration in attn_softmax() for large seq_length*heads values (#1239) 3 years ago
  Reza Yazdani fd2f970bdf Transformer-kernel - supporting any arbitrary sequence-length (#587) 3 years ago
  RezaYazdaniAminabadi f0f2a70268 support dynamic sequence length in transformer kernels (#424) 4 years ago
  Jeff Rasley 734d8991c8 Transformer kernel release (#242) 4 years ago