提交历史

作者 SHA1 备注 提交日期
  Ramya Ramineni 7bcb4fabeb Enable CG headers on ROCm (#1821) 2 年之前
  Jeff Rasley c3c8d5dd93 AMD support (#1430) 2 年之前
  Reza Yazdani 8e891aa568 Transformer kernel/fix layer norm (#1587) 2 年之前
  Jeff Rasley a10e4811fe force set lf instead of crlf (https://github.com/pre-commit/pre-commit-hooks#mixed-line-ending) (#1598) 2 年之前
  Ivan Komarov bfe7f0db2a Fix cudaErrorInvalidConfiguration in attn_softmax() for large seq_length*heads values (#1239) 3 年之前
  Reza Yazdani fd2f970bdf Transformer-kernel - supporting any arbitrary sequence-length (#587) 3 年之前
  RezaYazdaniAminabadi f0f2a70268 support dynamic sequence length in transformer kernels (#424) 4 年之前
  Jeff Rasley 734d8991c8 Transformer kernel release (#242) 4 年之前