Ramya Ramineni
|
7bcb4fabeb
Enable CG headers on ROCm (#1821)
|
2 年之前 |
Jeff Rasley
|
c3c8d5dd93
AMD support (#1430)
|
2 年之前 |
Reza Yazdani
|
8e891aa568
Transformer kernel/fix layer norm (#1587)
|
2 年之前 |
Jeff Rasley
|
a10e4811fe
force set lf instead of crlf (https://github.com/pre-commit/pre-commit-hooks#mixed-line-ending) (#1598)
|
2 年之前 |
Ivan Komarov
|
bfe7f0db2a
Fix cudaErrorInvalidConfiguration in attn_softmax() for large seq_length*heads values (#1239)
|
3 年之前 |
Reza Yazdani
|
fd2f970bdf
Transformer-kernel - supporting any arbitrary sequence-length (#587)
|
3 年之前 |
RezaYazdaniAminabadi
|
f0f2a70268
support dynamic sequence length in transformer kernels (#424)
|
4 年之前 |
Jeff Rasley
|
734d8991c8
Transformer kernel release (#242)
|
4 年之前 |