.. |
__init__.py
|
69d1b9f978
DeepSpeed-Triton for Inference (#3748)
|
1 年之前 |
attention.py
|
0e0748c579
adds triton flash attention2 kernel (#4337)
|
1 年之前 |
gelu.py
|
e8ed7419ed
update deepspeed to run with the most recent triton 2.1.0 (#4278)
|
1 年之前 |
layer_norm.py
|
69d1b9f978
DeepSpeed-Triton for Inference (#3748)
|
1 年之前 |
matmul_ext.py
|
e8ed7419ed
update deepspeed to run with the most recent triton 2.1.0 (#4278)
|
1 年之前 |
mlp.py
|
69d1b9f978
DeepSpeed-Triton for Inference (#3748)
|
1 年之前 |
ops.py
|
69d1b9f978
DeepSpeed-Triton for Inference (#3748)
|
1 年之前 |
residual_add.py
|
807d1b5dfc
scripts/check-torchcuda.py: add checking for tensor.is_cuda (#3843)
|
1 年之前 |
softmax.py
|
69d1b9f978
DeepSpeed-Triton for Inference (#3748)
|
1 年之前 |
triton_matmul_kernel.py
|
3f3e9fb11e
Fix autotune to support Triton 2.1 (#4340)
|
1 年之前 |