.. |
__init__.py
|
f6fce50deb
Isolate src code and testing for DeepSpeed-FastGen (#4610)
|
11 月之前 |
test_blas_linear_module.py
|
f6fce50deb
Isolate src code and testing for DeepSpeed-FastGen (#4610)
|
11 月之前 |
test_blocked_attn.py
|
c00388a2ef
Mixtral FastGen Support (#4828)
|
10 月之前 |
test_cuda_pre_ln_module.py
|
f6fce50deb
Isolate src code and testing for DeepSpeed-FastGen (#4610)
|
11 月之前 |
test_custom_module.py
|
f6fce50deb
Isolate src code and testing for DeepSpeed-FastGen (#4610)
|
11 月之前 |
test_cutlass_moe.py
|
c00388a2ef
Mixtral FastGen Support (#4828)
|
10 月之前 |
test_post_ln_module.py
|
f6fce50deb
Isolate src code and testing for DeepSpeed-FastGen (#4610)
|
11 月之前 |
test_pre_rms_module.py
|
f6fce50deb
Isolate src code and testing for DeepSpeed-FastGen (#4610)
|
11 月之前 |
test_quantized_linear_module.py
|
e3d873a00e
Fix the FP6 kernels compilation problem on non-Ampere GPUs. (#5333)
|
6 月之前 |