ZHENG, Zhen e3d873a00e Fix the FP6 kernels compilation problem on non-Ampere GPUs. (#5333) 6 月之前
..
__init__.py f6fce50deb Isolate src code and testing for DeepSpeed-FastGen (#4610) 11 月之前
test_blas_linear_module.py f6fce50deb Isolate src code and testing for DeepSpeed-FastGen (#4610) 11 月之前
test_blocked_attn.py c00388a2ef Mixtral FastGen Support (#4828) 10 月之前
test_cuda_pre_ln_module.py f6fce50deb Isolate src code and testing for DeepSpeed-FastGen (#4610) 11 月之前
test_custom_module.py f6fce50deb Isolate src code and testing for DeepSpeed-FastGen (#4610) 11 月之前
test_cutlass_moe.py c00388a2ef Mixtral FastGen Support (#4828) 10 月之前
test_post_ln_module.py f6fce50deb Isolate src code and testing for DeepSpeed-FastGen (#4610) 11 月之前
test_pre_rms_module.py f6fce50deb Isolate src code and testing for DeepSpeed-FastGen (#4610) 11 月之前
test_quantized_linear_module.py e3d873a00e Fix the FP6 kernels compilation problem on non-Ampere GPUs. (#5333) 6 月之前