ZHENG, Zhen e3d873a00e Fix the FP6 kernels compilation problem on non-Ampere GPUs. (#5333) 6 months ago
..
configs 834272531a Add support of Microsoft Phi-2 model to DeepSpeed-FastGen (#4812) 9 months ago
implementations ccfdb84e2a FP6 quantization end-to-end. (#5234) 7 months ago
interfaces 5411030529 Inference Checkpoints in V2 (#4664) 11 months ago
__init__.py 38b41dffa1 DeepSpeed-FastGen (#4604) 11 months ago
ds_module.py 38b41dffa1 DeepSpeed-FastGen (#4604) 11 months ago
heuristics.py e3d873a00e Fix the FP6 kernels compilation problem on non-Ampere GPUs. (#5333) 6 months ago
module_registry.py 38b41dffa1 DeepSpeed-FastGen (#4604) 11 months ago