.. |
configs
|
834272531a
Add support of Microsoft Phi-2 model to DeepSpeed-FastGen (#4812)
|
9 months ago |
implementations
|
ccfdb84e2a
FP6 quantization end-to-end. (#5234)
|
7 months ago |
interfaces
|
5411030529
Inference Checkpoints in V2 (#4664)
|
11 months ago |
__init__.py
|
38b41dffa1
DeepSpeed-FastGen (#4604)
|
11 months ago |
ds_module.py
|
38b41dffa1
DeepSpeed-FastGen (#4604)
|
11 months ago |
heuristics.py
|
e3d873a00e
Fix the FP6 kernels compilation problem on non-Ampere GPUs. (#5333)
|
6 months ago |
module_registry.py
|
38b41dffa1
DeepSpeed-FastGen (#4604)
|
11 months ago |