Connor Holmes | 0a61d5d664 | Hybrid Engine Refactor and Llama Inference Support (#3425) | 1 year ago
Olatunji Ruwase | 47f9f13bd3 | DeepSpeed Chat (#3186) | 1 year ago
Michael Wyatt | b361c72761 | Update DeepSpeed copyright license to Apache 2.0 (#3111) | 1 year ago
Jeff Rasley | 91d63e0228 | update formatter version and style settings (#3098) | 1 year ago
Ma, Guokai | 0acf7e9c48 | [RFC] add device abstraction to allow other device than CUDA be used (#2221) | 1 year ago
Lev Kurilenko | fd1449c766 | Port Reza's INT8-quantization fix to container architecture (#2725) | 1 year ago
Lev Kurilenko | 10f3c301a0 | Add container load checkpoint error reporting + refactor (#2792) | 1 year ago
Ammar Ahmad Awan | 867da307d0 | Inference Refactor (replace_with_policy, model_implementations) (#2554) | 1 year ago