Olatunji Ruwase
|
aa4a7401f8
ZeRO-Inference refresh (#4197)
|
1 year ago |
Polisetty V R K Jyothendra Varma
|
abe293b402
use correct ckpt path when base_dir not available (#4101)
|
1 year ago |
stephen youn
|
69d1b9f978
DeepSpeed-Triton for Inference (#3748)
|
1 year ago |
Michael Wyatt
|
da8f4e01b5
allow dict datatype for checkpoints (#3007)
|
1 year ago |
Connor Holmes
|
0a61d5d664
Hybrid Engine Refactor and Llama Inference Support (#3425)
|
1 year ago |
Olatunji Ruwase
|
47f9f13bd3
DeepSpeed Chat (#3186)
|
1 year ago |
Michael Wyatt
|
b361c72761
Update DeepSpeed copyright license to Apache 2.0 (#3111)
|
1 year ago |
Jeff Rasley
|
91d63e0228
update formatter version and style settings (#3098)
|
1 year ago |
Molly Smith
|
27e1b02deb
Remove bf16 from inference config dtye enum (#3010)
|
1 year ago |
Jeff Rasley
|
da84e60d98
add missing license info to top of all source code (#2889)
|
1 year ago |
Ammar Ahmad Awan
|
e4b3b610ba
Refactor DS inference API. No longer need replace_method. (#2831)
|
1 year ago |
Michael Wyatt
|
c5f85858a8
add missing moe deprecated fields to inference config (#2556)
|
1 year ago |
Ammar Ahmad Awan
|
90ae688442
Pass down the new DS inference config to replace_transformer_layer. (#2539)
|
1 year ago |
Michael Wyatt
|
c5ee27f737
Add MII tests (#2533)
|
1 year ago |
Michael Wyatt
|
fe6785447d
Add missing Inference sub-configs (#2518)
|
1 year ago |
Michael Wyatt
|
e59f80549e
Fix backward compatibility for InferenceConfig (#2516)
|
1 year ago |
Lev Kurilenko
|
d40a15fcf0
Add max_tokens alias to max_out_tokens arg to maintain backwards compatibility (#2508)
|
1 year ago |
Ammar Ahmad Awan
|
b5d18a6ab3
DeepSpeed inference config. (#2459) (#2472)
|
1 year ago |