Connor Holmes
|
0a61d5d664
Hybrid Engine Refactor and Llama Inference Support (#3425)
|
1 年之前 |
Reza Yazdani
|
3e8564645d
Add HE support for the rest of model containers (#3191)
|
1 年之前 |
Olatunji Ruwase
|
47f9f13bd3
DeepSpeed Chat (#3186)
|
1 年之前 |
Michael Wyatt
|
b361c72761
Update DeepSpeed copyright license to Apache 2.0 (#3111)
|
1 年之前 |
Jeff Rasley
|
91d63e0228
update formatter version and style settings (#3098)
|
1 年之前 |
Molly Smith
|
17fa0876ad
Always convert input mask to half (#2851)
|
1 年之前 |
Lev Kurilenko
|
0a73e6e613
Container param cleanup + remove qkv_merging (#2780)
|
1 年之前 |
Ma, Guokai
|
9548d48f48
Abstract accelerator (step 2) (#2560)
|
1 年之前 |
Jeff Rasley
|
d9b788d773
tweaks to ds-attn, distilbert policy, and mup (#2649)
|
1 年之前 |
Jeff Rasley
|
bb68c526ad
[inference] ds-attention refactor w.r.t. ops (#2623)
|
1 年之前 |
Reza Yazdani
|
35b350b28c
Fix quantized-inference & Add generic support of checkpoint loading (#2547)
|
1 年之前 |
Connor Holmes
|
e7e7595502
Stable Diffusion Enhancements (#2491)
|
1 年之前 |
Kevin Ko
|
6f77da1bae
Add `scale_attn_by_inverse_layer_idx` feature (#2486)
|
1 年之前 |
Ammar Ahmad Awan
|
35458da0e0
Create a new folder structure to isolate model-specific code in DS (#2464)
|
1 年之前 |