Lev Kurilenko
|
0a73e6e613
Container param cleanup + remove qkv_merging (#2780)
|
1 年之前 |
Reza Yazdani
|
9f41ffe4a6
Reset KV-cache at the beginning of text-generation (#2669)
|
1 年之前 |
Ma, Guokai
|
98cc35b6a8
Abstract accelerator (step 3) (#2677)
|
1 年之前 |
Jeff Rasley
|
d9b788d773
tweaks to ds-attn, distilbert policy, and mup (#2649)
|
1 年之前 |
Jeff Rasley
|
bb68c526ad
[inference] ds-attention refactor w.r.t. ops (#2623)
|
1 年之前 |
Reza Yazdani
|
35b350b28c
Fix quantized-inference & Add generic support of checkpoint loading (#2547)
|
1 年之前 |
Connor Holmes
|
e7e7595502
Stable Diffusion Enhancements (#2491)
|
1 年之前 |
Ammar Ahmad Awan
|
35458da0e0
Create a new folder structure to isolate model-specific code in DS (#2464)
|
1 年之前 |