Heyang Qin
|
dc01cee5ca
using container when loading inference checkpoints (#2875)
|
1 year ago |
Jeff Rasley
|
da84e60d98
add missing license info to top of all source code (#2889)
|
1 year ago |
Lev Kurilenko
|
fd1449c766
Port Reza's INT8-quantization fix to container architecture (#2725)
|
1 year ago |
Lev Kurilenko
|
10f3c301a0
Add container load checkpoint error reporting + refactor (#2792)
|
1 year ago |
Lev Kurilenko
|
0a73e6e613
Container param cleanup + remove qkv_merging (#2780)
|
1 year ago |
Ammar Ahmad Awan
|
867da307d0
Inference Refactor (replace_with_policy, model_implementations) (#2554)
|
1 year ago |