Peng Zou
|
ed10cc7382
Add support of Qwen models (7b, 14b, 72b) to DeepSpeed-FastGen (#4913)
|
9 months ago |
Arash Bakhtiari
|
834272531a
Add support of Microsoft Phi-2 model to DeepSpeed-FastGen (#4812)
|
9 months ago |
Arash Bakhtiari
|
a7900bcc3d
Add support of Falcon models (7b, 40b, 180b) to DeepSpeed-FastGen (#4790)
|
10 months ago |
Masahiro Tanaka
|
ab6b1e16bb
Add Japanese blog for DeepSpeed-FastGen (#4651)
|
11 months ago |
Masahiro Tanaka
|
d89027be61
Fix figure in FlexGen blog (#4624)
|
11 months ago |
Masahiro Tanaka
|
ff53c22485
Add number for latency comparison (#4612)
|
11 months ago |
Jeff Rasley
|
1d9e256c03
DeepSpeed-FastGen blog (#4607)
|
11 months ago |