Yejing-Lai
|
85132adc31
enable starcode((kv_head=1)) autotp (#4896)
|
9 months ago |
Wang, Yi
|
c8c57b8c24
add sharded loading for safetensors in AutoTP (#4854)
|
9 months ago |
baodi
|
c20f6fa4e0
support baichuan model: (#4721)
|
10 months ago |
RyanInnerpeace
|
7b818ee961
improve the way to determine whether a variable is None (#4782)
|
10 months ago |
Ma, Guokai
|
f15cccfa0c
[AutoTP] Make AutoTP work when num_heads not divisible by number of workers (#4011)
|
1 year ago |
Yejing-Lai
|
6763e2de61
add lm_head and embed_out tensor parallel (#3962)
|
1 year ago |
Wang, Yi
|
d72edb3b0d
fix lm head overriden issue, move it from checkpoint in-loop loading to out loop (#4206)
|
1 year ago |
Ammar Ahmad Awan
|
b9d719a6d3
Pass base_dir to model files can be loaded for auto-tp/meta-tensor. (#4348)
|
1 year ago |
Satpal Singh Rathore
|
430510bfce
Checks for user injection policy (#3052)
|
1 year ago |
Dino Chen
|
0712e29920
add meta onDevice support for LLAMA2 (#4147)
|
1 year ago |
digger yu
|
4cde5da88e
fix typo: change polciies to policies (#4090)
|
1 year ago |
Lev Kurilenko
|
1ba4098918
Fix Stable Diffusion Injection (#4078)
|
1 year ago |
Molly Smith
|
94c7233a8b
Refactor autoTP inference for HE (#4040)
|
1 year ago |
mzl
|
6b877d2dbc
autoTP for fused qkv weight (#3844)
|
1 year ago |
Wang, Yi
|
0bafeac491
enable autoTP for MPT (#3861)
|
1 year ago |
Wang, Yi
|
76953a37b7
fix opt-350m shard loading issue in AutoTP (#3600)
|
1 year ago |
Dino Chen
|
f3943cf910
add llama2 autoTP support in replace_module (#4022)
|
1 year ago |
digger yu
|
ce535945e6
fix: change ==NONE to is (#3923)
|
1 year ago |
Reza Yazdani
|
f3c93b056d
Add FALCON Auto-TP Support (#3640)
|
1 year ago |
Ma, Guokai
|
1f72082fc0
[CPU] Support Intel CPU inference (#3041)
|
1 year ago |
Wang, Yi
|
b31b46c0d1
fix regression in shard checkpoint loading in AutoTP Path caused by qkv_copy() is deleted and add UT case for shard checkpoint loading in AutoTP (#3457)
|
1 year ago |
Wang, Yi
|
d10b8ca011
add sharded checkpoint loading for AutoTP path to reduce the peak mem… (#3102)
|
1 year ago |
Connor Holmes
|
0a61d5d664
Hybrid Engine Refactor and Llama Inference Support (#3425)
|
1 year ago |
Reza Yazdani
|
3e8564645d
Add HE support for the rest of model containers (#3191)
|
1 year ago |
Molly Smith
|
496a9a3a62
Diffusers 0.15.0 bug fix (#3345)
|
1 year ago |
Olatunji Ruwase
|
47f9f13bd3
DeepSpeed Chat (#3186)
|
1 year ago |
Michael Wyatt
|
b361c72761
Update DeepSpeed copyright license to Apache 2.0 (#3111)
|
1 year ago |
Jeff Rasley
|
91d63e0228
update formatter version and style settings (#3098)
|
1 year ago |
Molly Smith
|
9ea0fdc2ce
Assert mp_size is factor of model dimensions (#2891)
|
1 year ago |
Heyang Qin
|
dc01cee5ca
using container when loading inference checkpoints (#2875)
|
1 year ago |