Yejing-Lai
|
1787673edc
fix num_kv_heads sharding in uneven autoTP for Falcon-40b (#4712)
|
9 月之前 |
Omar Elayan
|
4c2cac0340
Inference changes for incorporating meta loading checkpoint (#4692)
|
10 月之前 |
baodi
|
faa00b1373
fix falcon model load from_config meta_data error (#4783)
|
10 月之前 |
RyanInnerpeace
|
7b818ee961
improve the way to determine whether a variable is None (#4782)
|
10 月之前 |
Wang, Yi
|
29f840fd1a
fix autoTP issue for mpt (trust_remote_code=True) (#4787)
|
10 月之前 |
Dino Chen
|
6ea44d02c6
fix num_kv_heads sharding in autoTP for the new in-repo Falcon-40B (#4654)
|
11 月之前 |
Ma, Guokai
|
f15cccfa0c
[AutoTP] Make AutoTP work when num_heads not divisible by number of workers (#4011)
|
1 年之前 |
Yejing-Lai
|
6763e2de61
add lm_head and embed_out tensor parallel (#3962)
|
1 年之前 |
Yejing-Lai
|
7220e7f8f7
fix cpu loading model partition OOM (#4353)
|
1 年之前 |
Reza Yazdani
|
468882fb68
Add the policy to run llama model from the official repo (#4313)
|
1 年之前 |
Dino Chen
|
0712e29920
add meta onDevice support for LLAMA2 (#4147)
|
1 年之前 |
Wang, Yi
|
5e16eb2c93
enable autoTP for mpt in huggingface model hub without trust_remote_code (#4062)
|
1 年之前 |
Molly Smith
|
341cefd2a4
Return nn.parameter type for weights and biases (#4146)
|
1 年之前 |
digger yu
|
4cde5da88e
fix typo: change polciies to policies (#4090)
|
1 年之前 |
Molly Smith
|
94c7233a8b
Refactor autoTP inference for HE (#4040)
|
1 年之前 |
mzl
|
6b877d2dbc
autoTP for fused qkv weight (#3844)
|
1 年之前 |
Yejing-Lai
|
d6f622176d
Add GPTNeoX AutoTP support (#3778)
|
1 年之前 |
Reza Yazdani
|
f3c93b056d
Add FALCON Auto-TP Support (#3640)
|
1 年之前 |
jianan-gu
|
2c63e349e4
Enable auto TP policy for llama model (#3170)
|
1 年之前 |
Wang, Yi
|
6ba0024d54
Enable autoTP for bloom (#3035)
|
1 年之前 |
Michael Wyatt
|
b361c72761
Update DeepSpeed copyright license to Apache 2.0 (#3111)
|
1 年之前 |
Jeff Rasley
|
91d63e0228
update formatter version and style settings (#3098)
|
1 年之前 |
Molly Smith
|
2ede0d942a
AutoTP Assert Kernel Injection Support (#2939)
|
1 年之前 |
Molly Smith
|
4ae3a3da0d
TP unsupported models and assertions (#2810)
|
1 年之前 |
Jeff Rasley
|
da84e60d98
add missing license info to top of all source code (#2889)
|
1 年之前 |
Molly Smith
|
46784cb58e
Fix auto TP for duplicate modules with different gems (#2784)
|
1 年之前 |
Molly Smith
|
d59b572911
Automatic tensor parallelism v2 (#2670)
|
1 年之前 |