Raza Sikander
|
b5cf30a085
Dtype support check for accelerator in UTs (#6360)
|
1 月之前 |
Logan Adams
|
263bfe2892
Update to HF_HOME from TRANSFORMERS_CACHE (#4816)
|
5 月之前 |
Wang, Yi
|
76953a37b7
fix opt-350m shard loading issue in AutoTP (#3600)
|
1 年之前 |
Yejing-Lai
|
7290aace9b
[CPU] Skip CPU support unimplemented error (#3633)
|
1 年之前 |
Wang, Yi
|
b31b46c0d1
fix regression in shard checkpoint loading in AutoTP Path caused by qkv_copy() is deleted and add UT case for shard checkpoint loading in AutoTP (#3457)
|
1 年之前 |
Michael Wyatt
|
b361c72761
Update DeepSpeed copyright license to Apache 2.0 (#3111)
|
1 年之前 |
Jeff Rasley
|
91d63e0228
update formatter version and style settings (#3098)
|
1 年之前 |
Jeff Rasley
|
da84e60d98
add missing license info to top of all source code (#2889)
|
1 年之前 |
Ammar Ahmad Awan
|
e4b3b610ba
Refactor DS inference API. No longer need replace_method. (#2831)
|
1 年之前 |
Olatunji Ruwase
|
3f210c9715
CUDA optional deepspeed ops (#2507)
|
1 年之前 |
Michael Wyatt
|
ccb8eb81fb
Add checkpoint sharding unit tests (#2561)
|
1 年之前 |