Masahiro Tanaka
|
c56a4b9e0d
Improve universal checkpoint (#5289)
|
6 months ago |
Ma, Guokai
|
c08e69f212
Make op builder detection adapt to accelerator change (#5206)
|
7 months ago |
Masahiro Tanaka
|
0a10bd427e
Fix alignment of optimizer states when loading (#5105)
|
8 months ago |
Alexander Jipa
|
e801e6d718
skipping redundant MoE optimizer state loading (#4120)
|
1 year ago |
Olatunji Ruwase
|
dd8df20fe0
zero3 checkpoint frozen params (#3205)
|
1 year ago |
Michael Wyatt
|
b361c72761
Update DeepSpeed copyright license to Apache 2.0 (#3111)
|
1 year ago |
Jeff Rasley
|
91d63e0228
update formatter version and style settings (#3098)
|
1 year ago |
Jeff Rasley
|
da84e60d98
add missing license info to top of all source code (#2889)
|
1 year ago |
Alexander Jipa
|
cfead55132
fixes #2389 (#2411)
|
2 years ago |
Michael Wyatt
|
7e085b6258
fix for pytest picking up local deepspeed dir instead of installed deepspeed (#2299)
|
2 years ago |
Olatunji Ruwase
|
217338beb6
Refactor dist tests: Checkpointing (#2202)
|
2 years ago |