CurryRice233
|
d873ce6159
[NPU] Fix npu offload bug (#4883)
|
9 months ago |
hipudding
|
c1ba6a104f
[CANN] Support cpu offload optimizer for Ascend NPU (#4568)
|
11 months ago |
Ma, Guokai
|
0f5406323c
[CPU] FusedAdam and CPU training support (#3991)
|
1 year ago |
Olatunji Ruwase
|
47f9f13bd3
DeepSpeed Chat (#3186)
|
1 year ago |
Michael Wyatt
|
b361c72761
Update DeepSpeed copyright license to Apache 2.0 (#3111)
|
1 year ago |
Jeff Rasley
|
da84e60d98
add missing license info to top of all source code (#2889)
|
1 year ago |
Olatunji Ruwase
|
3f210c9715
CUDA optional deepspeed ops (#2507)
|
1 year ago |
Reza Yazdani
|
a04480e192
Fix the half-precision version of CPU-Adam (#2032)
|
2 years ago |
Victor
|
74493b2bee
support CPU Adam and Adagrad on Windows with SDK 10.0.22000 (#1634)
|
2 years ago |
Jeff Rasley
|
a10e4811fe
force set lf instead of crlf (https://github.com/pre-commit/pre-commit-hooks#mixed-line-ending) (#1598)
|
2 years ago |
Wenhao Hu
|
8abdaee243
Add cpu adagrad (#1358)
|
3 years ago |
Ammar Ahmad Awan
|
f28432441b
DeepSpeed MoE (#1310)
|
3 years ago |
Adam Moody
|
f65ff908ab
enable cpu adam op on powerpc architectures (#1213)
|
3 years ago |
Jeff Rasley
|
0d4a54a04d
ZeRO-Infinity (#976)
|
3 years ago |
Reza Yazdani
|
ee1ffe2e88
CPU-Adam fix for scalar mode (#735)
|
3 years ago |
Reza Yazdani
|
9f52a36fad
tracking optimizer step in cpu-adam when loading checkpoint (#564)
|
3 years ago |
Reza Yazdani
|
7d4d742bf0
Fixing CPU-Adam convergence issue (#503)
|
4 years ago |
Reza Yazdani
|
f5aa2547d8
Add CPUAdam optimizer for zero-offload in deepspeed engine (#484)
|
4 years ago |
Jeff Rasley
|
41db1c2f03
ZeRO-Offload release (#391)
|
4 years ago |