Jeff Rasley
|
a10e4811fe
force set lf instead of crlf (https://github.com/pre-commit/pre-commit-hooks#mixed-line-ending) (#1598)
|
2 years ago |
Wenhao Hu
|
8abdaee243
Add cpu adagrad (#1358)
|
3 years ago |
Ammar Ahmad Awan
|
f28432441b
DeepSpeed MoE (#1310)
|
3 years ago |
Stas Bekman
|
a029239812
clean up logging (#1190)
|
3 years ago |
Stas Bekman
|
c79184ebcc
fix cpu_adam memory leak on deepspeed re-use in the same process (#896)
|
3 years ago |
Reza Yazdani
|
ee1ffe2e88
CPU-Adam fix for scalar mode (#735)
|
3 years ago |
Reza Yazdani
|
9f52a36fad
tracking optimizer step in cpu-adam when loading checkpoint (#564)
|
3 years ago |
Reza Yazdani
|
7d4d742bf0
Fixing CPU-Adam convergence issue (#503)
|
4 years ago |
Reza Yazdani
|
4c37d70520
fixing the AVX_256 compatibility (#497)
|
4 years ago |
Reza Yazdani
|
f5aa2547d8
Add CPUAdam optimizer for zero-offload in deepspeed engine (#484)
|
4 years ago |
Jeff Rasley
|
41db1c2f03
ZeRO-Offload release (#391)
|
4 years ago |