Olatunji Ruwase
|
19aac8ad19
ZeRO-Offload: Integration code fixes (#370)
|
4 years ago |
Jeff Rasley
|
e45b5e4cd0
ZeRO-Offload v1 (squash) (#345)
|
4 years ago |
Jeff Rasley
|
e5bbc2e559
Sparse attn + ops/runtime refactor + v0.3.0 (#343)
|
4 years ago |
Olatunji Ruwase
|
607814feb9
Fix bug in fp32 optimizer state loading (#289)
|
4 years ago |
Jeff Rasley
|
abe2204ddd
Support fp32 grad clipping and fix max_grad_norm confusion (#232)
|
4 years ago |
Jeff Rasley
|
f2ac7eafd5
ZeRO-2 (#217)
|
4 years ago |
Shaden Smith
|
b2c87edfb6
Fix global_steps checkpoint loading. (#139)
|
4 years ago |
Samyam Rajbhandari
|
936117b589
Enhancement: Ability to load checkpoint without loading the optimizer… (#128)
|
4 years ago |