Jeff Rasley
|
91b4a93db0
pytest skips for tests requiring certain ops (#411)
|
4 years ago |
Shaden Smith
|
65c2f974d8
Pipeline parallel training engine. (#392)
|
4 years ago |
Jeff Rasley
|
41db1c2f03
ZeRO-Offload release (#391)
|
4 years ago |
Jeff Rasley
|
e5bbc2e559
Sparse attn + ops/runtime refactor + v0.3.0 (#343)
|
4 years ago |
Olatunji Ruwase
|
607814feb9
Fix bug in fp32 optimizer state loading (#289)
|
4 years ago |
Jeff Rasley
|
abe2204ddd
Support fp32 grad clipping and fix max_grad_norm confusion (#232)
|
4 years ago |
Jeff Rasley
|
f2ac7eafd5
ZeRO-2 (#217)
|
4 years ago |
Shaden Smith
|
b2c87edfb6
Fix global_steps checkpoint loading. (#139)
|
4 years ago |
Samyam Rajbhandari
|
936117b589
Enhancement: Ability to load checkpoint without loading the optimizer… (#128)
|
4 years ago |