Commit History

| Author | SHA1 | Message | Date |
| --- | --- | --- | --- |
| Jeff Rasley | 91b4a93db0 | pytest skips for tests requiring certain ops (#411) | 4 years ago |
| Shaden Smith | 65c2f974d8 | Pipeline parallel training engine. (#392) | 4 years ago |
| Jeff Rasley | 41db1c2f03 | ZeRO-Offload release (#391) | 4 years ago |
| Jeff Rasley | e5bbc2e559 | Sparse attn + ops/runtime refactor + v0.3.0 (#343) | 4 years ago |
| Olatunji Ruwase | 607814feb9 | Fix bug in fp32 optimizer state loading (#289) | 4 years ago |
| Jeff Rasley | abe2204ddd | Support fp32 grad clipping and fix max_grad_norm confusion (#232) | 4 years ago |
| Jeff Rasley | f2ac7eafd5 | ZeRO-2 (#217) | 4 years ago |
| Shaden Smith | b2c87edfb6 | Fix global_steps checkpoint loading. (#139) | 4 years ago |
| Samyam Rajbhandari | 936117b589 | Enhancement: Ability to load checkpoint without loading the optimizer… (#128) | 4 years ago |