Olatunji Ruwase 0e942df008 Add Linear warmup+decay lr schedule (#414) 4 years ago
..
model 41db1c2f03 ZeRO-Offload release (#391) 4 years ago
onebitadam 01726ce2b8 Add 1-bit Adam support to DeepSpeed (#380) 4 years ago
perf 41db1c2f03 ZeRO-Offload release (#391) 4 years ago
small_model_debugging f2ac7eafd5 ZeRO-2 (#217) 4 years ago
unit 0e942df008 Add Linear warmup+decay lr schedule (#414) 4 years ago