Commit History

Author         SHA1        Date         Message
Logan Adams    4b35833379  1 year ago   Revert "Update megatron GPT2Model"
Logan Adams    1ec34e54de  1 year ago   Update megatron GPT2Model
Michael Wyatt  b361c72761  1 year ago   Update DeepSpeed copyright license to Apache 2.0 (#3111)
Jeff Rasley    91d63e0228  1 year ago   update formatter version and style settings (#3098)
Ma, Guokai     0acf7e9c48  1 year ago   [RFC] add device abstraction to allow other device than CUDA be used (#2221)
Jeff Rasley    da84e60d98  1 year ago   add missing license info to top of all source code (#2889)
Alex Hedges    316c4a43e0  2 years ago  Add flake8 to pre-commit checks (#2051)
Justin Chiu    4912e0ad7e  2 years ago  Various ZeRO Stage3 Optimizations + Improvements (including bfloat16 support) (#1453)
Jeff Rasley    e2fdd254ed  3 years ago  Big science related changes (#1407)
Jeff Rasley    6996bb0159  3 years ago  Sparse attn triton v1.0 support + torch1.8 test runner (#1374)
Reza Yazdani   ed3de0c21b  3 years ago  Quantization + inference release (#1091)