Commit History

Author SHA1 Message Date
  stephen youn 69d1b9f978 DeepSpeed-Triton for Inference (#3748) 1 year ago
  Connor Holmes 0a61d5d664 Hybrid Engine Refactor and Llama Inference Support (#3425) 1 year ago
  Olatunji Ruwase 47f9f13bd3 DeepSpeed Chat (#3186) 1 year ago
  Michael Wyatt b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 year ago
  Jeff Rasley 91d63e0228 update formatter version and style settings (#3098) 1 year ago
  Jeff Rasley da84e60d98 add missing license info to top of all source code (#2889) 1 year ago
  Lev Kurilenko 10f3c301a0 Add container load checkpoint error reporting + refactor (#2792) 1 year ago
  Lev Kurilenko 0a73e6e613 Container param cleanup + remove qkv_merging (#2780) 1 year ago
  Ammar Ahmad Awan 867da307d0 Inference Refactor (replace_with_policy, model_implementations) (#2554) 1 year ago