Commit History

Author SHA1 Message Date
  Olatunji Ruwase aa4a7401f8 ZeRO-Inference refresh (#4197) 1 year ago
  Polisetty V R K Jyothendra Varma abe293b402 use correct ckpt path when base_dir not available (#4101) 1 year ago
  stephen youn 69d1b9f978 DeepSpeed-Triton for Inference (#3748) 1 year ago
  Michael Wyatt da8f4e01b5 allow dict datatype for checkpoints (#3007) 1 year ago
  Connor Holmes 0a61d5d664 Hybrid Engine Refactor and Llama Inference Support (#3425) 1 year ago
  Olatunji Ruwase 47f9f13bd3 DeepSpeed Chat (#3186) 1 year ago
  Michael Wyatt b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 year ago
  Jeff Rasley 91d63e0228 update formatter version and style settings (#3098) 1 year ago
  Molly Smith 27e1b02deb Remove bf16 from inference config dtye enum (#3010) 1 year ago
  Jeff Rasley da84e60d98 add missing license info to top of all source code (#2889) 1 year ago
  Ammar Ahmad Awan e4b3b610ba Refactor DS inference API. No longer need replace_method. (#2831) 1 year ago
  Michael Wyatt c5f85858a8 add missing moe deprecated fields to inference config (#2556) 1 year ago
  Ammar Ahmad Awan 90ae688442 Pass down the new DS inference config to replace_transformer_layer. (#2539) 1 year ago
  Michael Wyatt c5ee27f737 Add MII tests (#2533) 1 year ago
  Michael Wyatt fe6785447d Add missing Inference sub-configs (#2518) 1 year ago
  Michael Wyatt e59f80549e Fix backward compatibility for InferenceConfig (#2516) 1 year ago
  Lev Kurilenko d40a15fcf0 Add max_tokens alias to max_out_tokens arg to maintain backwards compatibility (#2508) 1 year ago
  Ammar Ahmad Awan b5d18a6ab3 DeepSpeed inference config. (#2459) (#2472) 1 year ago