Commit History

Author SHA1 Message Date
  Xiaoxia (Shirley) Wu 41744d59f8 DeepSpeed-VisualChat Blog (#4446) 1 year ago
  Yuxiang Wei 986b5958e2 fix: wrong documentation of `ignore_unused_parameters` (#4418) 1 year ago
  Logan Adams cd0d2ba2df Enable ad-hoc running of cpu_inference (#4444) 1 year ago
  Logan Adams fd98af256e Fixup check release version script (#4413) 1 year ago
  Jackmin801 2c67b58b5f fix: check-license (#4432) 1 year ago
  Jackmin801 58a206059f Small docstring fix (#4431) 1 year ago
  Ma, Guokai 9a55291452 [CCLBackend] Using parallel memcpy for inference_all_reduce (#4404) 1 year ago
  Liangliang-Ma 1760627eb9 Zero infinity xpu support (#4130) 1 year ago
  Jackmin801 2f73b834b5 change default set_to_none in zero_grad methods (#4438) 1 year ago
  Logan Adams 0636c74c5e Update cp_inf wokrflow (#4424) 1 year ago
  Yejing-Lai 7220e7f8f7 fix cpu loading model partition OOM (#4353) 1 year ago
  Yejing-Lai 388c84834f add CPU autotp UT (#4263) 1 year ago
  Abhishek Jindal 28b9d5c231 Add condition when dimension is greater than 2 (#4390) 1 year ago
  Logan Adams 58619402b5 Update nv-transformers workflow to use cu11.6 (#4412) 1 year ago
  Abhishek Jindal e339364127 Add torch no grad condition (#4391) 1 year ago
  Conglong Li aea10eec5a update deepspeed4science blog (#4408) 1 year ago
  Ziyang 60bf78454c Fix incorrect assignment of self.quantized_nontrainable_weights (#4399) 1 year ago
  Masahiro Tanaka f8d3ec7fa1 save/restore step in param groups with zero 1 (#4396) 1 year ago
  Michael Wyatt 4c35880b16 Allow multiple inference engines in single script (#4384) 1 year ago
  stephen youn 0e0748c579 adds triton flash attention2 kernel (#4337) 1 year ago
  Elsa Granger 4fc2c8e7d5 Fix llama meta tensor loading in AutoTP and kernel injected inference (#3608) 1 year ago
  Olatunji Ruwase 463dea2722 Fix min torch version (#4375) 1 year ago
  Logan Adams 17957728c0 Fix multinode runner to properly append to PDSH_SSH_ARGS_APPEND (#4373) 1 year ago
  Cheng Li 727609df4a add the missing method (#4363) 1 year ago
  cctry c58146471e Openfold fix (#4368) 1 year ago
  Conglong Li a99e5d3fb7 deepspeed4science japanese blog (#4369) 1 year ago
  Conglong Li 3592a22cfe deepspeed4science chinese blog (#4366) 1 year ago
  Logan Adams dcd3ae1954 Enable workflow dispatch on Torch 1.10 CI tests (#4361) 1 year ago
  Logan Adams dcf649c3e0 Update conda env to have max pydantic version (#4362) 1 year ago
  Conglong Li da7a1851c4 add deepspeed4science blog link (#4364) 1 year ago