提交历史

作者 SHA1 备注 提交日期
  Alejandro Dubrovsky 9a8f6a1d3c Account for expert parameters when calculating the total number of parameters in the model (#3720) 1 年之前
  Conglong Li b692d236d5 add Chinese Zhihu social account (#3755) 1 年之前
  mzl 5a5340d03b remove UtilsBuilder load, use torch (un)flatten ops (#3728) 1 年之前
  Logan Adams cd911f9ab2 Fix output transpose dimension bugs (#3747) 1 年之前
  tensor-tang 45466afa34 fix hybrid engine mlp module (#3736) 1 年之前
  john li 46bb08c2df Include cublas error details when getting cublas handle fails (#3695) 1 年之前
  StrayWarrior 09332dbf9f Fix autotuner get_gas_from_user_config (#3664) 1 年之前
  Logan Adams 1b40182312 Fix apex install bugs (#3741) 1 年之前
  Joe Mayer 6f4fc30b58 FP8 unittest for H100 (#3731) 1 年之前
  Ma, Guokai 5289d691d3 Documentation for DeepSpeed Accelerator Abstraction Interface (#3184) 1 年之前
  Jeff Rasley 54bd9e290a bump to 0.9.5 1 年之前
  Logan Adams a65f6b9e9b Update Dockerfile with newer cuda and torch. (#3716) 1 年之前
  Abhilash Majumder 26b3e73298 single node pdsh sigkill (#3730) 1 年之前
  Ma, Guokai 8bfbb0e3ca [Bugfix][CPU] Remove C++ version in CPU OpBuilder (#3643) 1 年之前
  Olatunji Ruwase 046afcedb4 Increase tensor creator coverage (#3684) 1 年之前
  Logan Adams fc8e5c8858 Fix typo in name of hybrid engine function (#3704) 1 年之前
  hablb 0977106ac9 zero3 performance optimizations (#3622) 1 年之前
  Conglong Li df42509786 DeepSpeed overview in Japanese (#3709) 1 年之前
  john li d414678df7 Small tweak on cuda version mismatch documentation (#3706) 1 年之前
  Michael Wyatt fb2b4ab11a Fix unit test typo in tests/unit/ops/transformer/inference (#3697) 1 年之前
  digger yu c5edc91ecb change partititon_name to partition_name (#3700) 1 年之前
  Reza Yazdani 34a9fbf1a3 Fix gpt-j inference issue (#3639) 1 年之前
  Logan Adams 7e59ef1230 Revert "fix typo name (#3689)" (#3702) 1 年之前
  tensor-tang f2f5f21b52 fix typo name (#3689) 1 年之前
  Logan Adams d8aaa58122 Fix incorrectly formatted f string (#3698) 1 年之前
  Abhilash Majumder c17313fb24 Correct world_size/backend for mpi (#3694) 1 年之前
  Byungsoo Oh b7f463ddeb Fix local rank mismatch for heterogeneous nodes (#3409) 1 年之前
  Ramya Ramineni 4cd0a003f5 non-JIT build fix on ROCm (#3638) 1 年之前
  Siddharth Singh 2d737eddcc Update README to add ICS'23 paper (#3687) 1 年之前
  Olatunji Ruwase e5fe5f65e8 Use logger in accelerator (#3682) 1 年之前