提交历史

作者 SHA1 备注 提交日期
  Alexander Jipa b354c28b76 polishing timers and log_dist (#3996) 1 年之前
  digger yu fc8de76f1d Simplify chain comparisons, remove redundant parentheses (#3912) 1 年之前
  Ma, Guokai 1f72082fc0 [CPU] Support Intel CPU inference (#3041) 1 年之前
  Michael Wyatt b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 年之前
  Jeff Rasley 91d63e0228 update formatter version and style settings (#3098) 1 年之前
  Ma, Guokai 98cc35b6a8 Abstract accelerator (step 3) (#2677) 1 年之前
  Alexander Jipa 0f0e38c520 fixes #2498 (#2603) 1 年之前
  ShijieZZZZ 340fc0cf19 Report progress at gradient accumulation boundary (#2553) 1 年之前
  Jeff Rasley 5bd09a8f83 Allow turning off loss scaling wrt GAS + update tput calculator (#2140) 2 年之前
  Alex Hedges 316c4a43e0 Add flake8 to pre-commit checks (#2051) 2 年之前
  Quentin Anthony 5349347bb6 DeepSpeed Communication Profiling and Logging (#2012) 2 年之前
  Zeyu b05237876e fixed "None type has no len()" (#2091) 2 年之前
  Karim Foda 735406e536 fix import errors (#2026) 2 年之前
  Ammar Ahmad Awan 36ad3119d5 DeepSpeed comm backend v1 (#1985) 2 年之前
  Quentin Anthony 0d36893281 Fix timer typo (#1964) 2 年之前
  Olatunji Ruwase fee7313598 Use cuda events to improve timing for multi-stream execution (#1881) 2 年之前
  Justin Chiu 4912e0ad7e Various ZeRO Stage3 Optimizations + Improvements (including bfloat16 support) (#1453) 2 年之前
  Cheng Li 9caa74e577 Autotuning (#1554) 2 年之前
  Cheng Li 4544b7d2f1 Improve flops profiler functionality (#1065) 3 年之前
  Sean Naren 6fb16100ba Replace timer print rank 0 with logging (#732) 3 年之前
  Jeff Rasley 0dc8420042 Dependency pruning (#528) 3 年之前
  Shaden Smith 65c2f974d8 Pipeline parallel training engine. (#392) 4 年之前
  Jeff Rasley e5bbc2e559 Sparse attn + ops/runtime refactor + v0.3.0 (#343) 4 年之前