Commit History

Author SHA1 Message Date
  Lzhang-hub cc897ecf15 resolve KeyError: 'PDSH_SSH_ARGS_APPEND' (#5318) 6 months ago
  Yizhou Wang 6e1a6801d1 deepspeed/launcher: add launcher_helper as each rank's start portal (#4699) 9 months ago
  Michael Wyatt d37fc25d56 Refactor launcher user arg parsing (#4824) 10 months ago
  Ma, Guokai 04cd6af130 turn off I_MPI_PIN for impi launcher (#4531) 1 year ago
  Logan Adams 17957728c0 Fix multinode runner to properly append to PDSH_SSH_ARGS_APPEND (#4373) 1 year ago
  Hiromasa 8145b5e41f added port argument for ssh (#4117) 1 year ago
  Ma, Guokai 1f72082fc0 [CPU] Support Intel CPU inference (#3041) 1 year ago
  Yizhou Wang 4e886f0568 launcher/multinode_runner.py: mapping env variables in running cmd for mpich runner (#3372) 1 year ago
  Wang, Yi a748bfc6d0 fix mpich launcher issue in multi-node (#3078) 1 year ago
  Michael Wyatt b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 year ago
  Jeff Rasley 91d63e0228 update formatter version and style settings (#3098) 1 year ago
  mzl 8d53ac0cd3 Add MPICH Multinode Runner (#2839) 1 year ago
  Jeff Rasley da84e60d98 add missing license info to top of all source code (#2889) 1 year ago
  Logan Adams d038dbd268 Fix Slurm launcher user args (#2806) 1 year ago
  Logan Adams 4af1f76a99 Add user defined launcher args for PDSH launcher (#2804) 1 year ago
  Ma, Guokai 98cc35b6a8 Abstract accelerator (step 3) (#2677) 1 year ago
  Dashiell Stander 3db0b5e2de Add SLURM Multinode Runner (#2404) 2 years ago
  Arpan Jain 1ed5aa96a8 Elastic Training support in DeepSpeed (#2153) (#2156) 2 years ago
  Alex Hedges 316c4a43e0 Add flake8 to pre-commit checks (#2051) 2 years ago
  Jerry Mannil d0eae5ad7a Propagate max errorcode to deepspeed when using PDSH launcher (#1994) 2 years ago
  Michael Wyatt 3678ee1778 [bug] Add user-defined launcher args for MPI launcher (#1933) 2 years ago
  Shuai Zheng 4575b2b792 fix launcher for reading env vars (#1907) 2 years ago
  Jeff Rasley 9351266f78 Multi-node save pid support + allow sparse-attn extra (#1728) 2 years ago
  liamcli fead387f78 support module and no python args for launcher (#1690) 2 years ago
  Jeff Rasley a10e4811fe force set lf instead of crlf (https://github.com/pre-commit/pre-commit-hooks#mixed-line-ending) (#1598) 2 years ago
  Chunyang Wen 93c71831c7 fstr for multnode_runner (#1532) 2 years ago
  Ammar Ahmad Awan 01726ce2b8 Add 1-bit Adam support to DeepSpeed (#380) 4 years ago