Yizhou Wang 66fb92413c launcher/launcher_helper.py: fix PMI name and add EnvironmentError (#5025) 8 月之前
..
autotuning c052545122 DS #4993 #662 : autotune single node hostfile bugfix (#4996) 8 月之前
checkpoint 8998707a2f Universal Checkpoint for Sequence Parallelism (#4752) 10 月之前
comm d8d865f492 [Fix] Fix cpu inference UT failure (#4430) 9 月之前
compression 389bf69319 fix: Remove duplicate word the (#4051) 1 年之前
elasticity d2e9adce39 Fix error report of DSElasticAgent._set_master_addr_port() (#4985) 9 月之前
inference 1d35db76a0 Refactor the Qwen positional emebdding config code (#4955) 9 月之前
launcher 66fb92413c launcher/launcher_helper.py: fix PMI name and add EnvironmentError (#5025) 8 月之前
model_implementations d5a7c1e0b4 Capture short kernel sequences to graph (#4318) 10 月之前
module_inject 62afafe812 Update falcon fused type order (#5007) 9 月之前
moe 2afa1c7f2f Communication Optimization for Large-Scale Training (#4695) 11 月之前
monitor 604d701e35 Introduce pydantic_v1 compatibility module for pydantic>=2.0.0 support (#4407) 1 年之前
nebula cd4e473ee6 fix typo with deepspeed/ (#3547) 1 年之前
ops fd0a52c1ac use all_gather_into_tensor instead of all_gather (#4705) 10 月之前
pipe b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 年之前
profiling 61391229c9 Update flops profiler to recurse (#4374) 11 月之前
runtime e81369318e [minor] improve code quality and readablilty (#5011) 9 月之前
sequence 2afa1c7f2f Communication Optimization for Large-Scale Training (#4695) 11 月之前
utils 9500ab7d47 [minor] Improve logging for multiprocesses (#5004) 8 月之前
__init__.py 538ffb4b60 Delete unused --deepspeed_mpi command line argument (#4981) 9 月之前
accelerator 9548d48f48 Abstract accelerator (step 2) (#2560) 1 年之前
constants.py 706a72562a Allow env var for timeout (#4405) 1 年之前
env_report.py c1ba6a104f [CANN] Support cpu offload optimizer for Ascend NPU (#4568) 11 月之前
git_version_info.py 57a27b0803 add type checker ignore to resolve that pylance can't resolved noqa annotation (#4102) 1 年之前
pydantic_v1.py 604d701e35 Introduce pydantic_v1 compatibility module for pydantic>=2.0.0 support (#4407) 1 年之前