Olatunji Ruwase 60ae3e0e53 Merge branch 'master' into jomayeri/aio-locked-tensor 1 week ago
..
accelerator b20c46745b add missing methods to MPS_Accelerator (#5134) 8 months ago
autotuning b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 year ago
checkpoint 774b897736 fix the missing argument in test and typo (#5730) 3 months ago
comm 19da95f783 [CPU] add fp16 support to shm inference_all_reduce (#5669) 3 months ago
compression 6dcced1d5c Cleanup required_torch_version code and references. (#5370) 6 months ago
elasticity 8e4f6e48db Skip the UT cases that use unimplemented op builders. (#5372) 5 months ago
hybrid_engine f69f8840fc Removal of cuda hardcoded string with get_device function (#5351) 6 months ago
inference 89c4d9f5a7 TestLowCpuMemUsage UT get device by device_name (#6397) 1 month ago
launcher c08e69f212 Make op builder detection adapt to accelerator change (#5206) 7 months ago
linear 6e5d58d248 OptimizedLinear updates (#5791) 2 months ago
model_parallelism 6dcced1d5c Cleanup required_torch_version code and references. (#5370) 6 months ago
moe 9a3ede7079 add moe topk(k>2) gate support (#5881) 2 months ago
monitor 488a823f64 New integration - CometMonitor (#5466) 5 months ago
ops 60ae3e0e53 Merge branch 'master' into jomayeri/aio-locked-tensor 1 week ago
pipe 7ddc3b01dd Fix pipeline module evaluation when contiguous activation checkpointing is enabled (#3005) 1 year ago
profiling 6dcced1d5c Cleanup required_torch_version code and references. (#5370) 6 months ago
runtime 90e25da4eb Skip fp16 tests on CPU 1 week ago
sequence_parallelism 8b191d7ccf Long sequence parallelism (Ulysses) integration with HuggingFace (#5774) 2 months ago
utils 08e0733e4a Support MoE for pipeline models (#5338) 6 months ago
__init__.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 year ago
alexnet_model.py 9bc4cd01b7 Store/Load CIFAR from local/offline (#6390) 1 month ago
common.py 659f6be105 Avoid security issues of subprocess shell (#6498) 1 month ago
ds_batch_config.json ff42743865 Refactor remaining distributed tests (#2216) 2 years ago
gpt2-merges.txt ff42743865 Refactor remaining distributed tests (#2216) 2 years ago
gpt2-vocab.json ff42743865 Refactor remaining distributed tests (#2216) 2 years ago
megatron_model.py 4b35833379 Revert "Update megatron GPT2Model" 1 year ago
modeling.py 180dd39714 Clean up modeling code (#4320) 1 year ago
modelingpreln.py 180dd39714 Clean up modeling code (#4320) 1 year ago
multi_output_model.py c08e69f212 Make op builder detection adapt to accelerator change (#5206) 7 months ago
simple_model.py c08e69f212 Make op builder detection adapt to accelerator change (#5206) 7 months ago
util.py 1ab1928d79 Enable dynamic shapes for pipeline parallel engine inputs (#5481) 2 months ago