Conglong Li f876d81d34 DeepSpeed4Science (#4357) 1 year ago
..
autotuning b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 year ago
checkpoint e801e6d718 skipping redundant MoE optimizer state loading (#4120) 1 year ago
comm 1bc3b78423 [CPU] Use allreduce_low_latency for AutoTP and implement low latency allreduce for CPU backend (single node) (#3919) 1 year ago
compression 9bf77782b2 Fix a bug in the implementation of dequantization for inference (#3433) 1 year ago
elasticity 7290aace9b [CPU] Skip CPU support unimplemented error (#3633) 1 year ago
hybrid_engine 7290aace9b [CPU] Skip CPU support unimplemented error (#3633) 1 year ago
inference 8c7f7fd2fd Fix skipped inference tests (#4336) 1 year ago
launcher 8145b5e41f added port argument for ssh (#4117) 1 year ago
model_parallelism 6b2365e4fa Re-enable elastic training for torch 2+ (#4010) 1 year ago
moe 6b2365e4fa Re-enable elastic training for torch 2+ (#4010) 1 year ago
monitor b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 year ago
ops f876d81d34 DeepSpeed4Science (#4357) 1 year ago
pipe 7ddc3b01dd Fix pipeline module evaluation when contiguous activation checkpointing is enabled (#3005) 1 year ago
profiling 6b2365e4fa Re-enable elastic training for torch 2+ (#4010) 1 year ago
runtime 9adc73ff65 Handle empty parameter groups (#4277) 1 year ago
utils b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 year ago
__init__.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 year ago
alexnet_model.py aef6c65ce3 Reduce Unit Test Times (Part 3) (#3850) 1 year ago
common.py d9a889d559 Fix nv-nightly workflow (#4163) 1 year ago
ds_batch_config.json ff42743865 Refactor remaining distributed tests (#2216) 2 years ago
gpt2-merges.txt ff42743865 Refactor remaining distributed tests (#2216) 2 years ago
gpt2-vocab.json ff42743865 Refactor remaining distributed tests (#2216) 2 years ago
megatron_model.py 4b35833379 Revert "Update megatron GPT2Model" 1 year ago
modeling.py 180dd39714 Clean up modeling code (#4320) 1 year ago
modelingpreln.py 180dd39714 Clean up modeling code (#4320) 1 year ago
multi_output_model.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 year ago
simple_model.py 2ded2ff0be checking process_group before merging bucket ranges (#3521) (#3577) 1 year ago
util.py 7b850d3d04 Re-enable skipped unit tests (#3939) 1 year ago