.. |
autotuning
|
b361c72761
Update DeepSpeed copyright license to Apache 2.0 (#3111)
|
1 year ago |
checkpoint
|
e801e6d718
skipping redundant MoE optimizer state loading (#4120)
|
1 year ago |
comm
|
1bc3b78423
[CPU] Use allreduce_low_latency for AutoTP and implement low latency allreduce for CPU backend (single node) (#3919)
|
1 year ago |
compression
|
9bf77782b2
Fix a bug in the implementation of dequantization for inference (#3433)
|
1 year ago |
elasticity
|
7290aace9b
[CPU] Skip CPU support unimplemented error (#3633)
|
1 year ago |
hybrid_engine
|
7290aace9b
[CPU] Skip CPU support unimplemented error (#3633)
|
1 year ago |
inference
|
8c7f7fd2fd
Fix skipped inference tests (#4336)
|
1 year ago |
launcher
|
8145b5e41f
added port argument for ssh (#4117)
|
1 year ago |
model_parallelism
|
6b2365e4fa
Re-enable elastic training for torch 2+ (#4010)
|
1 year ago |
moe
|
6b2365e4fa
Re-enable elastic training for torch 2+ (#4010)
|
1 year ago |
monitor
|
b361c72761
Update DeepSpeed copyright license to Apache 2.0 (#3111)
|
1 year ago |
ops
|
f876d81d34
DeepSpeed4Science (#4357)
|
1 year ago |
pipe
|
7ddc3b01dd
Fix pipeline module evaluation when contiguous activation checkpointing is enabled (#3005)
|
1 year ago |
profiling
|
6b2365e4fa
Re-enable elastic training for torch 2+ (#4010)
|
1 year ago |
runtime
|
9adc73ff65
Handle empty parameter groups (#4277)
|
1 year ago |
utils
|
b361c72761
Update DeepSpeed copyright license to Apache 2.0 (#3111)
|
1 year ago |
__init__.py
|
b361c72761
Update DeepSpeed copyright license to Apache 2.0 (#3111)
|
1 year ago |
alexnet_model.py
|
aef6c65ce3
Reduce Unit Test Times (Part 3) (#3850)
|
1 year ago |
common.py
|
d9a889d559
Fix nv-nightly workflow (#4163)
|
1 year ago |
ds_batch_config.json
|
ff42743865
Refactor remaining distributed tests (#2216)
|
2 years ago |
gpt2-merges.txt
|
ff42743865
Refactor remaining distributed tests (#2216)
|
2 years ago |
gpt2-vocab.json
|
ff42743865
Refactor remaining distributed tests (#2216)
|
2 years ago |
megatron_model.py
|
4b35833379
Revert "Update megatron GPT2Model"
|
1 year ago |
modeling.py
|
180dd39714
Clean up modeling code (#4320)
|
1 year ago |
modelingpreln.py
|
180dd39714
Clean up modeling code (#4320)
|
1 year ago |
multi_output_model.py
|
b361c72761
Update DeepSpeed copyright license to Apache 2.0 (#3111)
|
1 year ago |
simple_model.py
|
2ded2ff0be
checking process_group before merging bucket ranges (#3521) (#3577)
|
1 year ago |
util.py
|
7b850d3d04
Re-enable skipped unit tests (#3939)
|
1 year ago |