accelerator | b20c46745b add missing methods to MPS_Accelerator (#5134) | 8 months ago
autotuning | b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) | 1 year ago
checkpoint | 774b897736 fix the missing argument in test and typo (#5730) | 3 months ago
comm | 19da95f783 [CPU] add fp16 support to shm inference_all_reduce (#5669) | 3 months ago
compression | 6dcced1d5c Cleanup required_torch_version code and references. (#5370) | 6 months ago
elasticity | 8e4f6e48db Skip the UT cases that use unimplemented op builders. (#5372) | 5 months ago
hybrid_engine | f69f8840fc Removal of cuda hardcoded string with get_device function (#5351) | 6 months ago
inference | 1a45bd8e8c Lock cache file of HF model list (#6628) | 6 days ago
launcher | 13c16c9562 Accept btl_tcp_if_include option through launcher_args (#6613) | 1 week ago
linear | 6e5d58d248 OptimizedLinear updates (#5791) | 2 months ago
model_parallelism | 6dcced1d5c Cleanup required_torch_version code and references. (#5370) | 6 months ago
moe | 9a3ede7079 add moe topk(k>2) gate support (#5881) | 2 months ago
monitor | 488a823f64 New integration - CometMonitor (#5466) | 5 months ago
ops | a1f98bdc70 AIO CPU Locked Tensor (#6592) | 1 week ago
pipe | 7ddc3b01dd Fix pipeline module evaluation when contiguous activation checkpointing is enabled (#3005) | 1 year ago
profiling | 6dcced1d5c Cleanup required_torch_version code and references. (#5370) | 6 months ago
runtime | 85b7469ea0 Add first Step in LR Schedulers (#6597) | 1 week ago
sequence_parallelism | 8b191d7ccf Long sequence parallelism (Ulysses) integration with HuggingFace (#5774) | 2 months ago
utils | 08e0733e4a Support MoE for pipeline models (#5338) | 6 months ago
__init__.py | b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) | 1 year ago
alexnet_model.py | 9bc4cd01b7 Store/Load CIFAR from local/offline (#6390) | 1 month ago
common.py | c9fc34a4be Use file store for tests (#6632) | 4 days ago
ds_batch_config.json | ff42743865 Refactor remaining distributed tests (#2216) | 2 years ago
gpt2-merges.txt | ff42743865 Refactor remaining distributed tests (#2216) | 2 years ago
gpt2-vocab.json | ff42743865 Refactor remaining distributed tests (#2216) | 2 years ago
megatron_model.py | 4b35833379 Revert "Update megatron GPT2Model" | 1 year ago
modeling.py | 180dd39714 Clean up modeling code (#4320) | 1 year ago
modelingpreln.py | 180dd39714 Clean up modeling code (#4320) | 1 year ago
multi_output_model.py | c08e69f212 Make op builder detection adapt to accelerator change (#5206) | 7 months ago
simple_model.py | c08e69f212 Make op builder detection adapt to accelerator change (#5206) | 7 months ago
util.py | 1ab1928d79 Enable dynamic shapes for pipeline parallel engine inputs (#5481) | 2 months ago