Name | Commit | Message | Last updated
activation_checkpointing | 42c1e916f6 | feat(activation_checkpointing): add `non_reentrant_checkpoint` to support inputs require no grad (#4118) | 1 year ago
checkpoint_engine | c5edc91ecb | change partititon_name to partition_name (#3700) | 1 year ago
comm | f0463b4d1f | Pass correct node size for ZeRO++ (#4085) | 1 year ago
compression | b361c72761 | Update DeepSpeed copyright license to Apache 2.0 (#3111) | 1 year ago
data_pipeline | 736bf1853b | bug fix (#3609) | 1 year ago
fp16 | b354c28b76 | polishing timers and log_dist (#3996) | 1 year ago
pipe | c69bd1f7b7 | Fix pipline dataloader when batch elements contain tuple (#565) | 1 year ago
swap_tensor | b361c72761 | Update DeepSpeed copyright license to Apache 2.0 (#3111) | 1 year ago
zero | a23cda6c3b | Allow modification of zero partitioned parameters (#4192) | 1 year ago
__init__.py | b361c72761 | Update DeepSpeed copyright license to Apache 2.0 (#3111) | 1 year ago
bf16_optimizer.py | 5a5340d03b | remove UtilsBuilder load, use torch (un)flatten ops (#3728) | 1 year ago
config.py | 9647ea791d | Add MuP optimizers (#2043) | 1 year ago
config_utils.py | 4d27225f3e | zero.Init() should pin params in GPU memory as requested (#2953) | 1 year ago
constants.py | 0411a9f871 | Expose Consecutive Hysteresis to Users (#3553) | 1 year ago
dataloader.py | b361c72761 | Update DeepSpeed copyright license to Apache 2.0 (#3111) | 1 year ago
eigenvalue.py | b361c72761 | Update DeepSpeed copyright license to Apache 2.0 (#3111) | 1 year ago
engine.py | 9647ea791d | Add MuP optimizers (#2043) | 1 year ago
hybrid_engine.py | 43188ff077 | Pass missing positional arguments in `DeepSpeedHybridEngine.generate()` (#4026) | 1 year ago
lr_schedules.py | b361c72761 | Update DeepSpeed copyright license to Apache 2.0 (#3111) | 1 year ago
progressive_layer_drop.py | b361c72761 | Update DeepSpeed copyright license to Apache 2.0 (#3111) | 1 year ago
quantize.py | b361c72761 | Update DeepSpeed copyright license to Apache 2.0 (#3111) | 1 year ago
sparse_tensor.py | b361c72761 | Update DeepSpeed copyright license to Apache 2.0 (#3111) | 1 year ago
state_dict_factory.py | b361c72761 | Update DeepSpeed copyright license to Apache 2.0 (#3111) | 1 year ago
utils.py | 7f3e82fe09 | do allgather only in shared optimizer states groups (#4167) | 1 year ago
weight_quantizer.py | b361c72761 | Update DeepSpeed copyright license to Apache 2.0 (#3111) | 1 year ago