Ziyang 60bf78454c Fix incorrect assignment of self.quantized_nontrainable_weights (#4399) 1 year ago
..
activation_checkpointing 42c1e916f6 feat(activation_checkpointing): add `non_reentrant_checkpoint` to support inputs require no grad (#4118) 1 year ago
checkpoint_engine c5edc91ecb change partititon_name to partition_name (#3700) 1 year ago
comm f0463b4d1f Pass correct node size for ZeRO++ (#4085) 1 year ago
compression b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 year ago
data_pipeline 736bf1853b bug fix (#3609) 1 year ago
fp16 b354c28b76 polishing timers and log_dist (#3996) 1 year ago
pipe e20e4a9d02 clear redundant timers (#4308) 1 year ago
swap_tensor b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 year ago
zero 60bf78454c Fix incorrect assignment of self.quantized_nontrainable_weights (#4399) 1 year ago
__init__.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 year ago
bf16_optimizer.py 5a5340d03b remove UtilsBuilder load, use torch (un)flatten ops (#3728) 1 year ago
config.py aa4a7401f8 ZeRO-Inference refresh (#4197) 1 year ago
config_utils.py 4d27225f3e zero.Init() should pin params in GPU memory as requested (#2953) 1 year ago
constants.py 0411a9f871 Expose Consecutive Hysteresis to Users (#3553) 1 year ago
dataloader.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 year ago
eigenvalue.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 year ago
engine.py 9adc73ff65 Handle empty parameter groups (#4277) 1 year ago
hybrid_engine.py 43188ff077 Pass missing positional arguments in `DeepSpeedHybridEngine.generate()` (#4026) 1 year ago
lr_schedules.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 year ago
progressive_layer_drop.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 year ago
quantize.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 year ago
sparse_tensor.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 year ago
state_dict_factory.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 year ago
utils.py 7f3e82fe09 do allgather only in shared optimizer states groups (#4167) 1 year ago
weight_quantizer.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 year ago