Olatunji Ruwase a23cda6c3b Allow modification of zero partitioned parameters (#4192) 1 年之前
..
__init__.py 2e99f6edf6 [DRAFT] Tentative implementation of MiCS (#2964) 1 年之前
config.py 7711bdbbd2 MP ZeRO++ (#3954) 1 年之前
contiguous_memory_allocator.py 389bf69319 fix: Remove duplicate word the (#4051) 1 年之前
linear.py 42858a9891 save tensors in context of memory_efficient_linear (#3413) 1 年之前
mics.py 7711bdbbd2 MP ZeRO++ (#3954) 1 年之前
mics_utils.py 2e99f6edf6 [DRAFT] Tentative implementation of MiCS (#2964) 1 年之前
offload_config.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 年之前
parameter_offload.py 462def451e Enable hpz when running with torch.no_grad (#4232) 1 年之前
partition_parameters.py 57d629a17e Empty tensor size check (#4186) 1 年之前
partitioned_param_coordinator.py 7711bdbbd2 MP ZeRO++ (#3954) 1 年之前
partitioned_param_profiler.py d18aa2c79c ZeRO++ (#3784) 1 年之前
stage3.py a23cda6c3b Allow modification of zero partitioned parameters (#4192) 1 年之前
stage_1_and_2.py f69031909f Simplify Gradient Attribute Names (#4214) 1 年之前
test.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 年之前
tiling.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 年之前
utils.py d3550dc88a Adagrad support in ZeRO (#3401) 1 年之前