Zhen Zhang d7b764e3d8 Unit tests for MiCS (#4792) 9 months ago
__init__.py 2e99f6edf6 [DRAFT] Tentative implementation of MiCS (#2964) 1 year ago
config.py 2afa1c7f2f Communication Optimization for Large-Scale Training (#4695) 11 months ago
contiguous_memory_allocator.py 9ec55bd99b Fix f-string messages (#4865) 9 months ago
linear.py 28b9d5c231 Add condition when dimension is greater than 2 (#4390) 1 year ago
mics.py d7b764e3d8 Unit tests for MiCS (#4792) 9 months ago
mics_utils.py 2e99f6edf6 [DRAFT] Tentative implementation of MiCS (#2964) 1 year ago
offload_config.py b1cb0dfc46 Guanhua/partial offload rebase v2 (#590) (#4636) 11 months ago
parameter_offload.py 18a04d04a5 Use clearer naming (#2548) 1 year ago
partition_parameters.py 81cc32075c Partition parameters: Minor refactoring of use_secondary_tensor condition (#4868) 9 months ago
partitioned_param_coordinator.py 7711bdbbd2 MP ZeRO++ (#3954) 1 year ago
partitioned_param_profiler.py d18aa2c79c ZeRO++ (#3784) 1 year ago
stage3.py d058d4b39b Nvme offload checkpoint (#4707) 9 months ago
stage_1_and_2.py 9ec55bd99b Fix f-string messages (#4865) 9 months ago
test.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 year ago
tiling.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 year ago
utils.py 8e64c3b550 feat: add Lion optimizer (#4331) 1 year ago