Masahiro Tanaka 19e0dc39ba Delay reduce-scatter for ZeRO3 leaf modules (#5008) 8 月之前
..
__init__.py 19e0dc39ba Delay reduce-scatter for ZeRO3 leaf modules (#5008) 8 月之前
comms_logging.py 0b507253e5 fix comm logging for inference (#4043) 1 年之前
debug.py 40342055ce Remove hooks on gradient accumulation on engine/optimizer destroy (#4858) 9 月之前
exceptions.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 年之前
groups.py 9500ab7d47 [minor] Improve logging for multiprocesses (#5004) 9 月之前
init_on_device.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 年之前
logging.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 年之前
mixed_precision_linkage.py a23cda6c3b Allow modification of zero partitioned parameters (#4192) 1 年之前
numa.py 5dadf68771 support HBM in utils/numa.py (#3918) 1 年之前
nvtx.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 年之前
tensor_fragment.py 0ec2d3e4bf Add get and set APIs for the ZeRO-3 partitioned parameters (#4681) 11 月之前
timer.py b354c28b76 polishing timers and log_dist (#3996) 1 年之前
types.py 0a61d5d664 Hybrid Engine Refactor and Llama Inference Support (#3425) 1 年之前
z3_leaf_module.py 19e0dc39ba Delay reduce-scatter for ZeRO3 leaf modules (#5008) 8 月之前
zero_to_fp32.py 691458f8b6 zero_to_fp32.py: Handle a case where shape doesn't have numel attr (#4842) 9 月之前