Masahiro Tanaka b421e8c8f3 Disable nvtx decorator to avoid graph break (#5697) 3 months ago
activation_checkpointing 6e2899fbc6 WA for Torch-compile-Z3-act-apt accuracy issue from the Pytorch repo (#5590) 4 months ago
checkpoint_engine c5edc91ecb change partititon_name to partition_name (#3700) 1 year ago
comm 11a62a0635 Add Compressedbackend for Onebit optimizers (#5473) 4 months ago
compression b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 year ago
data_pipeline 64defe65b7 Parallel map step for `DistributedDataAnalyzer` map-reduce (#5291) 6 months ago
fp16 11a62a0635 Add Compressedbackend for Onebit optimizers (#5473) 4 months ago
pipe 05cb79db8e _exec_forward_pass: place zeros(1) on the same device as the param (#5576) 4 months ago
swap_tensor d058d4b39b Nvme offload checkpoint (#4707) 9 months ago
zero d2b1d7fc08 Universal checkpoint for zero stage 3 (#5475) 3 months ago
__init__.py c56a4b9e0d Improve universal checkpoint (#5289) 6 months ago
base_optimizer.py c56a4b9e0d Improve universal checkpoint (#5289) 6 months ago
bf16_optimizer.py d2b1d7fc08 Universal checkpoint for zero stage 3 (#5475) 3 months ago
compiler.py 2a0c0e3c27 Remove compile wrapper to simplify access to model attributes (#5581) 4 months ago
config.py 2a0c0e3c27 Remove compile wrapper to simplify access to model attributes (#5581) 4 months ago
config_utils.py 604d701e35 Introduce pydantic_v1 compatibility module for pydantic>=2.0.0 support (#4407) 1 year ago
constants.py 3c0bd31288 BF16 optimizer: Improve device utilization by immediate grad update (#4975) 8 months ago
dataloader.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 year ago
eigenvalue.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 year ago
engine.py b421e8c8f3 Disable nvtx decorator to avoid graph break (#5697) 3 months ago
hybrid_engine.py 5f41bd06dd Fix Hybrid Engine metrics printing (#4789) 10 months ago
lr_schedules.py ce0ebdade2 [Bug fix] WarmupCosineLR issues (#4688) 11 months ago
progressive_layer_drop.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 year ago
quantize.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 year ago
sparse_tensor.py c84c28d23b Support cpu tensors without direct device invocation (#3842) 9 months ago
state_dict_factory.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 year ago
utils.py 0fc19b6a32 Fix crash when creating Torch tensor on NPU with device=get_accelerator().current_device() (#5464) 5 months ago
weight_quantizer.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 year ago