Zhen Zhang d7b764e3d8 Unit tests for MiCS (#4792) 9 月之前
..
activation_checkpointing 9e455d7651 Checkpointing: Avoid assigning tensor storage with different device (#4836) 10 月之前
checkpoint_engine c5edc91ecb change partititon_name to partition_name (#3700) 1 年之前
comm 2ce6bf8ce0 [NPU] Add HcclBackend for 1-bit adam, 1-bit lamb, 0/1 adam (#4733) 10 月之前
compression b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 年之前
data_pipeline 736bf1853b bug fix (#3609) 1 年之前
fp16 2ce6bf8ce0 [NPU] Add HcclBackend for 1-bit adam, 1-bit lamb, 0/1 adam (#4733) 10 月之前
pipe ac84cf3ff1 Pipeline: Add support to eval micro bs configuration (#4859) 9 月之前
swap_tensor d058d4b39b Nvme offload checkpoint (#4707) 9 月之前
zero d7b764e3d8 Unit tests for MiCS (#4792) 9 月之前
__init__.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 年之前
bf16_optimizer.py d5a7c1e0b4 Capture short kernel sequences to graph (#4318) 10 月之前
config.py d5a7c1e0b4 Capture short kernel sequences to graph (#4318) 10 月之前
config_utils.py 604d701e35 Introduce pydantic_v1 compatibility module for pydantic>=2.0.0 support (#4407) 1 年之前
constants.py d5a7c1e0b4 Capture short kernel sequences to graph (#4318) 10 月之前
dataloader.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 年之前
eigenvalue.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 年之前
engine.py d058d4b39b Nvme offload checkpoint (#4707) 9 月之前
hybrid_engine.py 5f41bd06dd Fix Hybrid Engine metrics printing (#4789) 10 月之前
lr_schedules.py ce0ebdade2 [Bug fix] WarmupCosineLR issues (#4688) 11 月之前
progressive_layer_drop.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 年之前
quantize.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 年之前
sparse_tensor.py c84c28d23b Support cpu tensors without direct device invocation (#3842) 9 月之前
state_dict_factory.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 年之前
utils.py d5a7c1e0b4 Capture short kernel sequences to graph (#4318) 10 月之前
weight_quantizer.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 年之前