Logan Adams 5ce448d326 Switch hasattr to check for compiler and not compile since compile was introduced in torch 2.0 but compiler was introduced in torch 2.1, this fixes issues for those building with torch 2.0 8 months ago
activation_checkpointing 9e455d7651 Checkpointing: Avoid assigning tensor storage with different device (#4836) 10 months ago
checkpoint_engine c5edc91ecb change partititon_name to partition_name (#3700) 1 year ago
comm 592325abde [Zero++ qgZ] Fall back to reduce_scatter if `tensor.numel() % (2 * global_world_size) != 0` (#5056) 8 months ago
compression b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 year ago
data_pipeline 736bf1853b bug fix (#3609) 1 year ago
fp16 2ce6bf8ce0 [NPU] Add HcclBackend for 1-bit adam, 1-bit lamb, 0/1 adam (#4733) 10 months ago
pipe ac84cf3ff1 Pipeline: Add support to eval micro bs configuration (#4859) 9 months ago
swap_tensor d058d4b39b Nvme offload checkpoint (#4707) 9 months ago
zero 4f477328c4 [NPU] replace 'cuda' with get_accelerator().device_name() (#5095) 8 months ago
__init__.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 year ago
bf16_optimizer.py d5a7c1e0b4 Capture short kernel sequences to graph (#4318) 10 months ago
compiler.py 5ce448d326 Switch hasattr to check for compiler and not compile since compile was introduced in torch 2.0 but compiler was introduced in torch 2.1, this fixes issues for those building with torch 2.0 8 months ago
config.py c3cfe96bb3 Enable torch.compile with ZeRO (Experimental) (#4878) 8 months ago
config_utils.py 604d701e35 Introduce pydantic_v1 compatibility module for pydantic>=2.0.0 support (#4407) 1 year ago
constants.py d5a7c1e0b4 Capture short kernel sequences to graph (#4318) 10 months ago
dataloader.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 year ago
eigenvalue.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 year ago
engine.py c3cfe96bb3 Enable torch.compile with ZeRO (Experimental) (#4878) 8 months ago
hybrid_engine.py 5f41bd06dd Fix Hybrid Engine metrics printing (#4789) 10 months ago
lr_schedules.py ce0ebdade2 [Bug fix] WarmupCosineLR issues (#4688) 11 months ago
progressive_layer_drop.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 year ago
quantize.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 year ago
sparse_tensor.py c84c28d23b Support cpu tensors without direct device invocation (#3842) 9 months ago
state_dict_factory.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 year ago
utils.py 961bc85624 optimize clip_grad_norm_ function (#4915) 8 months ago
weight_quantizer.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 year ago