.. |
activation_checkpointing
|
9e455d7651
Checkpointing: Avoid assigning tensor storage with different device (#4836)
|
10 月之前 |
checkpoint_engine
|
c5edc91ecb
change partititon_name to partition_name (#3700)
|
1 年之前 |
comm
|
2ce6bf8ce0
[NPU] Add HcclBackend for 1-bit adam, 1-bit lamb, 0/1 adam (#4733)
|
10 月之前 |
compression
|
b361c72761
Update DeepSpeed copyright license to Apache 2.0 (#3111)
|
1 年之前 |
data_pipeline
|
736bf1853b
bug fix (#3609)
|
1 年之前 |
fp16
|
2ce6bf8ce0
[NPU] Add HcclBackend for 1-bit adam, 1-bit lamb, 0/1 adam (#4733)
|
10 月之前 |
pipe
|
ac84cf3ff1
Pipeline: Add support to eval micro bs configuration (#4859)
|
9 月之前 |
swap_tensor
|
d058d4b39b
Nvme offload checkpoint (#4707)
|
9 月之前 |
zero
|
d7b764e3d8
Unit tests for MiCS (#4792)
|
9 月之前 |
__init__.py
|
b361c72761
Update DeepSpeed copyright license to Apache 2.0 (#3111)
|
1 年之前 |
bf16_optimizer.py
|
d5a7c1e0b4
Capture short kernel sequences to graph (#4318)
|
10 月之前 |
config.py
|
d5a7c1e0b4
Capture short kernel sequences to graph (#4318)
|
10 月之前 |
config_utils.py
|
604d701e35
Introduce pydantic_v1 compatibility module for pydantic>=2.0.0 support (#4407)
|
1 年之前 |
constants.py
|
d5a7c1e0b4
Capture short kernel sequences to graph (#4318)
|
10 月之前 |
dataloader.py
|
b361c72761
Update DeepSpeed copyright license to Apache 2.0 (#3111)
|
1 年之前 |
eigenvalue.py
|
b361c72761
Update DeepSpeed copyright license to Apache 2.0 (#3111)
|
1 年之前 |
engine.py
|
d058d4b39b
Nvme offload checkpoint (#4707)
|
9 月之前 |
hybrid_engine.py
|
5f41bd06dd
Fix Hybrid Engine metrics printing (#4789)
|
10 月之前 |
lr_schedules.py
|
ce0ebdade2
[Bug fix] WarmupCosineLR issues (#4688)
|
11 月之前 |
progressive_layer_drop.py
|
b361c72761
Update DeepSpeed copyright license to Apache 2.0 (#3111)
|
1 年之前 |
quantize.py
|
b361c72761
Update DeepSpeed copyright license to Apache 2.0 (#3111)
|
1 年之前 |
sparse_tensor.py
|
c84c28d23b
Support cpu tensors without direct device invocation (#3842)
|
9 月之前 |
state_dict_factory.py
|
b361c72761
Update DeepSpeed copyright license to Apache 2.0 (#3111)
|
1 年之前 |
utils.py
|
d5a7c1e0b4
Capture short kernel sequences to graph (#4318)
|
10 月之前 |
weight_quantizer.py
|
b361c72761
Update DeepSpeed copyright license to Apache 2.0 (#3111)
|
1 年之前 |