activation_checkpointing
|
6e2899fbc6
WA for Torch-compile-Z3-act-apt accuracy issue from the Pytorch repo (#5590)
|
4 months ago |
checkpoint_engine
|
c5edc91ecb
change partititon_name to partition_name (#3700)
|
1 year ago |
comm
|
11a62a0635
Add Compressedbackend for Onebit optimizers (#5473)
|
4 months ago |
compression
|
b361c72761
Update DeepSpeed copyright license to Apache 2.0 (#3111)
|
1 year ago |
data_pipeline
|
64defe65b7
Parallel map step for `DistributedDataAnalyzer` map-reduce (#5291)
|
6 months ago |
fp16
|
11a62a0635
Add Compressedbackend for Onebit optimizers (#5473)
|
4 months ago |
pipe
|
05cb79db8e
_exec_forward_pass: place zeros(1) on the same device as the param (#5576)
|
4 months ago |
swap_tensor
|
d058d4b39b
Nvme offload checkpoint (#4707)
|
9 months ago |
zero
|
d2b1d7fc08
Universal checkpoint for zero stage 3 (#5475)
|
3 months ago |
__init__.py
|
c56a4b9e0d
Improve universal checkpoint (#5289)
|
6 months ago |
base_optimizer.py
|
c56a4b9e0d
Improve universal checkpoint (#5289)
|
6 months ago |
bf16_optimizer.py
|
d2b1d7fc08
Universal checkpoint for zero stage 3 (#5475)
|
3 months ago |
compiler.py
|
2a0c0e3c27
Remove compile wrapper to simplify access to model attributes (#5581)
|
4 months ago |
config.py
|
2a0c0e3c27
Remove compile wrapper to simplify access to model attributes (#5581)
|
4 months ago |
config_utils.py
|
604d701e35
Introduce pydantic_v1 compatibility module for pydantic>=2.0.0 support (#4407)
|
1 year ago |
constants.py
|
3c0bd31288
BF16 optimizer: Improve device utilization by immediate grad update (#4975)
|
8 months ago |
dataloader.py
|
b361c72761
Update DeepSpeed copyright license to Apache 2.0 (#3111)
|
1 year ago |
eigenvalue.py
|
b361c72761
Update DeepSpeed copyright license to Apache 2.0 (#3111)
|
1 year ago |
engine.py
|
b421e8c8f3
Disable nvtx decorator to avoid graph break (#5697)
|
3 months ago |
hybrid_engine.py
|
5f41bd06dd
Fix Hybrid Engine metrics printing (#4789)
|
10 months ago |
lr_schedules.py
|
ce0ebdade2
[Bug fix] WarmupCosineLR issues (#4688)
|
11 months ago |
progressive_layer_drop.py
|
b361c72761
Update DeepSpeed copyright license to Apache 2.0 (#3111)
|
1 year ago |
quantize.py
|
b361c72761
Update DeepSpeed copyright license to Apache 2.0 (#3111)
|
1 year ago |
sparse_tensor.py
|
c84c28d23b
Support cpu tensors without direct device invocation (#3842)
|
9 months ago |
state_dict_factory.py
|
b361c72761
Update DeepSpeed copyright license to Apache 2.0 (#3111)
|
1 year ago |
utils.py
|
0fc19b6a32
Fix crash when creating Torch tensor on NPU with device=get_accelerator().current_device() (#5464)
|
5 months ago |
weight_quantizer.py
|
b361c72761
Update DeepSpeed copyright license to Apache 2.0 (#3111)
|
1 year ago |