Name | Commit | Last commit message | Last updated
activation_checkpointing | 6e2899fbc6 | WA for Torch-compile-Z3-act-apt accuracy issue from the Pytorch repo (#5590) | 4 months ago
checkpoint_engine | c5edc91ecb | change partititon_name to partition_name (#3700) | 1 year ago
comm | 11a62a0635 | Add Compressedbackend for Onebit optimizers (#5473) | 4 months ago
compression | b361c72761 | Update DeepSpeed copyright license to Apache 2.0 (#3111) | 1 year ago
data_pipeline | 64defe65b7 | Parallel map step for `DistributedDataAnalyzer` map-reduce (#5291) | 6 months ago
fp16 | 11a62a0635 | Add Compressedbackend for Onebit optimizers (#5473) | 4 months ago
pipe | 05cb79db8e | _exec_forward_pass: place zeros(1) on the same device as the param (#5576) | 4 months ago
swap_tensor | d058d4b39b | Nvme offload checkpoint (#4707) | 9 months ago
zero | d2b1d7fc08 | Universal checkpoint for zero stage 3 (#5475) | 3 months ago
__init__.py | c56a4b9e0d | Improve universal checkpoint (#5289) | 6 months ago
base_optimizer.py | c56a4b9e0d | Improve universal checkpoint (#5289) | 6 months ago
bf16_optimizer.py | d2b1d7fc08 | Universal checkpoint for zero stage 3 (#5475) | 3 months ago
compiler.py | 2a0c0e3c27 | Remove compile wrapper to simplify access to model attributes (#5581) | 4 months ago
config.py | 2a0c0e3c27 | Remove compile wrapper to simplify access to model attributes (#5581) | 4 months ago
config_utils.py | 604d701e35 | Introduce pydantic_v1 compatibility module for pydantic>=2.0.0 support (#4407) | 1 year ago
constants.py | 3c0bd31288 | BF16 optimizer: Improve device utilization by immediate grad update (#4975) | 8 months ago
dataloader.py | b361c72761 | Update DeepSpeed copyright license to Apache 2.0 (#3111) | 1 year ago
eigenvalue.py | b361c72761 | Update DeepSpeed copyright license to Apache 2.0 (#3111) | 1 year ago
engine.py | b421e8c8f3 | Disable nvtx decorator to avoid graph break (#5697) | 3 months ago
hybrid_engine.py | 5f41bd06dd | Fix Hybrid Engine metrics printing (#4789) | 10 months ago
lr_schedules.py | ce0ebdade2 | [Bug fix] WarmupCosineLR issues (#4688) | 11 months ago
progressive_layer_drop.py | b361c72761 | Update DeepSpeed copyright license to Apache 2.0 (#3111) | 1 year ago
quantize.py | b361c72761 | Update DeepSpeed copyright license to Apache 2.0 (#3111) | 1 year ago
sparse_tensor.py | c84c28d23b | Support cpu tensors without direct device invocation (#3842) | 9 months ago
state_dict_factory.py | b361c72761 | Update DeepSpeed copyright license to Apache 2.0 (#3111) | 1 year ago
utils.py | 0fc19b6a32 | Fix crash when creating Torch tensor on NPU with device=get_accelerator().current_device() (#5464) | 5 months ago
weight_quantizer.py | b361c72761 | Update DeepSpeed copyright license to Apache 2.0 (#3111) | 1 year ago