Name | Commit | Last commit message | Updated
activation_checkpointing | da84e60d98 | add missing license info to top of all source code (#2889) | 1 year ago
checkpoint_engine | e355863b83 | Update torch_checkpoint_engine.py (#3019) | 1 year ago
comm | da84e60d98 | add missing license info to top of all source code (#2889) | 1 year ago
compression | da84e60d98 | add missing license info to top of all source code (#2889) | 1 year ago
data_pipeline | da84e60d98 | add missing license info to top of all source code (#2889) | 1 year ago
fp16 | ac2c9ffae4 | Improve loss overflow logs (#3008) | 1 year ago
pipe | b528f50e3d | Fix buffer size for pipeline parallel and communication schedule (#2862) | 1 year ago
swap_tensor | 98cc35b6a8 | Abstract accelerator (step 3) (#2677) | 1 year ago
zero | ac2c9ffae4 | Improve loss overflow logs (#3008) | 1 year ago
__init__.py | da84e60d98 | add missing license info to top of all source code (#2889) | 1 year ago
bf16_optimizer.py | 541e423ae6 | Enable tensor fragments for zero 2 & 3 (#2727) | 1 year ago
config.py | 457850dc5a | [zero] prevent poor configs from running w. zero-offload (#2971) | 1 year ago
config_utils.py | da84e60d98 | add missing license info to top of all source code (#2889) | 1 year ago
constants.py | 457850dc5a | [zero] prevent poor configs from running w. zero-offload (#2971) | 1 year ago
dataloader.py | 7c99def0f0 | Data efficiency library update (#2866) | 1 year ago
eigenvalue.py | da84e60d98 | add missing license info to top of all source code (#2889) | 1 year ago
engine.py | 94f7da26b6 | Convert model parameters from generator to list. (#3017) | 1 year ago
lr_schedules.py | 316c4a43e0 | Add flake8 to pre-commit checks (#2051) | 2 years ago
progressive_layer_drop.py | da84e60d98 | add missing license info to top of all source code (#2889) | 1 year ago
quantize.py | da84e60d98 | add missing license info to top of all source code (#2889) | 1 year ago
sparse_tensor.py | fcb3ca5e66 | Proposal of how we might use sparse tensors for gradients (#1484) | 3 years ago
state_dict_factory.py | ccb8eb81fb | Add checkpoint sharding unit tests (#2561) | 1 year ago
utils.py | d3de737550 | Remove deprecated `torch._six` imports (#2863) | 1 year ago
weight_quantizer.py | da84e60d98 | add missing license info to top of all source code (#2889) | 1 year ago