Quentin Anthony ac2c9ffae4 Improve loss overflow logs (#3008) 1 年之前
..
activation_checkpointing da84e60d98 add missing license info to top of all source code (#2889) 1 年之前
checkpoint_engine e355863b83 Update torch_checkpoint_engine.py (#3019) 1 年之前
comm da84e60d98 add missing license info to top of all source code (#2889) 1 年之前
compression da84e60d98 add missing license info to top of all source code (#2889) 1 年之前
data_pipeline da84e60d98 add missing license info to top of all source code (#2889) 1 年之前
fp16 ac2c9ffae4 Improve loss overflow logs (#3008) 1 年之前
pipe b528f50e3d Fix buffer size for pipeline parallel and communication schedule (#2862) 1 年之前
swap_tensor 98cc35b6a8 Abstract accelerator (step 3) (#2677) 1 年之前
zero ac2c9ffae4 Improve loss overflow logs (#3008) 1 年之前
__init__.py da84e60d98 add missing license info to top of all source code (#2889) 1 年之前
bf16_optimizer.py 541e423ae6 Enable tensor fragments for zero 2 & 3 (#2727) 1 年之前
config.py 457850dc5a [zero] prevent poor configs from running w. zero-offload (#2971) 1 年之前
config_utils.py da84e60d98 add missing license info to top of all source code (#2889) 1 年之前
constants.py 457850dc5a [zero] prevent poor configs from running w. zero-offload (#2971) 1 年之前
dataloader.py 7c99def0f0 Data efficiency library update (#2866) 1 年之前
eigenvalue.py da84e60d98 add missing license info to top of all source code (#2889) 1 年之前
engine.py 94f7da26b6 Convert model parameters from generator to list. (#3017) 1 年之前
lr_schedules.py 316c4a43e0 Add flake8 to pre-commit checks (#2051) 2 年之前
progressive_layer_drop.py da84e60d98 add missing license info to top of all source code (#2889) 1 年之前
quantize.py da84e60d98 add missing license info to top of all source code (#2889) 1 年之前
sparse_tensor.py fcb3ca5e66 Proposal of how we might use sparse tensors for gradients (#1484) 3 年之前
state_dict_factory.py ccb8eb81fb Add checkpoint sharding unit tests (#2561) 1 年之前
utils.py d3de737550 Remove deprecated `torch._six` imports (#2863) 1 年之前
weight_quantizer.py da84e60d98 add missing license info to top of all source code (#2889) 1 年之前