.. |
activation-checkpointing.rst
|
f2ac7eafd5
ZeRO-2 (#217)
|
4 年之前 |
autotuning.rst
|
389bf69319
fix: Remove duplicate word the (#4051)
|
1 年之前 |
conf.py
|
b361c72761
Update DeepSpeed copyright license to Apache 2.0 (#3111)
|
1 年之前 |
flops-profiler.rst
|
e0f36ed5a1
Add optimizers and schedules to RTD and updated the corresponding part in the website (#799)
|
3 年之前 |
index.rst
|
d923f7c895
Refactor/Pydantify monitoring config (#2640)
|
1 年之前 |
inference-engine.rst
|
0449cbd36d
formatting fix
|
3 年之前 |
inference-init.rst
|
e4b3b610ba
Refactor DS inference API. No longer need replace_method. (#2831)
|
1 年之前 |
initialize.rst
|
be789b1665
Fix many typos (#1423)
|
3 年之前 |
kernel.rst
|
734d8991c8
Transformer kernel release (#242)
|
4 年之前 |
memory.rst
|
99fde3b7a5
[memory estimators] new config args sync (#2431)
|
2 年之前 |
model-checkpointing.rst
|
5c3ebd7ede
Clone tensors to avoid torch.save bloat (#3348)
|
1 年之前 |
moe.rst
|
c0af6d90f7
Refactor MoE and Groups API to simplify model creation and mangement (#1798)
|
2 年之前 |
monitor.rst
|
389bf69319
fix: Remove duplicate word the (#4051)
|
1 年之前 |
optimizers.rst
|
b80e5624e2
01 adam optimizer (#1790)
|
2 年之前 |
pipeline.rst
|
fafc827d64
Render docs for pipe.ProcessTopology (#1505)
|
2 年之前 |
schedulers.rst
|
a10e4811fe
force set lf instead of crlf (https://github.com/pre-commit/pre-commit-hooks#mixed-line-ending) (#1598)
|
2 年之前 |
training.rst
|
4912e0ad7e
Various ZeRO Stage3 Optimizations + Improvements (including bfloat16 support) (#1453)
|
2 年之前 |
zero3.rst
|
a23cda6c3b
Allow modification of zero partitioned parameters (#4192)
|
1 年之前 |