Zhewei Yao
|
0f4f2f982c
Adding DeepSpeed Compression Composer (#2105)
|
2 years ago |
Karim Foda
|
735406e536
fix import errors (#2026)
|
2 years ago |
Quentin Anthony
|
c87f6ee209
DeepSpeed Monitor Module (Master) (#2013)
|
2 years ago |
Ammar Ahmad Awan
|
36ad3119d5
DeepSpeed comm backend v1 (#1985)
|
2 years ago |
Olatunji Ruwase
|
56c5223868
bf16+pipeline parallelism (#1801)
|
2 years ago |
Yucheng Lu
|
b80e5624e2
01 adam optimizer (#1790)
|
2 years ago |
Jeff Rasley
|
8eef742f0c
bf16 is supported w. zero 1, fix assert (#1779)
|
2 years ago |
Stas Bekman
|
ed4bbe08d6
[config] fix assert message (#1734)
|
2 years ago |
Justin Chiu
|
4912e0ad7e
Various ZeRO Stage3 Optimizations + Improvements (including bfloat16 support) (#1453)
|
2 years ago |
Alex Hedges
|
fc2f378ece
Improve pre-commit hooks (#1602)
|
2 years ago |
Mikhail Druzhinin
|
d14baad940
allreduce_always_fp16 (#1487)
|
2 years ago |
Stas Bekman
|
bcf2bdde89
remove debug prints (#1585)
|
2 years ago |
Cheng Li
|
9caa74e577
Autotuning (#1554)
|
2 years ago |
Rana Ali Amjad
|
648f7bfa50
Bfloat16 zero2 (#1398)
|
3 years ago |
Wenhao Hu
|
8abdaee243
Add cpu adagrad (#1358)
|
3 years ago |
Alex Hedges
|
be789b1665
Fix many typos (#1423)
|
3 years ago |
Hari Prasad
|
c0b27fb019
Added drop_last to DeepSpeedDataLoader (#1321)
|
3 years ago |
Ammar Ahmad Awan
|
f28432441b
DeepSpeed MoE (#1310)
|
3 years ago |
Conglong Li
|
b2b34ae342
Curriculum learning (#1307)
|
3 years ago |
Stas Bekman
|
c697d7ae1c
fix config name (#1103)
|
3 years ago |
Reza Yazdani
|
ed3de0c21b
Quantization + inference release (#1091)
|
3 years ago |
Jeff Rasley
|
cfa63f5dad
ZeRO stage 1 refresh (#1042)
|
3 years ago |
Cheng Li
|
4544b7d2f1
Improve flops profiler functionality (#1065)
|
3 years ago |
Samyam Rajbhandari
|
dad26428e3
Samyamr/full precision for ZeRO Stage2 and Stage3 (#1004)
|
3 years ago |
Sean Naren
|
41ab660b5d
Refactor param_dict to config (#1008)
|
3 years ago |
Conglong Li
|
67a48aaa89
1-bit LAMB optimizer (#970)
|
3 years ago |
Jeff Rasley
|
0d4a54a04d
ZeRO-Infinity (#976)
|
3 years ago |
Stas Bekman
|
c87118b0c5
[config] turn exponential notation back on for config dump (#955)
|
3 years ago |
Jeff Rasley
|
dd03cff29f
set adamw_mode default true (follows FusedAdam and < 0.3.11 logic) (#844)
|
3 years ago |
Samyam Rajbhandari
|
599258f979
ZeRO 3 Offload (#834)
|
3 years ago |