Carlos Mocholí
|
02e95e6ab4
Pin minimum `packaging` requirement (#2771)
|
1 year ago |
Quentin Anthony
|
c87f6ee209
DeepSpeed Monitor Module (Master) (#2013)
|
2 years ago |
Victor
|
64c2946a23
use py-cpuinfo to detect SIMD_WIDTH in platform-independent way (#1616)
|
2 years ago |
Alex Hedges
|
fc2f378ece
Improve pre-commit hooks (#1602)
|
2 years ago |
Jeff Rasley
|
a90497ecff
Remove hard tensorboardX requirement (#1571)
|
2 years ago |
Cheng Li
|
9caa74e577
Autotuning (#1554)
|
2 years ago |
Jeff Rasley
|
6996bb0159
Sparse attn triton v1.0 support + torch1.8 test runner (#1374)
|
3 years ago |
Jeff Rasley
|
3b68984498
remove torchvision dependency (#1178)
|
3 years ago |
Jeff Rasley
|
cfa63f5dad
ZeRO stage 1 refresh (#1042)
|
3 years ago |
Samyam Rajbhandari
|
599258f979
ZeRO 3 Offload (#834)
|
3 years ago |
Jeff Rasley
|
81aeea361d
Elastic training support (#602)
|
3 years ago |
Jeff Rasley
|
0dc8420042
Dependency pruning (#528)
|
3 years ago |
Jeff Rasley
|
31f46feee2
DeepSpeed JIT op + PyPI support (#496)
|
4 years ago |
Jeff Rasley
|
41db1c2f03
ZeRO-Offload release (#391)
|
4 years ago |
Jeff Rasley
|
e5bbc2e559
Sparse attn + ops/runtime refactor + v0.3.0 (#343)
|
4 years ago |