Olatunji Ruwase
|
977254c148
Disable z3 tracing profiler (#4106)
|
1 年之前 |
Ma, Guokai
|
0f5406323c
[CPU] FusedAdam and CPU training support (#3991)
|
1 年之前 |
Heyang Qin
|
f8551b439e
Fix racing condition in GatheredParameters (#3819)
|
1 年之前 |
Heyang Qin
|
d18aa2c79c
ZeRO++ (#3784)
|
1 年之前 |
hablb
|
0977106ac9
zero3 performance optimizations (#3622)
|
1 年之前 |
Heyang Qin
|
4716b0f769
share inflight registry between PartitionedParameterCoordinators (#3462)
|
1 年之前 |
Olatunji Ruwase
|
47f9f13bd3
DeepSpeed Chat (#3186)
|
1 年之前 |
Michael Wyatt
|
b361c72761
Update DeepSpeed copyright license to Apache 2.0 (#3111)
|
1 年之前 |
Jeff Rasley
|
91d63e0228
update formatter version and style settings (#3098)
|
1 年之前 |
Ma, Guokai
|
98cc35b6a8
Abstract accelerator (step 3) (#2677)
|
1 年之前 |
AGUL
|
aeda7f9f8c
Fix invalid check of recorded parameter orders in zero stage3. (#2550)
|
1 年之前 |
Olatunji Ruwase
|
2210ebe70f
Release swap buffers for persisted params (#2089)
|
2 年之前 |
Michael Wyatt
|
5997589683
Refactor ZeRO configs to use Pydantic (#2004)
|
2 年之前 |
Alex Hedges
|
316c4a43e0
Add flake8 to pre-commit checks (#2051)
|
2 年之前 |
Olatunji Ruwase
|
2a1a409644
Retain available params until last use (#2016)
|
2 年之前 |
Karim Foda
|
735406e536
fix import errors (#2026)
|
2 年之前 |
Ammar Ahmad Awan
|
36ad3119d5
DeepSpeed comm backend v1 (#1985)
|
2 年之前 |
Olatunji Ruwase
|
673cb60808
Improve z3 trace management (#1916)
|
2 年之前 |
Olatunji Ruwase
|
32d97976ce
Fix OOM and type mismatch (#1884)
|
2 年之前 |
Olatunji Ruwase
|
ef17c89570
Fix multiple zero 3 tracing errors (#1901)
|
2 年之前 |