Sam Ade Jacobs | a855405e0b | DeepSpeed Ulysses release (#4198) | 1 year ago
Ma, Guokai | 1bc3b78423 | [CPU] Use allreduce_low_latency for AutoTP and implement low latency allreduce for CPU backend (single node) (#3919) | 1 year ago
Heyang Qin | d18aa2c79c | ZeRO++ (#3784) | 1 year ago
Zhen Zhang | c88af21432 | [MiCS] [Fix] saving and loading model checkpoint logic for MiCS sharding (#3440) | 1 year ago
Zhen Zhang | 2e99f6edf6 | [DRAFT] Tentative implementation of MiCS (#2964) | 1 year ago
Michael Wyatt | ad168a6954 | Fix for dist not being initialized when constructing main config (#3324) | 1 year ago
Michael Wyatt | b361c72761 | Update DeepSpeed copyright license to Apache 2.0 (#3111) | 1 year ago
Mayank Mishra | a6317eb509 | ♻️ replace deprecated functions for communication (#2995) | 1 year ago
Jeff Rasley | 91d63e0228 | update formatter version and style settings (#3098) | 1 year ago
noabauma | db15ef578a | deepspeed.init_distributed() support for TCP protocols (#2905) | 1 year ago
Quentin Anthony | 18d55e54b0 | Update barrier and reduce_scatter_base to conform to PyTorch signatures (#2570) | 1 year ago
Jeff Rasley | d4cab2cec2 | use get_global_rank if available (#2567) | 1 year ago
Alex Hedges | 316c4a43e0 | Add flake8 to pre-commit checks (#2051) | 2 years ago
Quentin Anthony | 5349347bb6 | DeepSpeed Communication Profiling and Logging (#2012) | 2 years ago
Quentin Anthony | 9f5895cb7a | fix issue in accelerate. (#2121) | 2 years ago
Jeff Rasley | 844d9f31a9 | reduce ds-inference log verbosity (#2111) | 2 years ago
Karim Foda | 735406e536 | fix import errors (#2026) | 2 years ago
Ammar Ahmad Awan | 36ad3119d5 | DeepSpeed comm backend v1 (#1985) | 2 years ago