Ma, Guokai
|
b22706a721
[CPU] Support SHM based inference_all_reduce in TorchBackend (#5391)
|
6 月之前 |
Nadav Elyahu
|
2eafe41be7
adding hccl to init_distributed function description (#5034)
|
8 月之前 |
CurryRice233
|
6de31de73f
[NPU] Change log level to debug (#5051)
|
8 月之前 |
Heyang Qin
|
c37fe9cbfb
Fix exception handling in get_all_ranks_from_group() function (#4862)
|
10 月之前 |
Quentin Anthony
|
8a93ded874
Fixed deepspeed.comm.monitored_barrier call (#4496)
|
1 年之前 |
Sam Ade Jacobs
|
a855405e0b
DeepSpeed Ulysses release (#4198)
|
1 年之前 |
hipudding
|
23a11a3951
Make Ascend NPU available (#3831)
|
1 年之前 |
Michael Wyatt
|
11f8e4a5c8
silence warning (#4009)
|
1 年之前 |
Ma, Guokai
|
1bc3b78423
[CPU] Use allreduce_low_latency for AutoTP and implement low latency allreduce for CPU backend (single node) (#3919)
|
1 年之前 |
digger yu
|
ce535945e6
fix: change ==NONE to is (#3923)
|
1 年之前 |
hipudding
|
c5e55d3d14
Fix a typo of global variable in comm.py(#3852) (#3852)
|
1 年之前 |
Michael Wyatt
|
b58e0fa92a
avoid init for deepspeed backend first (#3893)
|
1 年之前 |
Ma, Guokai
|
5d1124f2aa
[profiling]add show_straggler argument to log_summary() (#3579)
|
1 年之前 |
Heyang Qin
|
d18aa2c79c
ZeRO++ (#3784)
|
1 年之前 |
Dino Chen
|
3f5e493109
fix ccl_backend and residual_add problems (#3642)
|
1 年之前 |
Guo Yejun
|
3b29999761
deepspeed/comm/comm.py: fix typo of warning message (#3636)
|
1 年之前 |
Ma, Guokai
|
1f72082fc0
[CPU] Support Intel CPU inference (#3041)
|
1 年之前 |
digger-yu
|
198166423d
fix spelling error with deepspeed/ (#3494)
|
1 年之前 |
Zhen Zhang
|
2e99f6edf6
[DRAFT] Tentative implementation of MiCS (#2964)
|
1 年之前 |
Michael Wyatt
|
b361c72761
Update DeepSpeed copyright license to Apache 2.0 (#3111)
|
1 年之前 |
Mayank Mishra
|
a6317eb509
♻️ replace deprecated functions for communication (#2995)
|
1 年之前 |
Jeff Rasley
|
91d63e0228
update formatter version and style settings (#3098)
|
1 年之前 |
Stas Bekman
|
50a49e42fb
[logger] implement warning_once (#3021)
|
1 年之前 |
noabauma
|
db15ef578a
deepspeed.init_distributed() support for TCP protocols (#2905)
|
1 年之前 |
Ma, Guokai
|
9548d48f48
Abstract accelerator (step 2) (#2560)
|
1 年之前 |
Quentin Anthony
|
18d55e54b0
Update barrier and reduce_scatter_base to conform to PyTorch signatures (#2570)
|
1 年之前 |
Jeff Rasley
|
7d8ad45d6a
Fix regression w. dist_init_required (#2225)
|
2 年之前 |
Alex Hedges
|
316c4a43e0
Add flake8 to pre-commit checks (#2051)
|
2 年之前 |
Quentin Anthony
|
5349347bb6
DeepSpeed Communication Profiling and Logging (#2012)
|
2 年之前 |
Quentin Anthony
|
9b70ce56e7
Comms Benchmarks (#2040)
|
2 年之前 |