Michael Wyatt
|
78b7693591
Re-enable GPT-J unit tests and refactor inference tests (#3618)
|
1 year ago |
Michael Wyatt
|
7726fc8d54
Reduce Unit Test Times (Part 1) (#3829)
|
1 year ago |
Sam Ade Jacobs
|
25e500e8dd
Update zeropp.md (#3821)
|
1 year ago |
Masahiro Tanaka
|
203ac9d7ac
support model declaration in zero.Init context (#3592)
|
1 year ago |
Nadav Timor
|
a63f9152b9
July 2022 update: chrome://tracing is deprecated, and by default will redirect to https://ui.perfetto.dev. (see https://chromium.googlesource.com/catapult/+/refs/heads/main/tracing/docs/perfetto.md) (#3805)
|
1 year ago |
Jeff Rasley
|
6102d128f2
Revert "Prevent hangs in CI during parallel run compilation (#2844)" (#3817)
|
1 year ago |
Michael Wyatt
|
2b2be85f43
Prevent hangs in CI during parallel run compilation (#2844)
|
1 year ago |
kisseternity
|
1b888399dc
Add an api in deepspeed engine for adjusting micro batch size during training (#3773)
|
1 year ago |
stephen youn
|
bafaf3c0bb
bug fix: triton importing error (#3799)
|
1 year ago |
Ramya Ramineni
|
aebdfb3b92
Fix Bug in transform.cu (#3534)
|
1 year ago |
Jeff Rasley
|
d33f1f851f
bump to 0.10.0
|
1 year ago |
Joe Mayer
|
5eb2598623
Requires grad checking. (#3789)
|
1 year ago |
Connor Holmes
|
c86e4e31b8
Missing strided copy for gated MLP (#3788)
|
1 year ago |
Cheng Li
|
c80855b543
Bug Fixes for autotuner and flops profiler (#1880)
|
1 year ago |
Masahiro Tanaka
|
b0752b2ef6
Add ZeRO++ Japanese blog (#3797)
|
1 year ago |
Heyang Qin
|
94479e2b75
adding zero++ to navigation panel of deepspeed.ai (#3796)
|
1 year ago |
Heyang Qin
|
d18aa2c79c
ZeRO++ (#3784)
|
1 year ago |
stephen youn
|
69d1b9f978
DeepSpeed-Triton for Inference (#3748)
|
1 year ago |
Jeff Rasley
|
52c6baa933
remove staging trigger (#3792)
|
1 year ago |
Heyang Qin
|
b332094015
ZeRO++ chinese blog (#3793)
|
1 year ago |
Jeff Rasley
|
6e4faf869d
bump to 0.9.6
|
1 year ago |
Jeff Rasley
|
80ccaf9c7a
revert PR #3611 (#3786)
|
1 year ago |
Guorun
|
24c7d7f14a
use `Flops Profiler` to test `model.generate()` (#2515)
|
1 year ago |
Cheng Li
|
a76cced3fa
fix interpolate flops compute (#3782)
|
1 year ago |
Bill Luo
|
062408683c
[Fix] _conv_flops_compute when padding is a str and stride=1 (#3169)
|
1 year ago |
Heyang Qin
|
d2bf38d6f2
zero++ tutorial PR (#3783)
|
1 year ago |
Logan Adams
|
dd59341001
Add H100 workflow and status badge. (#3754)
|
1 year ago |
Ikko Eltociear Ashimine
|
1491e14e1d
Update deepspeed-chat/japanese/README.md (#3765)
|
1 year ago |
Vlad
|
044dd0e2c3
Fix url in getting-started guide (docs) (#3768)
|
1 year ago |
Dino Chen
|
3f5e493109
fix ccl_backend and residual_add problems (#3642)
|
1 year ago |