Sam Ade Jacobs
|
5bbbf41fe5
Ulysses: add col-ai evaluation (#4517)
|
1 year ago |
Liangliang-Ma
|
4fc181b010
[CCLBackend] update API (#4378)
|
1 year ago |
Conglong Li
|
574fbc0d68
add DeepSpeed4Science white paper (#4502)
|
1 year ago |
Logan Adams
|
a25a67a083
Fix scale factor on flops profiler (#4500)
|
1 year ago |
stephen youn
|
6c86ff393f
adding 8bit dequantization kernel for asym fine-grained block quantization in zero-inference (#4450)
|
1 year ago |
Logan Adams
|
427253b94b
Update ROCm version (#4486)
|
1 year ago |
Bram Vanroy
|
706a72562a
Allow env var for timeout (#4405)
|
1 year ago |
Matthew Hoffman
|
604d701e35
Introduce pydantic_v1 compatibility module for pydantic>=2.0.0 support (#4407)
|
1 year ago |
Logan Adams
|
e7acee4933
Remove DS_BUILD_UTILS reference (#4485)
|
1 year ago |
Michael Wyatt
|
77a2837163
[create-pull-request] automated change (#4484)
|
1 year ago |
Michael Wyatt
|
e9503fe40e
fix missing package
|
1 year ago |
Michael Wyatt
|
923f3590ee
fix bad build command (#4483)
|
1 year ago |
Yejing-Lai
|
6763e2de61
add lm_head and embed_out tensor parallel (#3962)
|
1 year ago |
Michael Wyatt
|
6b634d0e7e
move torch import (#4468)
|
1 year ago |
Max Kovalenko
|
59d62d614e
Move tensors to device if mp is not enabled (#4461)
|
1 year ago |
Kazuki Fujii
|
7ed952eff1
Fix bug in bfloat16 optimizer related to checkpointing (#4434)
|
1 year ago |
Jeff Rasley
|
c4d4679533
bump to 0.11.1
|
1 year ago |
Michael Wyatt
|
26d0dd927b
bump to 0.11.0
|
1 year ago |
Logan Adams
|
2118c63591
Add release flow (#4467)
|
1 year ago |
Nadav Elyahu
|
f9698c7307
pipe engine eval_batch: add option to disable loss broadcast (#4326)
|
1 year ago |
Hongjiu "Enneamer" Zhang
|
8e64c3b550
feat: add Lion optimizer (#4331)
|
1 year ago |
Wang, Yi
|
d72edb3b0d
fix lm head overridden issue, move it from checkpoint in-loop loading to out loop (#4206)
|
1 year ago |
Michael Wyatt
|
4294ea172c
CI fix for torch 2.1 release (#4452)
|
1 year ago |
Conglong Li
|
2c220d6593
DeepSpeed-VisualChat Chinese blog (#4458)
|
1 year ago |
Conglong Li
|
f63c35b4b3
Update README-Japanese.md (#4457)
|
1 year ago |
Conglong Li
|
93a6d7a547
fix blog format (#4456)
|
1 year ago |
Masahiro Tanaka
|
43a7f73594
add Japanese blog of DS visual chat (#4454)
|
1 year ago |
Alexander Jipa
|
9c22801f5e
documenting load_from_fp32_weights config parameter (#4449)
|
1 year ago |
Alex Kogan
|
7099f99333
Fix a bug in DeepSpeedMLP (#4389)
|
1 year ago |
Ammar Ahmad Awan
|
fa582c581a
Update README.md
|
1 year ago |