Aman Sanger
|
ae198e20f7
DataLoader Length Fix (#1718)
|
2 years ago |
Reza Yazdani
|
5dce73fa61
Fix transformer API for training-evaluation pipeline (#2018)
|
2 years ago |
Jeff Rasley
|
7c3344e215
DeepSpeed examples refresh (#2021)
|
2 years ago |
Jeff Rasley
|
b666d5cd73
[inference] test suite for ds-kernels (bert, roberta, gpt2, gpt-neo, gpt-j) (#1992)
|
2 years ago |
Jeff Rasley
|
e6f444aee2
[CI] force upgrade HF dependencies & output py env (#2015)
|
2 years ago |
Conglong Li
|
117c9cdf25
update CODEOWNERS (#2017)
|
2 years ago |
Quentin Anthony
|
25b2fc29fb
Relax assertion to allow Megatron-DeepSpeed MoE to use ZeRO-1 (#2007)
|
2 years ago |
Ammar Ahmad Awan
|
36ad3119d5
DeepSpeed comm backend v1 (#1985)
|
2 years ago |
Jeff Rasley
|
828ab7185a
[docs] add new build badges to landing page (#1998)
|
2 years ago |
Jerry Mannil
|
d0eae5ad7a
Propagate max errorcode to deepspeed when using PDSH launcher (#1994)
|
2 years ago |
Michael Wyatt
|
3678ee1778
[bug] Add user-defined launcher args for MPI launcher (#1933)
|
2 years ago |
Michael Wyatt
|
7fc3065074
Add torch-latest and torch-nightly CI workflows (#1990)
|
2 years ago |
Cheng Li
|
6719b46bd8
fix typo when getting kernel dim in conv calculation (#1989)
|
2 years ago |
Michael Wyatt
|
b6f2a5602b
added temp fix for our CI jobs (#1988)
|
2 years ago |
Michael Wyatt
|
5d3c67189b
Add unit test for various model families and inference tasks (#1981)
|
2 years ago |
Reza Yazdani
|
0ebd81dfa9
small fix for the HF Bert models (#1984)
|
2 years ago |
Jeff Rasley
|
3da841853c
add 530b paper (#1979)
|
2 years ago |
Jeff Rasley
|
eb2ec7f458
bump to 0.6.6
|
2 years ago |
Reza Yazdani
|
8164ea9e6d
Fixing several bugs in the inference-api and the kernels (#1951)
|
2 years ago |
Mikhail Druzhinin
|
b8ff4825aa
Fix: Sparse tensors not updating (#1914)
|
2 years ago |
Quentin Anthony
|
5208eb73da
Add Unidirectional Sparse Attention Type to BigBird and BSLongformer (#1959)
|
2 years ago |
Jeff Rasley
|
737fee63ba
Update cpu_adam.py (#1968)
|
2 years ago |
Jeff Rasley
|
79249428a4
Update Gemfile.lock (#1966)
|
2 years ago |
Quentin Anthony
|
0d36893281
Fix timer typo (#1964)
|
2 years ago |
dependabot[bot]
|
e329e9858f
Bump nokogiri from 1.13.4 to 1.13.6 in /docs (#1965)
|
2 years ago |
liamcli
|
380d32f980
[launcher] add option to bypass ssh check (#1957)
|
2 years ago |
Quentin Anthony
|
44085856a8
Add loss scale guard to avoid inf loop (#1958)
|
2 years ago |
Reza Yazdani
|
a5adb90d72
Enabling CUDA-graph for the bert-type models (#1952)
|
2 years ago |
kisseternity
|
5053217e5d
trivial fix (#1954)
|
2 years ago |
Olatunji Ruwase
|
09f1ad5e58
DeepSpeed needs to start cleaning up (#1947)
|
2 years ago |