提交历史

作者 SHA1 备注 提交日期
  Aman Sanger ae198e20f7 DataLoader Length Fix (#1718) 2 年之前
  Reza Yazdani 5dce73fa61 Fix transformer API for training-evaluation pipeline (#2018) 2 年之前
  Jeff Rasley 7c3344e215 DeepSpeed examples refresh (#2021) 2 年之前
  Jeff Rasley b666d5cd73 [inference] test suite for ds-kernels (bert, roberta, gpt2, gpt-neo, gpt-j) (#1992) 2 年之前
  Jeff Rasley e6f444aee2 [CI] force upgrade HF dependencies & output py env (#2015) 2 年之前
  Conglong Li 117c9cdf25 update CODEOWNERS (#2017) 2 年之前
  Quentin Anthony 25b2fc29fb Relax assertion to allow Megatron-DeepSpeed MoE to use ZeRO-1 (#2007) 2 年之前
  Ammar Ahmad Awan 36ad3119d5 DeepSpeed comm backend v1 (#1985) 2 年之前
  Jeff Rasley 828ab7185a [docs] add new build badges to landing page (#1998) 2 年之前
  Jerry Mannil d0eae5ad7a Propagate max errorcode to deepspeed when using PDSH launcher (#1994) 2 年之前
  Michael Wyatt 3678ee1778 [bug] Add user-defined launcher args for MPI launcher (#1933) 2 年之前
  Michael Wyatt 7fc3065074 Add torch-latest and torch-nightly CI workflows (#1990) 2 年之前
  Cheng Li 6719b46bd8 fix typo when getting kernel dim in conv calculation (#1989) 2 年之前
  Michael Wyatt b6f2a5602b added temp fix for our CI jobs (#1988) 2 年之前
  Michael Wyatt 5d3c67189b Add unit test for various model families and inference tasks (#1981) 2 年之前
  Reza Yazdani 0ebd81dfa9 small fix for the HF Bert models (#1984) 2 年之前
  Jeff Rasley 3da841853c add 530b paper (#1979) 2 年之前
  Jeff Rasley eb2ec7f458 bump to 0.6.6 2 年之前
  Reza Yazdani 8164ea9e6d Fixing several bugs in the inference-api and the kernels (#1951) 2 年之前
  Mikhail Druzhinin b8ff4825aa Fix: Sparse tensors not updating (#1914) 2 年之前
  Quentin Anthony 5208eb73da Add Unidirectional Sparse Attention Type to BigBird and BSLongformer (#1959) 2 年之前
  Jeff Rasley 737fee63ba Update cpu_adam.py (#1968) 2 年之前
  Jeff Rasley 79249428a4 Update Gemfile.lock (#1966) 2 年之前
  Quentin Anthony 0d36893281 Fix timer typo (#1964) 2 年之前
  dependabot[bot] e329e9858f Bump nokogiri from 1.13.4 to 1.13.6 in /docs (#1965) 2 年之前
  liamcli 380d32f980 [launcher] add option to bypass ssh check (#1957) 2 年之前
  Quentin Anthony 44085856a8 Add loss scale guard to avoid inf loop (#1958) 2 年之前
  Reza Yazdani a5adb90d72 Enabling CUDA-graph for the bert-type models (#1952) 2 年之前
  kisseternity 5053217e5d trivial fix (#1954) 2 年之前
  Olatunji Ruwase 09f1ad5e58 DeepSpeed needs to start cleaning up (#1947) 2 年之前