Commit History

Author SHA1 Message Date
  Alex Hedges 4abf637f96 Remove mutable default parameter in init_inference() (#2540) 1 year ago
  Michael Wyatt 8b4318b950 Make DS-Inference config readable from JSON (#2537) 1 year ago
  Michael Wyatt 43bf035cfc Update docs to autogenerate pydantic config model docs (#2509) 1 year ago
  Ammar Ahmad Awan b5d18a6ab3 DeepSpeed inference config. (#2459) (#2472) 1 year ago
  Connor Holmes 10e9d04c23 Cache Allocation and Softmax Fixes (#2433) 2 years ago
  Reza Yazdani afdc72879f Ds-inference Int8 support through ZeroQuant technology (#2217) 2 years ago
  Jeff Rasley 46401b3884 [zero-3] shutdown zero.Init from within ds.init (#2150) 2 years ago
  Reza Yazdani 556f005152 Fix random token-generation issue + MP-checkpoint loading/saving (#2132) 2 years ago
  Reza Yazdani aa88137b8d Add Inference support for running the BigScience-BLOOM Architecture (#2083) 2 years ago
  Ammar Ahmad Awan 36ad3119d5 DeepSpeed comm backend v1 (#1985) 2 years ago
  Reza Yazdani a5adb90d72 Enabling CUDA-graph for the bert-type models (#1952) 2 years ago
  Jeff Rasley 50893458d6 Fairseq support (#1915) 2 years ago
  Jeff Rasley b4fcd98ff0 Inference PP changes for neox (#1899) 2 years ago
  Jeff Rasley e46d808a1b MoE inference + PR-MoE model support (#1705) 2 years ago
  Jeff Rasley a8a17f234a Several fixes for our read-the-docs build (#1579) 2 years ago
  Reza Yazdani 9ce00a2171 Tensor-Parallelism general support (#1512) 2 years ago
  Rana Ali Amjad 648f7bfa50 Bfloat16 zero2 (#1398) 3 years ago
  Olatunji Ruwase 274c375c87 Support Callable type for client optimizer and lr_scheduler (#1316) 3 years ago
  Reza Yazdani ed3de0c21b Quantization + inference release (#1091) 3 years ago
  Jeff Rasley cfa63f5dad ZeRO stage 1 refresh (#1042) 3 years ago
  Sean Naren 41ab660b5d Refactor param_dict to config (#1008) 3 years ago
  Jeff Rasley 871f3048ad Allow args to be optional in deepspeed.initialize (#825) 3 years ago
  Samyam Rajbhandari 599258f979 ZeRO 3 Offload (#834) 3 years ago
  Jeff Rasley f032e56f8a Validate consistent ckpt tags across ranks (#667) 3 years ago
  Jeff Rasley 7435b2f10a Ability to initialize distributed backend outside deepspeed runtime (#608) 3 years ago
  Jeff Rasley dce054dbba backwards compatability w. v020 ckpts, fix issue with zero-1 ckpts (#543) 3 years ago
  Jeff Rasley 31f46feee2 DeepSpeed JIT op + PyPI support (#496) 4 years ago
  Reza Yazdani f5aa2547d8 Add CPUAdam optimizer for zero-offload in deepspeed engine (#484) 4 years ago
  Shaden Smith 65c2f974d8 Pipeline parallel training engine. (#392) 4 years ago
  Jeff Rasley 41db1c2f03 ZeRO-Offload release (#391) 4 years ago