提交历史

作者 SHA1 备注 提交日期
  Yejing-Lai 85132adc31 enable starcode((kv_head=1)) autotp (#4896) 9 月之前
  Wang, Yi c8c57b8c24 add sharded loading for safetensors in AutoTP (#4854) 9 月之前
  baodi c20f6fa4e0 support baichuan model: (#4721) 10 月之前
  RyanInnerpeace 7b818ee961 improve the way to determine whether a variable is None (#4782) 10 月之前
  Ma, Guokai f15cccfa0c [AutoTP] Make AutoTP work when num_heads not divisible by number of workers (#4011) 1 年之前
  Yejing-Lai 6763e2de61 add lm_head and embed_out tensor parallel (#3962) 1 年之前
  Wang, Yi d72edb3b0d fix lm head overriden issue, move it from checkpoint in-loop loading to out loop (#4206) 1 年之前
  Ammar Ahmad Awan b9d719a6d3 Pass base_dir to model files can be loaded for auto-tp/meta-tensor. (#4348) 1 年之前
  Satpal Singh Rathore 430510bfce Checks for user injection policy (#3052) 1 年之前
  Dino Chen 0712e29920 add meta onDevice support for LLAMA2 (#4147) 1 年之前
  digger yu 4cde5da88e fix typo: change polciies to policies (#4090) 1 年之前
  Lev Kurilenko 1ba4098918 Fix Stable Diffusion Injection (#4078) 1 年之前
  Molly Smith 94c7233a8b Refactor autoTP inference for HE (#4040) 1 年之前
  mzl 6b877d2dbc autoTP for fused qkv weight (#3844) 1 年之前
  Wang, Yi 0bafeac491 enable autoTP for MPT (#3861) 1 年之前
  Wang, Yi 76953a37b7 fix opt-350m shard loading issue in AutoTP (#3600) 1 年之前
  Dino Chen f3943cf910 add llama2 autoTP support in replace_module (#4022) 1 年之前
  digger yu ce535945e6 fix: change ==NONE to is (#3923) 1 年之前
  Reza Yazdani f3c93b056d Add FALCON Auto-TP Support (#3640) 1 年之前
  Ma, Guokai 1f72082fc0 [CPU] Support Intel CPU inference (#3041) 1 年之前
  Wang, Yi b31b46c0d1 fix regression in shard checkpoint loading in AutoTP Path caused by qkv_copy() is deleted and add UT case for shard checkpoint loading in AutoTP (#3457) 1 年之前
  Wang, Yi d10b8ca011 add sharded checkpoint loading for AutoTP path to reduce the peak mem… (#3102) 1 年之前
  Connor Holmes 0a61d5d664 Hybrid Engine Refactor and Llama Inference Support (#3425) 1 年之前
  Reza Yazdani 3e8564645d Add HE support for the rest of model containers (#3191) 1 年之前
  Molly Smith 496a9a3a62 Diffusers 0.15.0 bug fix (#3345) 1 年之前
  Olatunji Ruwase 47f9f13bd3 DeepSpeed Chat (#3186) 1 年之前
  Michael Wyatt b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 年之前
  Jeff Rasley 91d63e0228 update formatter version and style settings (#3098) 1 年之前
  Molly Smith 9ea0fdc2ce Assert mp_size is factor of model dimensions (#2891) 1 年之前
  Heyang Qin dc01cee5ca using container when loading inference checkpoints (#2875) 1 年之前