Yucheng Lu
|
b80e5624e2
01 adam optimizer (#1790)
|
2 years ago |
Jeff Rasley
|
ac71a1a461
[docker] simplify and update rocm dockerfile (#1819)
|
2 years ago |
Jeff Rasley
|
097efeb73e
update gemfile (#1817)
|
2 years ago |
Jeff Rasley
|
013df6a9e8
bump to 0.6.1
|
2 years ago |
Jeff Rasley
|
a32e9b3376
force amd install via sudo (#1815)
|
2 years ago |
Jithun Nair
|
350d74ca39
Invoke hipify from op builder for JIT extension builds (#1807)
|
2 years ago |
Andrii Oriekhov
|
d7684f4e81
add GitHub URL for PyPi (#1812)
|
2 years ago |
Olatunji Ruwase
|
9f7126fc10
Refactor moe/non-moe gradient reduction (#1811)
|
2 years ago |
Reza Yazdani
|
60fc06c610
Synchronize the GPUs for the text-generation inference test (#1805)
|
2 years ago |
Jeff Rasley
|
c3c8d5dd93
AMD support (#1430)
|
2 years ago |
Ammar Ahmad Awan
|
f0304bd103
Minjiaz/mos doc (#1802)
|
2 years ago |
Jeff Rasley
|
3401d2516f
bump DSE to latest commit
|
2 years ago |
Ammar Ahmad Awan
|
c0af6d90f7
Refactor MoE and Groups API to simplify model creation and management (#1798)
|
2 years ago |
dependabot[bot]
|
a254f39ef0
Bump nokogiri from 1.12.5 to 1.13.3 in /docs (#1794)
|
2 years ago |
Cheng Li
|
2151c787a2
Generalize the model input format of the flops profiler get_model_profile() API (#1768)
|
2 years ago |
Jeff Rasley
|
8eef742f0c
bf16 is supported w. zero 1, fix assert (#1779)
|
2 years ago |
Reza Yazdani
|
5ca2627743
Fix CPU-Offload: Send groups of parameter lists as the FP16 parameters (#1774)
|
2 years ago |
Stas Bekman
|
baef92e26f
[save_16bit_model] add missing prologue (#1741)
|
2 years ago |
Olatunji Ruwase
|
5fe5b38ea0
Prepare zero-3 optimizer for ckpt load/save (#1750)
|
2 years ago |
Jeff Rasley
|
674c75882d
unset torch arch list for JIT mode (#1765)
|
2 years ago |
Olatunji Ruwase
|
4f96ffd9ab
[Doc] Add async I/O op (#1754)
|
2 years ago |
Reza Yazdani
|
841f99d162
Load MoE checkpoint at deepspeed inference-engine (#1759)
|
2 years ago |
Cheng Li
|
cccc5450e0
Fix autotuning config (#1660)
|
2 years ago |
Reza Yazdani
|
d3cad05105
fixing the inference build path when pre-building the inference op (#1755)
|
2 years ago |
Jeff Rasley
|
56eac3829f
fix pytest issue (#1764)
|
2 years ago |
Du Li
|
97f8a9eb66
fixing a bf16 support issue (#1760)
|
2 years ago |
Jeff Rasley
|
dbe8ee167b
Loosen requirement on packaging dependency (#1758)
|
2 years ago |
liamcli
|
dac9056e13
Improve how runner parses env var file (#1747)
|
2 years ago |
Olatunji Ruwase
|
135a625619
Move param_shapes to model files (#1732)
|
2 years ago |
Cheng Li
|
ba9c4cc75c
separate add and mul flops compute function (#1745)
|
2 years ago |