Jeff Rasley
|
5a16213201
[docs] add mii repo link
|
2 years ago |
Jeff Rasley
|
ab77677d59
[docs] update news items
|
2 years ago |
Ammar Ahmad Awan
|
d0c6b2958f
Fix figure reference (#2419)
|
2 years ago |
Jeff Rasley
|
b8b98c6046
MII blog post (#2418)
|
2 years ago |
Connor Holmes
|
c3001324b4
Add predicated global load (#2373)
|
2 years ago |
Olatunji Ruwase
|
f4a92a19a6
Checkpoint backwards-compatibility workaround (#2384)
|
2 years ago |
lekurile
|
46a886c068
Change type to tuple in replace_wo_policy isinstance check (#2387)
|
2 years ago |
Michael Wyatt
|
6f3dec65ad
pin transformers version for unit tests (#2402)
|
2 years ago |
Thomas-MMJ
|
f5a8348973
allow building with latest CUDA (11.8), it is backwards compatible (#2390)
|
2 years ago |
Arash Bakhtiari
|
0a2ae2ef45
Fix the MLP output tensor's shape (#2380)
|
2 years ago |
Michael Wyatt
|
ff42743865
Refactor remaining distributed tests (#2216)
|
2 years ago |
Matt Smith
|
b609a29412
fix an exception when recursively casting dicts to fp16 (#2370)
|
2 years ago |
Molly Smith
|
eed40324db
Capture error message during sweep tests (#2351)
|
2 years ago |
Arash Bakhtiari
|
e14d40e5f3
Refactor fused_bias_residual kernels for better readability (#2356)
|
2 years ago |
Arash Bakhtiari
|
79692af1ea
Extend residual_add kernel tests to cover pre_attn_norm (#2354)
|
2 years ago |
Arash Bakhtiari
|
b450da4f70
Add missing pytest fixture scope (#2353)
|
2 years ago |
Guanhua Wang
|
3486afb1a3
fix cuda invalid config error in dequant kernel (#2362)
|
2 years ago |
Jeff Rasley
|
8e8c866ddf
Update issue templates
|
2 years ago |
Jeff Rasley
|
70e883a103
Updated issue templates (#2363)
|
2 years ago |
Arash Bakhtiari
|
9df604bf51
Refactor gptj_residual_add kernels for better readability (#2358)
|
2 years ago |
Michael Wyatt
|
6ef16de1a3
download cifar to blob storage (#2342)
|
2 years ago |
Jean-Louis Queguiner
|
2b1b0d2e86
docs(mixture-of-experts-inference): fix typo in tuto (#2345)
|
2 years ago |
Saeyeol Lee
|
f210256a32
Add Onebit Optimizers in __init__ (#2340)
|
2 years ago |
Connor Holmes
|
9aa7b638b7
Kernel Data Conversion Utility (#2327)
|
2 years ago |
Ammar Ahmad Awan
|
993264388d
Inference profiling updates/fixes (#2348) (#2349)
|
2 years ago |
Jeff Rasley
|
76de924b93
fix zero docs (#2350)
|
2 years ago |
Connor Holmes
|
3d097bb865
Extend scratch buffer for long prompts (#2212)
|
2 years ago |
Jeff Rasley
|
b76e0f4fe0
increase min pre-commit versions (#2346)
|
2 years ago |
Guanhua Wang
|
954e0c61f1
mem access for quantize kernel (#2331)
|
2 years ago |
Arash Bakhtiari
|
48c5220b52
Refactor residual add kernels (#2333)
|
2 years ago |