Liangliang Ma
|
40bde528bc
[XPU] upgrade xpu max1100 CI workflow to pytorch2.3 (#6646)
|
1 天之前 |
Joe Mayer
|
6eefc3d0ea
Fix Memory Leak In AIO (#6630)
|
4 天之前 |
Masahiro Tanaka
|
c9fc34a4be
Use file store for tests (#6632)
|
4 天之前 |
Masahiro Tanaka
|
a36db9cc1c
Update torch version in workflows (#6631)
|
4 天之前 |
jiahao su
|
c9899dc14a
Add README Pipeline Status for Huawei Ascend NPU (#6588)
|
6 天之前 |
Masahiro Tanaka
|
1a45bd8e8c
Lock cache file of HF model list (#6628)
|
6 天之前 |
Shelly Nahir
|
ce468c3756
add option to disable logger while compiling to avoid graph breaks (#6496)
|
6 天之前 |
Xu Song
|
bf60fc0ca6
Support safetensors export (#6579)
|
1 周之前 |
Joe Mayer
|
85b7469ea0
Add first Step in LR Schedulers (#6597)
|
1 周之前 |
diskkid
|
13c16c9562
Accept btl_tcp_if_include option through launcher_args (#6613)
|
1 周之前 |
Olatunji Ruwase
|
65ab64481f
Add API for updating ZeRO gradients (#6590)
|
1 周之前 |
Ma, Guokai
|
cf41e8c4e8
[compile] Show breakdown of graph break (#6601)
|
1 周之前 |
Masahiro Tanaka
|
7a5bc4fdf9
Ignore reuse_dist_env (#6623)
|
1 周之前 |
Masahiro Tanaka
|
5c4b97f109
apply fp16 autocast only to floating point values
|
1 周之前 |
Masahiro Tanaka
|
adec99121b
Add API to get devices of offload states (#6586)
|
1 周之前 |
Nir Sonnenschein
|
d7ca3d8373
reduce setting global variables to reduce torch compile graph breaks (#6541)
|
1 周之前 |
Joe Mayer
|
a1f98bdc70
AIO CPU Locked Tensor (#6592)
|
1 周之前 |
Masahiro Tanaka
|
7d751ee890
Clean up prefetched parameters (#6557)
|
1 周之前 |
Logan Adams
|
55f7f3789e
Update version.txt after 0.15.2 release (#6615)
|
1 周之前 |
gyou2021
|
474a3288cd
Enabled Qwen2-MoE Tensor Parallelism (TP) inference (#6551)
|
1 周之前 |
Logan Adams
|
1062a0c658
Unpin accelerate tests, update lightning with node16 removal. (#6611)
|
1 周之前 |
Omar Elayan
|
645639bcf8
Rearrange inference OPS and stop using builder.load (#5490)
|
1 周之前 |
Yichen Yan
|
ca8b1fe945
Handle when `backend` is also in compile_kwargs (#6502)
|
1 周之前 |
Masahiro Tanaka
|
5cbbff40bd
Fix device selection using CUDA_VISIBLE_DEVICES (#6530)
|
1 周之前 |
Olatunji Ruwase
|
f74ea69abf
Improve DS logging control (#6602)
|
1 周之前 |
Yejing-Lai
|
e97b453645
Add llama3.2 vision autotp (#6577)
|
1 周之前 |
Logan Adams
|
745dd48b90
Pin accelerate to fix CI failures/issues (#6610)
|
1 周之前 |
Logan Adams
|
00c4b98ba0
Fix SD workflow (#6609)
|
1 周之前 |
Logan Adams
|
20695b39b1
Move V100 workflows from cuda 11.1/11.7 to 12.1 (#6607)
|
2 周之前 |
Logan Adams
|
940887ded1
Add SSF Best practices badge (#6604)
|
2 周之前 |