Rui Qiao
|
d655fcc5d7
[Core] Mark single_node_oom release test as unstable (#47591)
|
1 month ago |
Jiajun Yao
|
6bcce32dcb
[Core] Fix aws_cluster_launcher_full (#47512)
|
1 month ago |
Cindy Zhang
|
ca85922517
[serve] remove outdated microbenchmark release test (#47502)
|
1 month ago |
Scott Lee
|
b433de7cf7
[Data] [Release Test] Use worker node instead of head node for `read_images_comparison_microbenchmark_single_node` (#47228)
|
2 months ago |
Sven Mika
|
b0403184a1
[RLlib] Fix SAC/DQN/CQL GPU and multi-GPU. (#47179)
|
2 months ago |
Jiajun Yao
|
85eaffd43f
[Core] Stop running dask_on_ray_100gb_sort with python 3.11 (#46987)
|
2 months ago |
Cuong Nguyen
|
e68005e505
[ci] upgrade release tests to python 3.12 (#46989)
|
2 months ago |
Cuong Nguyen
|
a8caff68d7
[py12] build ray image (#46649)
|
2 months ago |
Stephanie Wang
|
de0f728b2c
[core][adag] Report GPU performance results for aDAG microbenchmark (#46872)
|
2 months ago |
Cindy Zhang
|
e079a9080c
[serve] add cmdline options to control which microbenchmarks to run (#46798)
|
2 months ago |
Lonnie Liu
|
6b81634a64
[air examples] remove runtime env on dependencies (#46759)
|
2 months ago |
Cuong Nguyen
|
e90d02d359
[ci] update release test requirements (#46745)
|
2 months ago |
Lonnie Liu
|
b51ed33116
[ml] update air examples dependencies (#46723)
|
3 months ago |
Cuong Nguyen
|
0b32b589ab
[py12] upgrade torch version again (#46662)
|
3 months ago |
Cuong Nguyen
|
13d02a3c19
Revert "[RFC][python 3.12] upgrade pytorch " (#46661)
|
3 months ago |
Cuong Nguyen
|
d26b2efb11
[RFC][python 3.12] upgrade pytorch (#46256)
|
3 months ago |
Cuong Nguyen
|
bd9dc1630f
[release] unbreak dask_on_ray_1tb_sort (#46120)
|
4 months ago |
Cuong Nguyen
|
0bf72a6d7c
[ci] deflake rllib release tests (#45901)
|
4 months ago |
Lonnie Liu
|
d59d1ef352
[finetune] change fine-tuning examples to use cuda 12.3 (#45879)
|
4 months ago |
Sven Mika
|
d49f15b111
[RLlib] Add "official" benchmark script for Atari PPO benchmarks (new API stack). (#45697)
|
4 months ago |
Sven Mika
|
4adb78b2bf
[RLlib] Activate DreamerV3 weekly release test (on Pong-v5 with the 100k setup). (#45654)
|
4 months ago |
Hongchao Deng
|
d4fc01c584
[core] add chaos_many_tasks/actors terminate instance cases (#45663)
|
4 months ago |
Sven Mika
|
c94140a3a4
[RLlib] Complete do-over of RLlib release tests (new API stack). (#45589)
|
4 months ago |
Stephanie Wang
|
ab2b442b34
[core][experimental] Fix GPU microbenchmark (#45426)
|
5 months ago |
Cuong Nguyen
|
90fa2895bd
[release] mark chaos_torch_batch_inference_16_gpu_300gb_raw as non-stable (#45387)
|
5 months ago |
Stephanie Wang
|
79f39957dc
[core][experimental] Accelerated DAG NCCL-based p2p channels for torch.Tensors (#45092)
|
5 months ago |
Cindy Zhang
|
863dc2392f
[serve] run all serve release tests in isolated cloud (#44939)
|
6 months ago |
Cindy Zhang
|
4449c5e3c6
[serve] remove old autoscaling release tests (#44785)
|
6 months ago |
Cindy Zhang
|
f38582fd76
[serve] improve 1k replica scalability release test (#44318)
|
6 months ago |
Cindy Zhang
|
6158f13cc6
[serve] Add microbenchmark release tests (#44327)
|
6 months ago |