Sven Mika 0395e04260 [RLlib] Remove default-metrics from `Algorithm` (tune does NOT error anymore if any stop-metric is missing). (#46200) 3 月之前
..
checkpointing_tests f8e59cba73 [RLlib] Do-over of release tests in light of rllib_contrib AND new- vs old API stack. (#43278) 7 月之前
learning_tests 0395e04260 [RLlib] Remove default-metrics from `Algorithm` (tune does NOT error anymore if any stop-metric is missing). (#46200) 3 月之前
1gpu_16cpus.yaml 1b6312823c [RLlib] Various updates to the Release CI RLlib (#36883) 1 年之前
1gpu_16cpus_gce.yaml 1b6312823c [RLlib] Various updates to the Release CI RLlib (#36883) 1 年之前
1gpu_24cpus.yaml 1b6312823c [RLlib] Various updates to the Release CI RLlib (#36883) 1 年之前
1gpu_24cpus_gce.yaml 1b6312823c [RLlib] Various updates to the Release CI RLlib (#36883) 1 年之前
1gpu_32cpus.yaml 509184a429 [RLlib] Make checkpointing test have multiple nodes, make node for dqn test larger (#37127) 1 年之前
1gpu_4cpus.yaml 4adb78b2bf [RLlib] Activate DreamerV3 weekly release test (on Pong-v5 with the 100k setup). (#45654) 4 月之前
1gpu_4cpus_gce.yaml 1b6312823c [RLlib] Various updates to the Release CI RLlib (#36883) 1 年之前
2gpus_32cpus.yaml 1a0989a1c0 [Release Test] Add GCE variation for core release tests [2/n] (#34337) 1 年之前
2gpus_32cpus_gce.yaml 1a0989a1c0 [Release Test] Add GCE variation for core release tests [2/n] (#34337) 1 年之前
2gpus_64cpus.yaml 3a181229cb Allow tf and torch apex release tests to run in parallel (#30882) 1 年之前
2gpus_64cpus_gce.yaml 3d335e1b22 [CI][GCE][RLlib] Add GCE variations to RLlib release tests (#34080) 1 年之前
32cpus.yaml 1a0989a1c0 [Release Test] Add GCE variation for core release tests [2/n] (#34337) 1 年之前
32cpus_gce.yaml 1a0989a1c0 [Release Test] Add GCE variation for core release tests [2/n] (#34337) 1 年之前
4gpus_512_cpus_gce.yaml 3d335e1b22 [CI][GCE][RLlib] Add GCE variations to RLlib release tests (#34080) 1 年之前
4gpus_544_cpus.yaml 66aae4cb32 [Release Test] Make sure to delete all EBS volumes (#28707) 2 年之前
4gpus_64cpus.yaml 1b6312823c [RLlib] Various updates to the Release CI RLlib (#36883) 1 年之前
4gpus_64cpus_gce.yaml 1b6312823c [RLlib] Various updates to the Release CI RLlib (#36883) 1 年之前
8gpus_96cpus.yaml 1b6312823c [RLlib] Various updates to the Release CI RLlib (#36883) 1 年之前
8gpus_96cpus_gce.yaml 1b6312823c [RLlib] Various updates to the Release CI RLlib (#36883) 1 年之前
README.rst c9d220bcda [RLlib] Upgrade RLlib regression test scripts to new testing tool - RLlib release logs for 1.4. (#16080) 3 年之前
app_config.yaml 1baab751a5 [RLlib] Fix RLlib stresstest (removed `minigrid` package). (#40656) 1 年之前
debug_app_config.yaml 25cf1842da [ci/release] Remove `default` images in app config templates (#37970) 1 年之前
multi_node_checkpointing_compute_config.yaml 509184a429 [RLlib] Make checkpointing test have multiple nodes, make node for dqn test larger (#37127) 1 年之前
multi_node_checkpointing_compute_config_gce.yaml 43210e0190 [RLlib] Change placement group strategy for learner (#36929) 1 年之前

README.rst

RLlib Tests
===========

This directory contains various RLlib release tests.

You should run these tests with the `releaser `_ tool.

Overview
--------
Currently, there are 3 RLlib tests:

1. ``learning_tests`` - Tests, whether major algos (tf+torch) can learn in Atari or PyBullet envs in ~30-60min.
1. ``stress_tests`` - Runs 4 IMPALA Atari jobs, each one using 1GPU and 128CPUs (needs autoscaling to succeed).
1. ``unit_gpu_tests`` - Tests, whether all of RLlib's example scripts can be run on a GPU.

Generally the releaser tool will run all tests in parallel.

Acceptance criteria
-------------------
These tests are considered passing when they throw no error at the end of the output log.