Sven Mika
|
d5bfb7b7da
[RLlib] Preparatory PR for multi-agent multi-GPU learner (alpha-star style) #03 (#21652)
|
2 年之前 |
Sven Mika
|
3ac4daba07
[RLlib] Discussion 4351: Conv2d default filter tests and add default setting for 96x96 image obs space. (#21560)
|
2 年之前 |
Avnish Narayan
|
f7a5fc36eb
[rllib] Give rnnsac_stateless cartpole gpu, increase timeout (#21407)
|
2 年之前 |
Sven Mika
|
9e6b871739
[RLlib] Better utils for flattening complex inputs and enable prev-actions for LSTM/attention for complex action spaces. (#21330)
|
2 年之前 |
Sven Mika
|
abd3bef63b
[RLlib] QMIX better defaults + added to CI learning tests (#21332)
|
2 年之前 |
Sven Mika
|
daa4304a91
[RLlib] Switch off preprocessors by default for PGTrainer. (#21008)
|
2 年之前 |
Sven Mika
|
596c8e2772
[RLlib] Experimental no-flatten option for actions/prev-actions. (#20918)
|
2 年之前 |
Eric Liang
|
6f93ea437e
Remove the flaky test tag (#21006)
|
2 年之前 |
Avnish Narayan
|
6996eaa986
[RLlib] Add necessary fields to Base Envs, and BaseEnv wrapper classes (#20832)
|
2 年之前 |
Ishant Mrinal
|
2868d1a2cf
[RLlib] Support for RE3 exploration algorithm (for tf) (#19551)
|
2 年之前 |
Sven Mika
|
60b2219d72
[RLlib] Allow for evaluation to run by `timesteps` (alternative to `episodes`) and add auto-setting to make sure train doesn't ever have to wait for eval (e.g. long episodes) to finish. (#20757)
|
2 年之前 |
Jun Gong
|
65bd8e29f8
[RLlib] Update a few things to get rid of the `remote_vector_env` deprecation warning. (#20753)
|
2 年之前 |
mvindiola1
|
8cee0c03bf
[RLlib] Update `max_seq_len` in pad_batch_to_sequences_of_same_size (#20743)
|
2 年之前 |
Sven Mika
|
7a585fb275
[RLlib; Documentation] RLlib README overhaul. (#20249)
|
2 年之前 |
Sven Mika
|
56619b955e
[RLlib; Documentation] Some docstring cleanups; Rename RemoteVectorEnv into RemoteBaseEnv for clarity. (#20250)
|
2 年之前 |
Avnish Narayan
|
dc17f0a241
Add error messages for missing tf and torch imports (#20205)
|
2 年之前 |
Sven Mika
|
f82880eda1
Revert "Revert [RLlib] POC: Deprecate `build_policy` (policy template) for torch only; PPOTorchPolicy (#20061) (#20399)" (#20417)
|
2 年之前 |
Amog Kamsetty
|
90dc5460d4
Revert "[RLlib] POC: Deprecate `build_policy` (policy template) for torch only; PPOTorchPolicy (#20061)" (#20399)
|
2 年之前 |
Sven Mika
|
5b1c8e46e1
[RLlib] POC: Deprecate `build_policy` (policy template) for torch only; PPOTorchPolicy (#20061)
|
2 年之前 |
Sven Mika
|
ebd56b57db
[RLlib; documentation] "RLlib in 60sec" overhaul. (#20215)
|
2 年之前 |
Sven Mika
|
143d23a278
[RLlib] Issue 20062: Action inference examples missing (#20144)
|
2 年之前 |
Sven Mika
|
76f8a9f125
[RLlib; testing] Increase size of two time-out'ing test cases from medium to large. (#20128)
|
3 年之前 |
Sven Mika
|
a931076f59
[RLlib] Tf2 + eager-tracing same speed as framework=tf; Add more test coverage for tf2+tracing. (#19981)
|
3 年之前 |
Sven Mika
|
4cb23d1c95
[Tune; Testing] Revert to 3.7 (undone by accident by previous PR); + some minor comment cleanups. (#20031)
|
3 年之前 |
gjoliver
|
2c1fa459d4
[RLlib] Add an RLlib Tune experiment to UserTest suite. (#19807)
|
3 年之前 |
Avnish Narayan
|
026bf01071
[RLlib] Upgrade gym version to 0.21 and deprecate pendulum-v0. (#19535)
|
3 年之前 |
Sven Mika
|
e6ae08f416
[RLlib] Optionally don't drop last ts in v-trace calculations (APPO and IMPALA). (#19601)
|
3 年之前 |
Sven Mika
|
2d24ef0d32
[RLlib] Add all simple learning tests as `framework=tf2`. (#19273)
|
3 年之前 |
Sven Mika
|
4d945fe651
[RLlib] Issue 19878: Re-instate bare_metal_policy example script (#19881)
|
3 年之前 |
Rohan138
|
b9c9cc5946
[RLlib] Updated PettingZoo+RLlib tutorial; Removed pettingzoo example script (#19069)
|
3 年之前 |