Sven Mika
|
223b39611e
[RLlib] Deprecate/cleanup: AlgorithmConfig["multiagent"] access and usage in tests and examples. (#35879)
|
1 年之前 |
Sven Mika
|
794cfd9725
[RLlib] `AlgorithmConfig.overrides()` to replace `multiagent->policies->config` and `evaluation_config` dicts. (#30879)
|
1 年之前 |
Artur Niederfahrenhorst
|
ef62802353
[RLlib] Unify policy mapping function usage (#30216)
|
1 年之前 |
Sven Mika
|
72fefc3a40
[RLlib] AlgorithmConfig: Replace more of the old-style config dicts across codebase. (#29799)
|
2 年之前 |
Sven Mika
|
b218ae7e4a
[RLlib] Replace CartPole-v0 -> CartPole-v1 everywhere, incl. docs. (#29752)
|
2 年之前 |
Sven Mika
|
199dc8cff0
Revert "Revert "[RLlib] @deprecate(error=True|False) escalation."" (#28807)
|
2 年之前 |
Amog Kamsetty
|
e501654925
Revert "[RLlib] @deprecate(error=True|False) escalation. (#28733)" (#28795)
|
2 年之前 |
Sven Mika
|
c4348c1889
[RLlib] @deprecate(error=True|False) escalation. (#28733)
|
2 年之前 |
Jun Gong
|
8c9cac350d
Fix unit test test_check_env.py and est_check_multi_agent.py. (#25993)
|
2 年之前 |
Sven Mika
|
b5bc2b93c3
[RLlib] Move all remaining algos into `algorithms` directory. (#25366)
|
2 年之前 |
kourosh hakhamaneshi
|
3815e52a61
[RLlib] Agents to algos: DQN w/o Apex and R2D2, DDPG/TD3, SAC, SlateQ, QMIX, PG, Bandits (#24896)
|
2 年之前 |
Balaji Veeramani
|
7f1bacc7dc
[CI] Format Python code with Black (#21975)
|
2 年之前 |
Sven Mika
|
f94bd99ce4
[RLlib] Issue 21044: Improve error message for "multiagent" dict checks. (#21448)
|
2 年之前 |