Sven Mika
|
827ab91741
[RLlib] Replace remaining mentions of "trainer" by "algorithm". (#36557)
|
1 年之前 |
Sven Mika
|
e7a614f388
Revert "Revert "[RLlib] AlgorithmConfig: Next steps (volume 01); Algos, Rollo…" (#29747)
|
2 年之前 |
Kai Fricke
|
12b579d95e
Revert "[RLlib] AlgorithmConfig: Next steps (volume 01); Algos, RolloutWorker, PolicyMap, WorkerSet use AlgorithmConfig objects under the hood. (#29395)" (#29742)
|
2 年之前 |
Sven Mika
|
182744bbd1
[RLlib] AlgorithmConfig: Next steps (volume 01); Algos, RolloutWorker, PolicyMap, WorkerSet use AlgorithmConfig objects under the hood. (#29395)
|
2 年之前 |
mgerstgrasser
|
a9b9208e38
[RLlib] Add lr_schedule support to SimpleQ and PG. (#28381)
|
2 年之前 |
Artur Niederfahrenhorst
|
5133978adc
[RLlib] PG policy subclassing conversion. (#25288)
|
2 年之前 |
Eric Liang
|
905258dbc1
Clean up docstyle in python modules and add LINT rule (#25272)
|
2 年之前 |
kourosh hakhamaneshi
|
3815e52a61
[RLlib] Agents to algos: DQN w/o Apex and R2D2, DDPG/TD3, SAC, SlateQ, QMIX, PG, Bandits (#24896)
|
2 年之前 |