Commit History

Author SHA1 Message Date
  Sven Mika 827ab91741 [RLlib] Replace remaining mentions of "trainer" by "algorithm". (#36557) 1 year ago
  Sven Mika e7a614f388 Revert "Revert "[RLlib] AlgorithmConfig: Next steps (volume 01); Algos, Rollo…" (#29747) 2 years ago
  Kai Fricke 12b579d95e Revert "[RLlib] AlgorithmConfig: Next steps (volume 01); Algos, RolloutWorker, PolicyMap, WorkerSet use AlgorithmConfig objects under the hood. (#29395)" (#29742) 2 years ago
  Sven Mika 182744bbd1 [RLlib] AlgorithmConfig: Next steps (volume 01); Algos, RolloutWorker, PolicyMap, WorkerSet use AlgorithmConfig objects under the hood. (#29395) 2 years ago
  mgerstgrasser a9b9208e38 [RLlib] Add lr_schedule support to SimpleQ and PG. (#28381) 2 years ago
  Sven Mika 6ca0b2f8e5 [RLlib] Some minor cleanups. (#28464) 2 years ago
  Jun Gong b383d987d1 [RLlib] Fix a bunch of issues related to connectors. (#26510) 2 years ago
  Artur Niederfahrenhorst 5133978adc [RLlib] PG policy subclassing conversion. (#25288) 2 years ago
  Eric Liang 905258dbc1 Clean up docstyle in python modules and add LINT rule (#25272) 2 years ago
  kourosh hakhamaneshi 3815e52a61 [RLlib] Agents to algos: DQN w/o Apex and R2D2, DDPG/TD3, SAC, SlateQ, QMIX, PG, Bandits (#24896) 2 years ago