提交历史

作者 SHA1 备注 提交日期
  Sven Mika 6522935291 [RLlib] Slate-Q tf implementation and tests/benchmarks. (#22389) 2 年之前
  Sven Mika 38d75ce058 [RLlib] Cleanup SlateQ algo; add test + add target Q-net (#21827) 2 年之前
  Balaji Veeramani 7f1bacc7dc [CI] Format Python code with Black (#21975) 2 年之前
  Sven Mika 893536ebd9 [RLlib] Move bandits into main agents folder; Make RecSim adapter more accessible; (#21773) 2 年之前
  Ishant Mrinal 2868d1a2cf [RLlib] Support for RE3 exploration algorithm (for tf) (#19551) 2 年之前
  Tanay Wakhare 1826b29757 [RLlib] Curiosity (intrinsic motivation) Exploration module. (#9912) 4 年之前
  Sven Mika e153e3179f [RLlib] Exploration API: Policy changes needed for forward pass noisifications. (#7798) 4 年之前
  Sven Mika 83e06cd30a [RLlib] DDPG refactor and Exploration API action noise classes. (#7314) 4 年之前
  Sven Mika d537e9f0d8 [RLlib] Exploration API: merge deterministic flag with exploration classes (SoftQ and StochasticSampling). (#7155) 4 年之前
  Sven Mika 6e1c3ea824 [RLlib] Exploration API (+EpsilonGreedy sub-class). (#6974) 4 年之前