Sven Mika
|
6522935291
[RLlib] Slate-Q tf implementation and tests/benchmarks. (#22389)
|
2 years ago |
Sven Mika
|
38d75ce058
[RLlib] Cleanup SlateQ algo; add test + add target Q-net (#21827)
|
2 years ago |
Balaji Veeramani
|
7f1bacc7dc
[CI] Format Python code with Black (#21975)
|
2 years ago |
Sven Mika
|
893536ebd9
[RLlib] Move bandits into main agents folder; Make RecSim adapter more accessible; (#21773)
|
2 years ago |
Ishant Mrinal
|
2868d1a2cf
[RLlib] Support for RE3 exploration algorithm (for tf) (#19551)
|
2 years ago |
Tanay Wakhare
|
1826b29757
[RLlib] Curiosity (intrinsic motivation) Exploration module. (#9912)
|
4 years ago |
Sven Mika
|
e153e3179f
[RLlib] Exploration API: Policy changes needed for forward pass noisifications. (#7798)
|
4 years ago |
Sven Mika
|
83e06cd30a
[RLlib] DDPG refactor and Exploration API action noise classes. (#7314)
|
4 years ago |
Sven Mika
|
d537e9f0d8
[RLlib] Exploration API: merge deterministic flag with exploration classes (SoftQ and StochasticSampling). (#7155)
|
4 years ago |
Sven Mika
|
6e1c3ea824
[RLlib] Exploration API (+EpsilonGreedy sub-class). (#6974)
|
4 years ago |