Michael Luo
|
020c9439dd
[RLlib] CQL Documentation + Tests (#14531)
|
3 years ago |
Michael Luo
|
67229bf350
[RLlib] SlateQ Documentation (#13266)
|
3 years ago |
Sven Mika
|
391cdfae8c
[RLlib] Trajectory view API docs. (#12718)
|
3 years ago |
Sven Mika
|
f91c455527
[RLlib] Curiosity documentation. (#11066)
|
4 years ago |
Eric Liang
|
3eed3eca09
Move meta-learning algorithms into their own section in the TOC (#10727)
|
4 years ago |
Michael Luo
|
8e613652af
[RLLib] MBMPO Fixes (#10296)
|
4 years ago |
Simon Mo
|
5a38a76c83
[Doc] Use sphinx_book_theme (#10379)
|
4 years ago |
Michael Luo
|
4e9888ce2f
[RLlib] Dreamer (#10172)
|
4 years ago |
Eric Liang
|
7e3e4cd321
[rllib] Execution plan API documentation (#10000)
|
4 years ago |
Eric Liang
|
668f555755
[rllib] Clean up outdated docs #9915
|
4 years ago |
Michael Luo
|
851d02463b
[Doc] RLlib Algorithms Documentation: MAML + PyTorch MAML (#9189)
|
4 years ago |
Eric Liang
|
be26a7b1b0
[rllib] Support for complex / variable-length observation spaces (#8393)
|
4 years ago |
Eric Liang
|
9a83908c46
[rllib] Deprecate policy optimizers (#8345)
|
4 years ago |
Sven Mika
|
166bb5d690
[RLlib] IMPALA PyTorch (#8287)
|
4 years ago |
Sven Mika
|
499ad5fbe4
[RLlib] PyTorch version of APPO. (#8120)
|
4 years ago |
Sven Mika
|
d15609ba2a
[RLlib] PyTorch version of ARS (Augmented Random Search). (#8106)
|
4 years ago |
Sven Mika
|
3812bfedda
[RLlib] PyTorch version of ES (Evolution Strategies). (#8104)
|
4 years ago |
Sven Mika
|
d2b5c171cb
[RLlib] Add pytorch sigils to toc and add links to algo overview table. (#7950)
|
4 years ago |
Eric Liang
|
5cebee68d6
[rllib] Add scaling guide to documentation, improve bandit docs (#7780)
|
4 years ago |
Eric Liang
|
9392cdbf74
[rllib] Add high-performance external application connector (#7641)
|
4 years ago |
Eric Liang
|
52cf77f5a9
[rllib] SAC no_done_at_end should default to False (#7594)
|
4 years ago |
Sven Mika
|
2d97650b1e
[RLlib] Add Exploration API documentation. (#7373)
|
4 years ago |
Eric Liang
|
026f6884b5
[rllib] Add Decentralized DDPPO trainer and documentation (#7088)
|
4 years ago |
Eric Liang
|
fbc545c03b
[rllib] Support parallel, parameterized evaluation (#6981)
|
4 years ago |
Eric Liang
|
6bb30c9f1b
fix links (#6883)
|
4 years ago |
Eric Liang
|
14016535a5
[rllib] Add TF and Torch icons to show which are available for each algo (#6869)
|
4 years ago |
Victor Le
|
4e24c805ee
AlphaZero and Ranked reward implementation (#6385)
|
4 years ago |
Eric Liang
|
bc5e259264
[rllib] Add a doc section on computing actions (#6326)
|
4 years ago |
Eric Liang
|
a0dcb45dc3
[rllib] Fix APEX priorities returning zero all the time (#5980)
|
5 years ago |
Eric Liang
|
bc6a95deb0
[rllib] Eager execution for centralized critic example, fix simple optimizer for multiagent (#5683)
|
5 years ago |