Artur Niederfahrenhorst
|
d07e50e957
[RLlib] Replay buffer API (cleanups; docstrings; renames; move into `rllib/execution/buffers` dir) (#20552)
|
2 年之前 |
Sven Mika
|
4888d7c9af
[RLlib] Replay buffers: Add config option to store contents in checkpoints. (#17999)
|
3 年之前 |
Sven Mika
|
e2be41b407
[RLlib] MARWIL + BC: Various fixes and enhancements. (#16218)
|
3 年之前 |
Amog Kamsetty
|
ebc44c3d76
[CI] Upgrade flake8 to 3.9.1 (#15527)
|
3 年之前 |
Michael Luo
|
a2d1215200
[RLlib] Execution Annotation (#13036)
|
3 年之前 |
Edward Oakes
|
cde711aaf1
Revert "[RLLib] Execution-Folder Type Annotations (#12760)" (#12886)
|
3 年之前 |
Michael Luo
|
becca1424d
[RLLib] Execution-Folder Type Annotations (#12760)
|
3 年之前 |
Eric Liang
|
ecdaaffc67
add large data warning (#10957)
|
4 年之前 |
Sven Mika
|
805dad3bc4
[RLlib] SAC algo cleanup. (#10825)
|
4 年之前 |
Sven Mika
|
2256047876
[RLlib] Rename rllib.utils.types into typing to match built-in python module's name. (#10114)
|
4 年之前 |
Eric Liang
|
1e0e1a45e6
[rllib] Add type annotations for evaluation/, env/ packages (#9003)
|
4 年之前 |
Eric Liang
|
34bae27ac7
[rllib] Flexible multi-agent replay modes and replay_sequence_length (#8893)
|
4 年之前 |
Eric Liang
|
9a83908c46
[rllib] Deprecate policy optimizers (#8345)
|
4 年之前 |
Eric Liang
|
6bf1dc0888
[rllib] [hotfix] Build broken due to merge conflict: MixInReplay has no attribute buffer
|
4 年之前 |
Eric Liang
|
96f4d82cc3
[rllib] Qmix replay ratio is wrong
|
4 年之前 |
Eric Liang
|
2c599dbf05
[rllib] Port QMIX, MADDPG to new execution API (#8344)
|
4 年之前 |
Eric Liang
|
ee0eb44a32
Rename async_queue_depth -> num_async (#8207)
|
4 年之前 |
Eric Liang
|
2298f6fb40
[rllib] Port DQN/Ape-X to training workflow api (#8077)
|
4 年之前 |
Eric Liang
|
31b40b00f6
[rllib] Pull out experimental dsl into rllib.execution module, add initial unit tests (#7958)
|
4 年之前 |