Sven Mika
|
90c6b10498
[RLlib] Decentralized multi-agent learning; PR #01 (#21421)
|
2 years ago |
Sven Mika
|
5a313ba3d6
[RLlib] Refactor: All tf static graph code should reside inside Policy class. (#17169)
|
3 years ago |
Sven Mika
|
c3a15ecc0f
[RLlib] Issue #13802: Enhance metrics for `multiagent->count_steps_by=agent_steps` setting. (#14033)
|
3 years ago |
Michael Luo
|
a2d1215200
[RLlib] Execution Annotation (#13036)
|
3 years ago |
Edward Oakes
|
cde711aaf1
Revert "[RLLib] Execution-Folder Type Annotations (#12760)" (#12886)
|
3 years ago |
Michael Luo
|
becca1424d
[RLLib] Execution-Folder Type Annotations (#12760)
|
3 years ago |
Sven Mika
|
e40b14d255
[RLlib] Batch-size for truncate_episode batch_mode should be configurable in agent-steps (rather than env-steps), if needed. (#12420)
|
3 years ago |
Sven Mika
|
2256047876
[RLlib] Rename rllib.utils.types into typing to match built-in python module's name. (#10114)
|
4 years ago |
Eric Liang
|
1e0e1a45e6
[rllib] Add type annotations for evaluation/, env/ packages (#9003)
|
4 years ago |
mehrdadn
|
f93bb008bb
Change os.uname()[1] and socket.gethostname() to the portable and faster platform.node() (#8839)
|
4 years ago |
Eric Liang
|
9a83908c46
[rllib] Deprecate policy optimizers (#8345)
|
4 years ago |