Sven Mika
|
2d24ef0d32
[RLlib] Add all simple learning tests as `framework=tf2`. (#19273)
|
3 years ago |
Sven Mika
|
0b308719f8
[RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829)
|
3 years ago |
gjoliver
|
d81885c1f1
[RLlib] Fix all the CI tests that were broken by is_training and replay buffer changes; re-comment-in the failing RLlib tests (#19809)
|
3 years ago |
Sven Mika
|
b213565783
[RLlib] Fix failing test cases: Soft-deprecate ModelV2.from_batch (in favor of ModelV2.__call__). (#19693)
|
3 years ago |
gjoliver
|
c3c42278e4
[RLlib] clean up all the SampleBatch['is_training'] deprecation warnings (#19652)
|
3 years ago |
Sven Mika
|
8a066474d4
[RLlib] No Preprocessors; preparatory PR #1 (#18367)
|
3 years ago |
Sven Mika
|
e3e6ed7aaa
[RLlib] Issues 17844, 18034: Fix n-step > 1 bug. (#18358)
|
3 years ago |
Sven Mika
|
a428f10ebe
[RLlib] Add multi-GPU learning tests to nightly. (#17778)
|
3 years ago |
Sven Mika
|
5a313ba3d6
[RLlib] Refactor: All tf static graph code should reside inside Policy class. (#17169)
|
3 years ago |
Michael Luo
|
474f04e322
[RLlib] DDPG/TD3 + A3C/A2C + MARWIL/BC Annotation/Comments/Code Cleanup (#14707)
|
3 years ago |
Sven Mika
|
839fc59224
[RLlib] CQL TensorFlow support (#15841)
|
3 years ago |
Antoine Galataud
|
ce1c001b1d
[RLlib] DQN: Place LearningRateSchedule mixin at the right moment (#15558)
|
3 years ago |
Sven Mika
|
04bc0a9828
[RLlib] Remove all non-trajectory view API code. (#14860)
|
3 years ago |
Sven Mika
|
69202c6a7d
[RLlib] Obsolete usage tracking dict via sample batch. (#13065)
|
3 years ago |
Sven Mika
|
732197e23a
[RLlib] Multi-GPU for tf-DQN/PG/A2C. (#13393)
|
3 years ago |
Sven Mika
|
8000258333
[RLlib] R2D2 Implementation. (#13933)
|
3 years ago |
Kai Fricke
|
d9e5d5f47a
[RLlib] Cast fcnet_hiddens to list for DQN models (list vs tuple mismatch error) (#14308)
|
3 years ago |
Sven Mika
|
2e3655e8a9
[RLlib] Issue 9071 A3C w/ RNN not working due to VF assuming no RNN. (#13238)
|
3 years ago |
desktable
|
4ccfd07a61
[RLlib] Add docstrings for agents/dqn (#10710)
|
4 years ago |
desktable
|
799318d7d7
[RLlib] Add type annotations for agents/dqn (#10626)
|
4 years ago |
Sven Mika
|
8a891b3c30
[RLlib] SAC n_step > 1. (#10567)
|
4 years ago |
Barak Michener
|
8e76796fd0
ci: Redo `format.sh --all` script & backfill lint fixes (#9956)
|
4 years ago |
Sven Mika
|
5dc4b6686e
[RLlib] Implement DQN PyTorch distributional head. (#9589)
|
4 years ago |
Sven Mika
|
935d8308fb
[RLlib] Issue #9437 (PyTorch converts to CPU tensor, even if on GPU). (#9497)
|
4 years ago |
Sven Mika
|
fcdf410ae1
[RLlib] Tf2.x native. (#8752)
|
4 years ago |
Sven Mika
|
43043ee4d5
[RLlib] Tf2x preparation; part 2 (upgrading `try_import_tf()`). (#9136)
|
4 years ago |
Sven Mika
|
4fd8977eaf
[RLlib] Minor cleanup in preparation to tf2.x support. (#9130)
|
4 years ago |
Sven Mika
|
428516056a
[RLlib] SAC Torch (incl. Atari learning) (#7984)
|
4 years ago |
Sven Mika
|
22ccc43670
[RLlib] DQN torch version. (#7597)
|
4 years ago |