Sven Mika
|
9c5a0cfd7a
[RLlib] Issue 14385: `Policy.compute_actions_from_input_dict` does not properly track accessed fields for Policy's view requirements. (#14386)
|
3 年之前 |
Sven Mika
|
69202c6a7d
[RLlib] Obsolete usage tracking dict via sample batch. (#13065)
|
3 年之前 |
Sven Mika
|
8000258333
[RLlib] R2D2 Implementation. (#13933)
|
3 年之前 |
Sven Mika
|
81e7434091
[RLlib] TFPolicy.export_model: Add timestep placeholder to model's signature, if needed. (#13988)
|
3 年之前 |
Sven Mika
|
eb0038612f
[RLlib] Extend on_learn_on_batch callback to allow for custom metrics to be added. (#13584)
|
3 年之前 |
Sven Mika
|
6f342a2221
[RLlib] Preparatory PR for: Documentation on Model Building. (#13260)
|
3 年之前 |
Sven Mika
|
391cdfae8c
[RLlib] Trajectory view API docs. (#12718)
|
3 年之前 |
Sven Mika
|
b2bcab711d
[RLlib] Attention Nets: tf (#12753)
|
3 年之前 |
Sven Mika
|
74c98ac38e
[RLlib] Issue 12244: Unable to restore multi-agent PPOTFPolicy's Model (from exported). (#12786)
|
3 年之前 |
Sven Mika
|
99c81c6795
[RLlib] Attention Net prep PR #3. (#12450)
|
3 年之前 |
Sven Mika
|
9021f15b2a
[RLlib] Fix setup-dev.py error when creating a softlink for new_dashboard. (#12442)
|
3 年之前 |
Sven Mika
|
6da4342822
[RLlib] Add on_learn_on_batch (Policy) callback to DefaultCallbacks. (#12070)
|
3 年之前 |
Sven Mika
|
62c7ab5182
[RLlib] Trajectory view API: Enable by default for PPO, IMPALA, PG, A3C (tf and torch). (#11747)
|
4 年之前 |
Sven Mika
|
d9f1874e34
[RLlib] Minor fixes (torch GPU bugs + some cleanup). (#11609)
|
4 年之前 |
Sven Mika
|
8ea1bc5ff9
[RLlib] Allow for more than 2^31 policy timesteps. (#11301)
|
4 年之前 |
Sven Mika
|
ce96b03b07
[RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033)
|
4 年之前 |
Sven Mika
|
2256047876
[RLlib] Rename rllib.utils.types into typing to match built-in python module's name. (#10114)
|
4 年之前 |
Barak Michener
|
8e76796fd0
ci: Redo `format.sh --all` script & backfill lint fixes (#9956)
|
4 年之前 |
Sven Mika
|
fcdf410ae1
[RLlib] Tf2.x native. (#8752)
|
4 年之前 |
Sven Mika
|
f43d934817
[RLlib] Type annotations for policy. (#9248)
|
4 年之前 |
Sven Mika
|
43043ee4d5
[RLlib] Tf2x preparation; part 2 (upgrading `try_import_tf()`). (#9136)
|
4 年之前 |
Sven Mika
|
25c0974543
[RLlib] Issue 8412 (Adam vars not stored in ModelV2). (#8480)
|
4 年之前 |
Sven Mika
|
6c2b9a4cfa
[RLlib] Remove tf.py_function from all Schedule classes (not differentiable and causes other bugs in MA setups). (#8304)
|
4 年之前 |
Sven Mika
|
5537fe13b0
[RLlib] Exploration API: ParamNoise Integration into DQN; working example/test cases. (#7814)
|
4 年之前 |
Sven Mika
|
e153e3179f
[RLlib] Exploration API: Policy changes needed for forward pass noisifications. (#7798)
|
4 年之前 |
Sven Mika
|
66df8b8c35
[RLlib] Working/learning example: PPO + torch + LSTM. (#7797)
|
4 年之前 |
Sven Mika
|
e4bd5db4d8
[RLlib] Minimal ParamNoise PR. (#7772)
|
4 年之前 |
Sven Mika
|
369a3417c4
[RLlib] Add tf-graph by default when doing `Policy.export_model()`. (#7759)
|
4 年之前 |
Sven Mika
|
1138f2ebed
[RLlib] Issue 7046 cannot restore keras model from h5 file. (#7482)
|
4 年之前 |
Sven Mika
|
2fb219a658
[Ray RLlib] Fix tree import (#7662)
|
4 年之前 |