Sven Mika
|
e74947cc94
[RLlib] Env directory cleanup and tests. (#13082)
|
3 年之前 |
Sven Mika
|
d49c3fae0b
[RLlib] Trajectory View API: Atari framestacking. (#13315)
|
3 年之前 |
Sven Mika
|
391cdfae8c
[RLlib] Trajectory view API docs. (#12718)
|
3 年之前 |
Sven Mika
|
e40b14d255
[RLlib] Batch-size for truncate_episode batch_mode should be confgurable in agent-steps (rather than env-steps), if needed. (#12420)
|
3 年之前 |
Sven Mika
|
99c81c6795
[RLlib] Attention Net prep PR #3. (#12450)
|
3 年之前 |
Sven Mika
|
19c8033df2
[RLlib] Fix most remaining RLlib algos for running with trajectory view API. (#12366)
|
3 年之前 |
Sven Mika
|
3ad9365e1d
[RLlib] Attention Net prep PR #2: Smaller cleanups. (#12449)
|
3 年之前 |
Sven Mika
|
0df55a139c
[RLlib] Attention Net prep PR #1: Smaller cleanups. (#12447)
|
3 年之前 |
Sven Mika
|
dab241dcc6
[RLlib] Fix inconsistency wrt batch size in SampleCollector (traj. view API). Makes DD-PPO work with traj. view API. (#12063)
|
3 年之前 |
Sven Mika
|
5b788ccb13
[RLlib] Trajectory view API (prep PR for switching on by default across all RLlib; plumbing only) (#11717)
|
4 年之前 |
Sven Mika
|
36bda8432b
[RLlib] Trajectory view API: Simple List Collector (on by default for PPO); LSTM-agnostic (#11056)
|
4 年之前 |
Sven Mika
|
805dad3bc4
[RLlib] SAC algo cleanup. (#10825)
|
4 年之前 |
Sven Mika
|
4b278c36fc
[RLlib] Behavioral Cloning (from MARWIL). (#10619)
|
4 年之前 |
Sven Mika
|
244aafdcf8
[RLlib] Curiosity enhancements. (#10373)
|
4 年之前 |
Sven Mika
|
715ee8dfc9
[RLlib] Issue 10469: Callbacks should receive env idx ... (#10477)
|
4 年之前 |
Sven Mika
|
e968b52cb7
[RLlib] Trajectory view API - 03 Fast LSTM + prev actions/rewards (#9950)
|
4 年之前 |
Sven Mika
|
2256047876
[RLlib] Rename rllib.utils.types into typing to match built-in python module's name. (#10114)
|
4 年之前 |
Barak Michener
|
8e76796fd0
ci: Redo `format.sh --all` script & backfill lint fixes (#9956)
|
4 年之前 |
Miguel Morales
|
372114b4ed
Update sampler.py (#9805)
|
4 年之前 |
Sven Mika
|
b0b0463161
[RLlib] Trajectory View API (preparatory cleanup and enhancements). (#9678)
|
4 年之前 |
Sven Mika
|
03ab86567f
[RLlib] Layout of Trajectory View API (new class: Trajectory; not used yet). (#9269)
|
4 年之前 |
Sven Mika
|
5b2a97597b
[RLlib] Retire `try_import_tree` (should be installed along with other requirements). (#9211)
|
4 年之前 |
Eric Liang
|
1e0e1a45e6
[rllib] Add type annotations for evaluation/, env/ packages (#9003)
|
4 年之前 |
Sven Mika
|
8d1ccfd0f7
[RLlib] Issue 8889: action clipping bug ppo not learning mujoco (#8898)
|
4 年之前 |
Sven Mika
|
368088be85
[RLlib] Sample batch docs and cleanup. (#8778)
|
4 年之前 |
Sven Mika
|
d8a081a185
[RLlib] Unity3D integration (n Unity3D clients vs learning server). (#8590)
|
4 年之前 |
Sven Mika
|
6d196197bc
[RLlib] utils/spaces ... (#8608)
|
4 年之前 |
Sven Mika
|
d76578700d
[RLlib] `Policy.compute_single_action()` broken for nested actions (Issue 8411). (#8514)
|
4 年之前 |
Eric Liang
|
f48da50e1c
[rllib] observation function api for multi-agent (#8236)
|
4 年之前 |
Sven Mika
|
1775e89f26
[RLlib] Remove TupleActions and support arbitrarily nested action spaces. (#8143)
|
4 年之前 |