Sven Mika
|
0f62ccc6ee
[RLlib] Clean up some deprecation messages (they shouldn't be there) and make others `error=True` (from `error=False`) (#38555)
|
1 年之前 |
Sven Mika
|
8e680c483c
[RLlib] gymnasium support (new `Env.reset()/step()/seed()/render()` APIs). (#28369)
|
1 年之前 |
Sven Mika
|
130b7eeaba
[RLlib] `Trainer` to `Algorithm` renaming. (#25539)
|
2 年之前 |
Balaji Veeramani
|
7f1bacc7dc
[CI] Format Python code with Black (#21975)
|
2 年之前 |
Sven Mika
|
0b308719f8
[RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829)
|
3 年之前 |
Sven Mika
|
5107d16ae5
[RLlib] Add @Deprecated decorator to simplify/unify deprecation of classes, methods, functions. (#17530)
|
3 年之前 |
Michael Luo
|
4cbe13cdfd
[RLlib] CQL loss fn fixes, MuJoCo + Pendulum benchmarks, offline-RL example script w/ json file. (#15603)
|
3 年之前 |
Amog Kamsetty
|
ebc44c3d76
[CI] Upgrade flake8 to 3.9.1 (#15527)
|
3 年之前 |
Sven Mika
|
99ae7bae05
[RLlib] JAXPolicy prep. PR #1. (#13077)
|
3 年之前 |
Sven Mika
|
99c81c6795
[RLlib] Attention Net prep PR #3. (#12450)
|
3 年之前 |
Sven Mika
|
19c8033df2
[RLlib] Fix most remaining RLlib algos for running with trajectory view API. (#12366)
|
3 年之前 |
Sven Mika
|
fb318addcb
[RLlib] Curiosity exploration module: tf/tf2.x/tf-eager support. (#11945)
|
3 年之前 |
Sven Mika
|
62c7ab5182
[RLlib] Trajectory view API: Enable by default for PPO, IMPALA, PG, A3C (tf and torch). (#11747)
|
4 年之前 |
Sven Mika
|
5b788ccb13
[RLlib] Trajectory view API (prep PR for switching on by default across all RLlib; plumbing only) (#11717)
|
4 年之前 |
Sven Mika
|
0c0f67c14d
[RLlib] ARS/ES eval workers not working: Issue 9933. (#11308)
|
4 年之前 |
Sven Mika
|
c17169dc11
[RLlib] Fix all example scripts to run on GPUs. (#11105)
|
4 年之前 |
Sven Mika
|
36bda8432b
[RLlib] Trajectory view API: Simple List Collector (on by default for PPO); LSTM-agnostic (#11056)
|
4 年之前 |
Sven Mika
|
805dad3bc4
[RLlib] SAC algo cleanup. (#10825)
|
4 年之前 |
desktable
|
799318d7d7
[RLlib] Add type annotations for agents/dqn (#10626)
|
4 年之前 |
Sven Mika
|
e968b52cb7
[RLlib] Trajectory view API - 03 Fast LSTM + prev actions/rewards (#9950)
|
4 年之前 |
Sven Mika
|
d14b501692
[RLlib] First attempt at cleaning up algo code in RLlib: PG. (#10115)
|
4 年之前 |
Sven Mika
|
2cbe29a7fa
[RLlib] Curiosity minor fixes, do-overs, and testing. (#10143)
|
4 年之前 |
Eric Liang
|
ca133e2699
[rllib] Remove extra model config kwargs passed incorrectly for Torch models (#10055)
|
4 年之前 |
Sven Mika
|
2256047876
[RLlib] Rename rllib.utils.types into typing to match built-in python module's name. (#10114)
|
4 年之前 |
Tanay Wakhare
|
1826b29757
[RLlib] Curiosity (intrinsic motivation) Exploration module. (#9912)
|
4 年之前 |
Barak Michener
|
8e76796fd0
ci: Redo `format.sh --all` script & backfill lint fixes (#9956)
|
4 年之前 |
Sven Mika
|
57690a3a9f
[RLlib] Trajectory view API - 02 actual API scaffold (#9753)
|
4 年之前 |
Sven Mika
|
9b90f7db67
[RLlib] Missing type annotations policy templates. (#9846)
|
4 年之前 |
Eric Liang
|
5acd3e66dd
[rllib] Fix torch TD error, IMPALA LR updates (#9477)
|
4 年之前 |
Sven Mika
|
4fd8977eaf
[RLlib] Minor cleanup in preparation to tf2.x support. (#9130)
|
4 年之前 |