Sven Mika
|
ad88af2e2e
[RLlib] Remove deprecation warnings in `utils/exploration` directory. (#37404)
|
1 year ago |
Avnish Narayan
|
49342d58ec
[RLlib] RLlib deprecation Notices Part 2 (models/tf, models/torch, base_mode, catalog, modelv2, models/temp_spec_classes, policy/) (#36840)
|
1 year ago |
Sven Mika
|
8e680c483c
[RLlib] gymnasium support (new `Env.reset()/step()/seed()/render()` APIs). (#28369)
|
1 year ago |
Sven Mika
|
432f023642
[RLlib] Deprecate `AlgorithmConfig.framework("tfe")`: Use `tf2` instead. (#29755)
|
2 years ago |
Eric Liang
|
905258dbc1
Clean up docstyle in python modules and add LINT rule (#25272)
|
2 years ago |
Eric Liang
|
4963dfaae0
[api] Add API stability annotations for all RLlib symbols and add to LINT (#25060)
|
2 years ago |
Sven Mika
|
6522935291
[RLlib] Slate-Q tf implementation and tests/benchmarks. (#22389)
|
2 years ago |
Balaji Veeramani
|
7f1bacc7dc
[CI] Format Python code with Black (#21975)
|
2 years ago |
Sven Mika
|
e485aa846a
[RLlib; Docs overhaul] Overhaul of auto-API reference pages (via sphinx autoclass/automodule). (#19786)
|
2 years ago |
Sven Mika
|
cf21c634a3
[RLlib] Fix deprecated warning for torch_ops.py (soft-replaced by torch_utils.py). (#19982)
|
3 years ago |
Sven Mika
|
2d24ef0d32
[RLlib] Add all simple learning tests as `framework=tf2`. (#19273)
|
3 years ago |
Sven Mika
|
d0014cd351
[RLlib] Policies get/set_state fixes and enhancements. (#16354)
|
3 years ago |
Sven Mika
|
cecfc3b43b
[RLlib] Multi-GPU support for Torch algorithms. (#14709)
|
3 years ago |
dHannasch
|
8346dedc3a
Fix the linter failure. (#11755)
|
4 years ago |
bcahlit
|
26176ec570
[RLlib] Fix epsilon_greedy on nested_action_spaces only in pytorch (#11453)
|
4 years ago |
Sven Mika
|
8ea1bc5ff9
[RLlib] Allow for more than 2^31 policy timesteps. (#11301)
|
4 years ago |
Sven Mika
|
199e5d0f75
[RLlib] Exploration class type annotations. (#11251)
|
4 years ago |
Sven Mika
|
c17169dc11
[RLlib] Fix all example scripts to run on GPUs. (#11105)
|
4 years ago |
Eric Liang
|
deea1861ab
[rllib] Try fixing torch GPU and masking errors (#10168)
|
4 years ago |
Barak Michener
|
8e76796fd0
ci: Redo `format.sh --all` script & backfill lint fixes (#9956)
|
4 years ago |
Sven Mika
|
78dfed2683
[RLlib] Issue 8384: QMIX doesn't learn anything. (#9527)
|
4 years ago |
Sven Mika
|
fcdf410ae1
[RLlib] Tf2.x native. (#8752)
|
4 years ago |
Sven Mika
|
01125b8fcf
[RLlib] DQN rainbow eager-mode (keras style NoisyLayer) (preparation for native tf2.x support). (#9304)
|
4 years ago |
Sven Mika
|
43043ee4d5
[RLlib] Tf2x preparation; part 2 (upgrading `try_import_tf()`). (#9136)
|
4 years ago |
Sven Mika
|
5f278c6411
[RLlib] Examples folder restructuring (models) part 1 (#8353)
|
4 years ago |
Sven Mika
|
6c2b9a4cfa
[RLlib] Remove tf.py_function from all Schedule classes (not differentiable and causes other bugs in MA setups). (#8304)
|
4 years ago |
Sven Mika
|
428516056a
[RLlib] SAC Torch (incl. Atari learning) (#7984)
|
4 years ago |
Sven Mika
|
e153e3179f
[RLlib] Exploration API: Policy changes needed for forward pass noisifications. (#7798)
|
4 years ago |
Eric Liang
|
be48e1964b
[rllib] Fix per-worker exploration in Ape-X; make more kwargs required for future safety (#7504)
|
4 years ago |
Eric Liang
|
596b39e36a
[rllib] Make timestep a required arg for exploration classes (#7380)
|
4 years ago |