.. |
exploration
|
026bf01071
[RLlib] Upgrade gym version to 0.21 and deprecate pendulum-v0. (#19535)
|
3 年之前 |
metrics
|
0b308719f8
[RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829)
|
3 年之前 |
schedules
|
8ea1bc5ff9
[RLlib] Allow for more than 2^31 policy timesteps. (#11301)
|
4 年之前 |
spaces
|
026bf01071
[RLlib] Upgrade gym version to 0.21 and deprecate pendulum-v0. (#19535)
|
3 年之前 |
tests
|
3f89f35e52
[RLlib] Better error messages and hints; + failure-mode tests; (#18466)
|
3 年之前 |
__init__.py
|
ed85f59194
[RLlib] Unify all RLlib Trainer.train() -> results[info][learner][policy ID][learner_stats] and add structure tests. (#18879)
|
3 年之前 |
actors.py
|
d49dadf891
Change Python's `ObjectID` to `ObjectRef` (#9353)
|
4 年之前 |
annotations.py
|
0b308719f8
[RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829)
|
3 年之前 |
compression.py
|
b804d40c04
Stop vendoring pyarrow (#7233)
|
4 年之前 |
debug.py
|
0b308719f8
[RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829)
|
3 年之前 |
deprecation.py
|
0b308719f8
[RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829)
|
3 年之前 |
error.py
|
026bf01071
[RLlib] Upgrade gym version to 0.21 and deprecate pendulum-v0. (#19535)
|
3 年之前 |
filter.py
|
61a1274619
[RLlib] No Preprocessors (part 2). (#18468)
|
3 年之前 |
filter_manager.py
|
9a83908c46
[rllib] Deprecate policy optimizers (#8345)
|
4 年之前 |
framework.py
|
cf21c634a3
[RLlib] Fix deprecated warning for torch_ops.py (soft-replaced by torch_utils.py). (#19982)
|
3 年之前 |
from_config.py
|
3f89f35e52
[RLlib] Better error messages and hints; + failure-mode tests; (#18466)
|
3 年之前 |
images.py
|
05c9dfbbda
[RLlib] CV2 to Skimage dependency change (#16841)
|
3 年之前 |
install_atari_roms.sh
|
ac5d255c9c
[rllib/docker] silent unzip of atari roms (#18340)
|
3 年之前 |
memory.py
|
0b308719f8
[RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829)
|
3 年之前 |
multi_agent.py
|
0b308719f8
[RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829)
|
3 年之前 |
numpy.py
|
f3397b6f48
[RLlib] Minor fixes/cleanups; chop_into_sequences now handles nested data. (#19408)
|
3 年之前 |
sgd.py
|
ed85f59194
[RLlib] Unify all RLlib Trainer.train() -> results[info][learner][policy ID][learner_stats] and add structure tests. (#18879)
|
3 年之前 |
test_utils.py
|
246787cdd9
Revert "[RLlib] POC: `PGTrainer` class that works by sub-classing, not `trainer_template.py`. (#20055)" (#20284)
|
2 年之前 |
tf_ops.py
|
0b308719f8
[RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829)
|
3 年之前 |
tf_run_builder.py
|
7588bfd315
[Lint] Add flake8-bugbear (#19053)
|
3 年之前 |
tf_utils.py
|
0b308719f8
[RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829)
|
3 年之前 |
threading.py
|
0b308719f8
[RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829)
|
3 年之前 |
timer.py
|
3c6b94f3f5
[rllib] Enable performance metrics reporting for RLlib pipelines, add A3C (#7299)
|
4 年之前 |
torch_ops.py
|
0b308719f8
[RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829)
|
3 年之前 |
torch_utils.py
|
0b308719f8
[RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829)
|
3 年之前 |
typing.py
|
b213565783
[RLlib] Fix failing test cases: Soft-deprecate ModelV2.from_batch (in favor of ModelV2.__call__). (#19693)
|
3 年之前 |
window_stat.py
|
0b308719f8
[RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829)
|
3 年之前 |