Kai Fricke 246787cdd9 Revert "[RLlib] POC: `PGTrainer` class that works by sub-classing, not `trainer_template.py`. (#20055)" (#20284) 2 years ago
..
exploration 026bf01071 [RLlib] Upgrade gym version to 0.21 and deprecate pendulum-v0. (#19535) 3 years ago
metrics 0b308719f8 [RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829) 3 years ago
schedules 8ea1bc5ff9 [RLlib] Allow for more than 2^31 policy timesteps. (#11301) 4 years ago
spaces 026bf01071 [RLlib] Upgrade gym version to 0.21 and deprecate pendulum-v0. (#19535) 3 years ago
tests 3f89f35e52 [RLlib] Better error messages and hints; + failure-mode tests; (#18466) 3 years ago
__init__.py ed85f59194 [RLlib] Unify all RLlib Trainer.train() -> results[info][learner][policy ID][learner_stats] and add structure tests. (#18879) 3 years ago
actors.py d49dadf891 Change Python's `ObjectID` to `ObjectRef` (#9353) 4 years ago
annotations.py 0b308719f8 [RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829) 3 years ago
compression.py b804d40c04 Stop vendoring pyarrow (#7233) 4 years ago
debug.py 0b308719f8 [RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829) 3 years ago
deprecation.py 0b308719f8 [RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829) 3 years ago
error.py 026bf01071 [RLlib] Upgrade gym version to 0.21 and deprecate pendulum-v0. (#19535) 3 years ago
filter.py 61a1274619 [RLlib] No Preprocessors (part 2). (#18468) 3 years ago
filter_manager.py 9a83908c46 [rllib] Deprecate policy optimizers (#8345) 4 years ago
framework.py cf21c634a3 [RLlib] Fix deprecated warning for torch_ops.py (soft-replaced by torch_utils.py). (#19982) 3 years ago
from_config.py 3f89f35e52 [RLlib] Better error messages and hints; + failure-mode tests; (#18466) 3 years ago
images.py 05c9dfbbda [RLlib] CV2 to Skimage dependency change (#16841) 3 years ago
install_atari_roms.sh ac5d255c9c [rllib/docker] silent unzip of atari roms (#18340) 3 years ago
memory.py 0b308719f8 [RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829) 3 years ago
multi_agent.py 0b308719f8 [RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829) 3 years ago
numpy.py f3397b6f48 [RLlib] Minor fixes/cleanups; chop_into_sequences now handles nested data. (#19408) 3 years ago
sgd.py ed85f59194 [RLlib] Unify all RLlib Trainer.train() -> results[info][learner][policy ID][learner_stats] and add structure tests. (#18879) 3 years ago
test_utils.py 246787cdd9 Revert "[RLlib] POC: `PGTrainer` class that works by sub-classing, not `trainer_template.py`. (#20055)" (#20284) 2 years ago
tf_ops.py 0b308719f8 [RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829) 3 years ago
tf_run_builder.py 7588bfd315 [Lint] Add flake8-bugbear (#19053) 3 years ago
tf_utils.py 0b308719f8 [RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829) 3 years ago
threading.py 0b308719f8 [RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829) 3 years ago
timer.py 3c6b94f3f5 [rllib] Enable performance metrics reporting for RLlib pipelines, add A3C (#7299) 4 years ago
torch_ops.py 0b308719f8 [RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829) 3 years ago
torch_utils.py 0b308719f8 [RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829) 3 years ago
typing.py b213565783 [RLlib] Fix failing test cases: Soft-deprecate ModelV2.from_batch (in favor of ModelV2.__call__). (#19693) 3 years ago
window_stat.py 0b308719f8 [RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829) 3 years ago