Sven Mika d5bfb7b7da [RLlib] Preparatory PR for multi-agent multi-GPU learner (alpha-star style) #03 (#21652) 2 年之前
..
collectors 6c3e63bc9c [RLlib] Fix view requirements. (#21043) 2 年之前
tests 12b087acb8 [RLlib] Base env pre-checker. (#21569) 2 年之前
__init__.py 9c73871da0 [RLlib; Docs overhaul] Docstring cleanup: Evaluation (#19783) 3 年之前
episode.py 596c8e2772 [RLlib] Experimental no-flatten option for actions/prev-actions. (#20918) 2 年之前
metrics.py 9c73871da0 [RLlib; Docs overhaul] Docstring cleanup: Evaluation (#19783) 3 年之前
observation_function.py 9c73871da0 [RLlib; Docs overhaul] Docstring cleanup: Evaluation (#19783) 3 年之前
postprocessing.py f82880eda1 Revert "Revert [RLlib] POC: Deprecate `build_policy` (policy template) for torch only; PPOTorchPolicy (#20061) (#20399)" (#20417) 2 年之前
rollout_worker.py d5bfb7b7da [RLlib] Preparatory PR for multi-agent multi-GPU learner (alpha-star style) #03 (#21652) 2 年之前
sample_batch_builder.py 0b308719f8 [RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829) 3 年之前
sampler.py 596c8e2772 [RLlib] Experimental no-flatten option for actions/prev-actions. (#20918) 2 年之前
worker_set.py d5bfb7b7da [RLlib] Preparatory PR for multi-agent multi-GPU learner (alpha-star style) #03 (#21652) 2 年之前