Sven Mika 92f030331e [RLlib] Initial code/comment cleanups in preparation for decentralized multi-agent learner. (#21420) 2 年之前
..
tests daa4304a91 [RLlib] Switch off preprocessors by default for PGTrainer. (#21008) 2 年之前
__init__.py 99ae7bae05 [RLlib] JAXPolicy prep. PR #1. (#13077) 3 年之前
dynamic_tf_policy.py 596c8e2772 [RLlib] Experimental no-flatten option for actions/prev-actions. (#20918) 2 年之前
eager_tf_policy.py f82880eda1 Revert "Revert [RLlib] POC: Deprecate `build_policy` (policy template) for torch only; PPOTorchPolicy (#20061) (#20399)" (#20417) 2 年之前
policy.py 92f030331e [RLlib] Initial code/comment cleanups in preparation for decentralized multi-agent learner. (#21420) 2 年之前
policy_map.py e37afe0425 [RLlib; Docs] Auto API reference pages overhaul: `rllib/policy` and `rllib/agents` packages. (#20537) 2 年之前
policy_template.py a8286c55af [RLLib] Fix deprecated convert_to_non_torch_type (#20751) 2 年之前
rnn_sequencing.py 92f030331e [RLlib] Initial code/comment cleanups in preparation for decentralized multi-agent learner. (#21420) 2 年之前
sample_batch.py 92f030331e [RLlib] Initial code/comment cleanups in preparation for decentralized multi-agent learner. (#21420) 2 年之前
tf_policy.py 596c8e2772 [RLlib] Experimental no-flatten option for actions/prev-actions. (#20918) 2 年之前
tf_policy_template.py 9c73871da0 [RLlib; Docs overhaul] Docstring cleanup: Evaluation (#19783) 3 年之前
torch_policy.py 92f030331e [RLlib] Initial code/comment cleanups in preparation for decentralized multi-agent learner. (#21420) 2 年之前
torch_policy_template.py 0b308719f8 [RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829) 3 年之前
view_requirement.py 04bc0a9828 [RLlib] Remove all non-trajectory view API code. (#14860) 3 年之前