Avnish Narayan 684e28b380 [RLlib] RLlib deprecation Notices Part 1 (algorithm/, evaluation/, execution/, models/jax/) (#36826) 1 年之前
..
tests 827ab91741 [RLlib] Replace remaining mentions of "trainer" by "algorithm". (#36557) 1 年之前
README.md 3e053c85ee [RLlib] Fix broken links from agent -> algo conversion. (#25014) 2 年之前
__init__.py 8a9a176a24 [RLlib] Remove all default config objects and rllib/agents (#33242) 1 年之前
pg.py 684e28b380 [RLlib] RLlib deprecation Notices Part 1 (algorithm/, evaluation/, execution/, models/jax/) (#36826) 1 年之前
pg_tf_policy.py 827ab91741 [RLlib] Replace remaining mentions of "trainer" by "algorithm". (#36557) 1 年之前
pg_torch_policy.py 827ab91741 [RLlib] Replace remaining mentions of "trainer" by "algorithm". (#36557) 1 年之前
utils.py 905258dbc1 Clean up docstyle in python modules and add LINT rule (#25272) 2 年之前

README.md

Policy Gradient (PG)

An implementation of a vanilla policy gradient algorithm for TensorFlow and PyTorch.

Detailed Documentation

Implementation