Avnish Narayan 684e28b380 [RLlib] RLlib deprecation Notices Part 1 (algorithm/, evaluation/, execution/, models/jax/) (#36826) | 1 年之前 | |
---|---|---|
.. | ||
tests | 827ab91741 [RLlib] Replace remaining mentions of "trainer" by "algorithm". (#36557) | 1 年之前 |
README.md | 3e053c85ee [RLlib] Fix broken links from agent -> algo conversion. (#25014) | 2 年之前 |
__init__.py | 8a9a176a24 [RLlib] Remove all default config objects and rllib/agents (#33242) | 1 年之前 |
pg.py | 684e28b380 [RLlib] RLlib deprecation Notices Part 1 (algorithm/, evaluation/, execution/, models/jax/) (#36826) | 1 年之前 |
pg_tf_policy.py | 827ab91741 [RLlib] Replace remaining mentions of "trainer" by "algorithm". (#36557) | 1 年之前 |
pg_torch_policy.py | 827ab91741 [RLlib] Replace remaining mentions of "trainer" by "algorithm". (#36557) | 1 年之前 |
utils.py | 905258dbc1 Clean up docstyle in python modules and add LINT rule (#25272) | 2 年之前 |
An implementation of a vanilla policy gradient algorithm for TensorFlow and PyTorch.