Max van Dijck 232c331ce3 [RLlib] Rename all np.product usage to np.prod (#46317) | 3 月之前 | |
---|---|---|
.. | ||
examples | 471e15d1c6 [RLlib-contrib] ES (evolutionary strategies). (#36625) | 1 年之前 |
src | 471e15d1c6 [RLlib-contrib] ES (evolutionary strategies). (#36625) | 1 年之前 |
tests | 471e15d1c6 [RLlib-contrib] ES (evolutionary strategies). (#36625) | 1 年之前 |
tuned_examples | a9ac55d4f2 [RLlib; RLlib contrib] Move `tuned_examples` into rllib_contrib and remove CI learning tests for contrib algos. (#40444) | 1 年之前 |
BUILD | a9ac55d4f2 [RLlib; RLlib contrib] Move `tuned_examples` into rllib_contrib and remove CI learning tests for contrib algos. (#40444) | 1 年之前 |
README.md | 471e15d1c6 [RLlib-contrib] ES (evolutionary strategies). (#36625) | 1 年之前 |
pyproject.toml | 232c331ce3 [RLlib] Rename all np.product usage to np.prod (#46317) | 3 月之前 |
requirements.txt | 232c331ce3 [RLlib] Rename all np.product usage to np.prod (#46317) | 3 月之前 |
ES is a class of black box optimization algorithms, as an alternative to popular MDP-based RL techniques such as Q-learning and Policy Gradients. It is invariant to action frequency and delayed rewards, tolerant of extremely long horizons, and does not need temporal discounting or value function approximation.
conda create -n rllib-es python=3.10
conda activate rllib-es
pip install -r requirements.txt
pip install -e '.[development]'
[ES Example]()