README.rst 2.2 KB

12345678910111213141516171819202122232425262728293031323334353637383940414243444546474849505152535455565758596061
  1. Long Running Tests
  2. ==================
  3. This directory contains the long-running workloads which are intended to run
  4. forever until they fail. To set up the project you need to run
  5. .. code-block:: bash
  6. $ pip install anyscale
  7. $ anyscale init
  8. Note that all the long running test is running inside virtual environment, tensorflow_p36
  9. Running the Workloads
  10. ---------------------
  11. The easiest approach to running these workloads is to use the
  12. `Releaser`_ tool to run them with the command
  13. ``python cli.py suite:run long_running_tests``. By default, this
  14. will start a session to run each workload in the Anyscale product
  15. and kick them off.
  16. To run the tests manually, you can also use the `Anyscale UI <https://www.anyscale.dev/>`. First run ``anyscale snapshot create`` from the command line to create a project snapshot. Then from the UI, you can launch an individual session and execute the run command for each test.
  17. You can also start the workloads using the CLI with:
  18. .. code-block:: bash
  19. $ anyscale start
  20. $ anyscale run test_workload --workload=<WORKLOAD_NAME> --wheel=<RAY_WHEEL_LINK>
  21. Doing this for each workload will start one EC2 instance per workload and will start the workloads
  22. running (one per instance). A list of
  23. available workload options is available in the `ray_projects/project.yaml` file.
  24. Debugging
  25. ---------
  26. The primary method to debug the test while it is running is to view the logs and the dashboard from the UI. After the test has failed, you can still view the stdout logs in the UI and also inspect
  27. the logs under ``/tmp/ray/session*/logs/`` and
  28. ``/tmp/ray/session*/logs/debug_state.txt``.
  29. .. To check up on the workloads, run either
  30. .. ``anyscale session --name="*" execute check-load``, which
  31. .. will print the load on each machine, or
  32. .. ``anyscale session --name="*" execute show-output``, which
  33. .. will print the tail of the output for each workload.
  34. Shut Down the Workloads
  35. -----------------------
  36. The instances running the workloads can all be killed by running
  37. ``anyscale stop <SESSION_NAME>``.
  38. Adding a Workload
  39. -----------------
  40. To create a new workload, simply add a new Python file under ``workloads/`` and
  41. add the workload in the run command in `ray-project/project.yaml`.
  42. .. _`Releaser`: https://github.com/ray-project/releaser