The Runtime team develops and maintains the Ray C++ backend (e.g., the distributed scheduler, language runtime integration, and the I/O and memory subsystems). We are responsible for the reliability, scalability, and performance of Ray, and for ensuring that Ray provides the right feature set to support higher-level libraries and use cases. The team's work balances new features and distributed libraries, test infrastructure improvements, debugging, and longer-term architectural improvements to Ray.
A snapshot of projects you can work on:
- Optimizing performance of large-scale workloads on Ray
- Stability and stress testing infrastructure
- Improving fault tolerance and high availability (HA)