Real-Time Machine Learning: The Missing Pieces

Robert Nishihara, Philipp Moritz, Stephanie Wang, Alexey Tumanov, William Paul, Johann Schleier-Smith, Richard Liaw, Mehrdad Niknami, Michael I. Jordan, Ion Stoica
2017
2 references

Abstract

Machine learning applications are increasingly deployed not only to serve predictions using static models, but also as tightly-integrated components of feedback loops involving dynamic, real-time decision making. These applications pose a new set of requirements, none of which are difficult to achieve in isolation, but the combination of which creates a challenge for existing distributed execution frameworks: computation with millisecond latency at high throughput, adaptive construction of arbitrary task graphs, and execution of heterogeneous kernels over diverse sets of resources. We assert that a new distributed execution framework is needed for such ML applications and propose a candidate approach with a proof-of-concept architecture that achieves a 63x performance improvement over a state-of-the-art execution framework for a representative application.

1 repository
2 references

Code References

â–¶ ray-project/ray
2 files
â–¶ doc/source/ray-overview/getting-started.md
1
- [Ray HotOS paper (old)](https://arxiv.org/abs/1703.03924)
â–¶ README.rst
1
.. _`Ray HotOS paper`: https://arxiv.org/abs/1703.03924
Link copied to clipboard!