Vulcan: Automatic Query Planning for Live ML Analytics

Picture of Yiwen Zhang
Yiwen Zhang
Picture of Xumiao Zhang
Xumiao Zhang
Picture of Ganesh Ananthanarayanan
Ganesh Ananthanarayanan
Picture of Yuanchao Shu
Yuanchao Shu
Picture of Victor Bahl
Victor Bahl
Picture of Z. Morley Mao
Z. Morley Mao
Picture of Mosharaf Chowdhury
Mosharaf Chowdhury
Published at USENIX NSDI 2024

Abstract

Live ML analytics have gained increasing popularity with large-scale deployments due to recent evolution of ML technologies. To serve live ML queries, experts nowadays still need to perform manual query planning, which involves pipeline construction, query configuration, and pipeline placement across multiple edge tiers in a heterogeneous infrastructure. Finding the best query plan for a live ML query requires navigating a huge search space, calling for an efficient and systematic solution. In this paper, we propose Vulcan, a system that automatically generates query plans for live ML queries to optimize their accuracy, latency, and resource consumption. Based on the user query and performance requirements, Vulcan determines the best pipeline, placement, and query configuration for the query with low profiling cost; it also performs fast online adaptation after query deployment. Vulcan outperforms state-of-the-art ML analytics systems by 4.1×-30.1× in terms of search cost while delivering up to 3.3× better query latency.

Materials