Vulcan: Automatic Query Planning for Live ML Analytics

Yiwen Zhang

Xumiao Zhang

Ganesh Ananthanarayanan

Anand Iyer

Yuanchao Shu

Victor Bahl

Z. Morley Mao

Mosharaf Chowdhury

Published at USENIX NSDI 2024

Abstract

Live ML analytics have gained increasing popularity with large-scale deployments due to recent evolution of ML technologies. To serve live ML queries, experts nowadays still need to perform manual query planning, which involves pipeline construction, query configuration, and pipeline placement across multiple edge tiers in a heterogeneous infrastructure. Finding the best query plan for a live ML query requires navigating a huge search space, calling for an efficient and systematic solution. In this paper, we propose Vulcan, a system that automatically generates query plans for live ML queries to optimize their accuracy, latency, and resource consumption. Based on the user query and performance requirements, Vulcan determines the best pipeline, placement, and query configuration for the query with low profiling cost; it also performs fast online adaptation after query deployment. Vulcan outperforms state-of-the-art ML analytics systems by 4.1×-30.1× in terms of search cost while delivering up to 3.3× better query latency.

Vulcan: Automatic Query Planning for Live ML Analytics

Abstract

Materials