COMPUTER SCIENCE TECHNICAL REPORT ABSTRACTS

CMU-CS-20-105
Computer Science Department
School of Computer Science, Carnegie Mellon University

CMU-CS-20-105

Learned Adaptive Accuracy-Cost Optimization
for Machine Learning Systems

Conglong Li

Ph.D. Thesis

May 2020

Keywords: Machine Learning, Information Retrieval, Recommendation Systems, Cache, Similarity Search, Approximate Nearest Neighbor Search, Sponsored Search, Search Advertising, Workload Analysis

This dissertation seeks to address the challenge of making adaptiv eaccuracy cost balancing inside systems for large-scale machine learning-based recommendation services. We show that it is important to make performance tradeoff decisions at a per-query basis instead of a predefined policy for all queries. We show that we can achieve a better tradeoff between accuracy and cost by leveraging lightweight machine learning models to make more adaptive decision-making inside systems infrastructure.

Large-scale recommendation services have two computation-heavy components with strict accuracy and latency targets: scoring (typically achieved by complex machine learning models) and candidate retrieval (typically achieved by approximate nearest neighborsearch). We first introduce a caching system for scoring component in recommendation systems (in particular search advertising systems). Inside the cache, we leverage lightweight machine learning models to make adaptive cache refresh decisions, which provides a better balance between recommendation accuracy and computation cost. This leads to a better net profit in the search advertising context. We then present the learned adaptive termination for approximate nearest neighbor search inside the candidate retrieval component. We leverage lightweight machine learning models to decide how much to search for each query, which provides a better balance between the search accuracy and latency (computation cost).

119 pages

Thesis Committee:
David G. Andersen (Chair)
Michael Kaminsky
Rashmi K. Vinayak
Yuxiong He (Microsoft AI and Research)

Srinivasan Seshan, Head, Computer Science Department
Martial Hebert, Dean, School of Computer Science

Return to: SCS Technical Report Collection
School of Computer Science

This page maintained by reports@cs.cmu.edu