Tagged articles
1 article
Ctrip Technology
Jul 23, 2020 · Artificial Intelligence

Inference Performance Optimization for AI Applications: Methods, Case Studies, and Future Directions

This article examines the challenges of deep learning inference, outlines general optimization methodologies—including system-level and model-level techniques—presents practical case studies such as Transformer translation model improvements, and discusses future trends in automated compilation and performance tuning for AI services.

AI inference · Deep Learning · TVM
15 min read