DataFunSummit
Jul 4, 2023 · Artificial Intelligence
PPL: A Full‑Platform Deep Learning Deployment Framework by SenseTime
The article presents SenseTime's PPL framework, detailing its toolchain, inference engine, multi‑backend operator library, quantization tools, CUDA optimizations, performance benchmarks across CPUs, GPUs, DSPs and DSAs, and outlines future plans for broader chip support and AI for Science.
AI inferenceCUDA OptimizationDeep Learning Deployment
0 likes · 23 min read