Tagged articles
6 articles
Page 1 of 1
DaTaobao Tech
DaTaobao Tech
Apr 21, 2025 · Artificial Intelligence

How MNN LLM Delivers Fast, Stable On‑Device LLM Inference for Android, iOS, and Desktop

Facing DeepSeek R1 server instability, the open‑source MNN LLM framework offers local, mobile‑friendly deployment with model quantization and hardware‑specific optimizations, dramatically improving inference speed, stability, and download reliability across Android, iOS, and desktop platforms while supporting multimodal inputs.

AndroidLLMMNN
0 likes · 11 min read
How MNN LLM Delivers Fast, Stable On‑Device LLM Inference for Android, iOS, and Desktop
DataFunSummit
DataFunSummit
Jun 5, 2021 · Artificial Intelligence

Compression Techniques for BERT: Analysis, Quantization, Pruning, Distillation, and Structure‑Preserving Methods

This article reviews BERT’s architecture, analyzes the storage and compute costs of each layer, and systematically presents compression methods—including quantization, pruning, knowledge distillation (Distilled BiLSTM and MobileBERT), and structure‑preserving techniques—aimed at enabling efficient deployment on resource‑constrained mobile devices.

BERTMobile Deploymentknowledge distillation
0 likes · 15 min read
Compression Techniques for BERT: Analysis, Quantization, Pruning, Distillation, and Structure‑Preserving Methods
Kuaishou Large Model
Kuaishou Large Model
Apr 1, 2021 · Artificial Intelligence

How Kuaishou Y‑Tech Leverages GANs for Real‑Time Face Attribute Editing in Short Videos

This article details Kuaishou Y‑Tech's practical deployment of GAN‑based high‑precision face attribute editing—covering gender, age, hair, and expression transformations—for short‑video effects, discussing background, business applications, technical challenges, and solutions across data preparation, model training, and mobile deployment.

Computer VisionGANKuaishou
0 likes · 15 min read
How Kuaishou Y‑Tech Leverages GANs for Real‑Time Face Attribute Editing in Short Videos
Tencent TDS Service
Tencent TDS Service
Jul 12, 2018 · Artificial Intelligence

How to Engineer MobileNet for Efficient Image Classification on Mobile Devices

This article details the engineering of MobileNet V1 for image classification on mobile terminals, covering its depthwise separable convolution architecture, data collection and preprocessing, model training with transfer learning, TensorFlow Lite conversion, deployment on iOS/Android, and GPU acceleration techniques for faster inference.

Deep LearningGPU AccelerationMobile Deployment
0 likes · 19 min read
How to Engineer MobileNet for Efficient Image Classification on Mobile Devices
Ctrip Technology
Ctrip Technology
Dec 19, 2015 · Operations

Ctrip Continuous Delivery Conference – “Publish When Ready” – Sharing Practices and Platforms

The Ctrip Continuous Delivery Conference held on December 19 showcased industry experts discussing large‑scale deployment, container‑based delivery, mobile automation, and the MCD platform, highlighting practical strategies for high‑availability, rapid release pipelines across web and mobile services.

ContainerMobile Deploymentrelease-automation
0 likes · 6 min read
Ctrip Continuous Delivery Conference – “Publish When Ready” – Sharing Practices and Platforms