Compression Techniques for BERT: Analysis, Quantization, Pruning, Distillation, and Structure‑Preserving Methods
This article reviews BERT’s architecture, analyzes the storage and compute costs of each layer, and systematically surveys compression methods, including quantization, pruning, knowledge distillation (Distilled BiLSTM and MobileBERT), and structure‑preserving techniques, with the goal of enabling efficient deployment on resource‑constrained mobile devices.