Tagged articles

AI platform

114 articles · Page 1 of 2

Jul 1, 2026 · Artificial Intelligence

How to Prevent AI Platform Price Hikes and Downtime with a Workflow Decoupling & Migration SOP

The article reports a Q3 test showing that tightly coupling AI workflows to a single platform can cause up to 24 hours of downtime, then presents a three‑step “configuration extraction + smooth routing” protocol that reduces migration time to under two hours, cuts error rates to zero, and improves architectural resilience.

AI platformconfiguration extractionmigration SOP

0 likes · 8 min read

How to Prevent AI Platform Price Hikes and Downtime with a Workflow Decoupling & Migration SOP

DataFunTalk

Jun 28, 2026 · Artificial Intelligence

How Knora Uses Ontology + Large Models to Overcome Hallucination and Execution Gaps in Enterprise AI

The article presents Knora 4.0, an ontology‑enhanced AI platform that tackles six enterprise AI challenges—hallucination, instability, weak planning, poor responsiveness, data integration, and long cold‑start—by tightly coupling domain ontologies with large language models, detailing its architecture, autonomous agents, real‑world LED production line use case, roadmap, and expert round‑table insights.

AI platformAutonomous AgentsEnterprise AI

0 likes · 15 min read

How Knora Uses Ontology + Large Models to Overcome Hallucination and Execution Gaps in Enterprise AI

Alibaba Cloud Infrastructure

Jun 24, 2026 · Cloud Native

How a 3‑Person Team Got 12k Users Without Marketing Using OSS Vector Bucket for a Low‑Cost AI Platform

A three‑person startup built Matrees, an AI‑driven world‑building platform, by switching from a self‑hosted open‑source vector database to Alibaba Cloud’s fully managed OSS Vector Bucket, cutting infrastructure costs by about 90 %, eliminating maintenance overhead, and organically attracting over 12,000 users who generated more than 45 million words of content.

AI platformOSS Vector BucketRAG

0 likes · 8 min read

How a 3‑Person Team Got 12k Users Without Marketing Using OSS Vector Bucket for a Low‑Cost AI Platform

DataFunTalk

Jun 12, 2026 · Artificial Intelligence

How Ontology + Large Models Enable Knora to Tackle Hallucinations and Execution Gaps in Enterprise AI

The article explains how Knora 4.0 combines ontology with large‑model AI to move enterprise applications from isolated chat bots to autonomous, end‑to‑end systems, addressing six major challenges such as hallucinations, unstable outputs, weak planning, poor responsiveness, data integration difficulty, and long cold‑start cycles, and demonstrates the approach with real LED‑line use cases, architectural details, and a roadmap for future autonomous agents.

AI platformAutonomous AgentsEnterprise AI

0 likes · 17 min read

How Ontology + Large Models Enable Knora to Tackle Hallucinations and Execution Gaps in Enterprise AI

DataFunTalk

Jun 6, 2026 · Artificial Intelligence

How Knora Uses Ontology + Large Models to Overcome Enterprise AI Hallucinations and Execution Gaps

The article explains how Knora 4.0 combines ontology with large‑model AI to address six core challenges of enterprise AI—hallucinations, unstable output, weak planning, poor responsiveness, data integration, and long cold‑start—by structuring business knowledge, defining executable actions, and deploying autonomous agents that close the analysis‑decision‑execution loop.

AI platformAutonomous AgentsEnterprise AI

0 likes · 16 min read

How Knora Uses Ontology + Large Models to Overcome Enterprise AI Hallucinations and Execution Gaps

DataFunTalk

May 27, 2026 · Artificial Intelligence

How Knora Combines Ontology and Large Models to Overcome Hallucinations and Execution Gaps in Enterprise AI

The article analyzes how Knora 4.0 integrates enterprise ontologies with large‑model AI to address six core challenges—hallucinations, unstable outputs, weak planning, poor responsiveness, data silos, and long cold‑start cycles—by detailing its layered architecture, autonomous agent Knora Claw, real‑world LED‑line case studies, and a three‑year roadmap toward fully autonomous enterprise systems.

AI platformAutonomous AgentsEnterprise AI

0 likes · 17 min read

How Knora Combines Ontology and Large Models to Overcome Hallucinations and Execution Gaps in Enterprise AI

DataFunSummit

May 25, 2026 · Big Data

How Hisense Built an AI‑Ready Multimodal Data Platform: Storage, Governance, and Development

This article details Hisense's journey to create an AI‑ready multimodal data platform, covering the challenges of integrating diverse business systems, the shift from a Hadoop‑based architecture to a cloud‑native data lake, the JuData governance and development platform, and six practical scenarios that demonstrate unified ingestion, metadata management, rule‑based quality control, intelligent asset retrieval, and future AI‑driven DataOps capabilities.

AI platformCloud NativeData Governance

0 likes · 23 min read

How Hisense Built an AI‑Ready Multimodal Data Platform: Storage, Governance, and Development

DataFunTalk

May 16, 2026 · Artificial Intelligence

How Knora Combines Ontology and Large Models to Overcome AI Hallucinations and Execution Gaps in Enterprises

The article explains how YueDian Technology's Knora 4.0 platform fuses domain ontologies with large‑model AI to create a unified, trustworthy, and autonomous enterprise AI system that addresses hallucination, data integration, and execution challenges across complex business scenarios.

AI platformAutonomous AgentsEnterprise AI

0 likes · 14 min read

How Knora Combines Ontology and Large Models to Overcome AI Hallucinations and Execution Gaps in Enterprises

Machine Heart

May 13, 2026 · Artificial Intelligence

How an 8‑Year‑Old Built a Prototype OS and Native App Using Just a Phone and AI

An 8‑year‑old child turned his hand‑drawn OS concept into a working prototype with Baidu's Miaodao 3.0, while the article analyses how the platform’s four‑dimensional upgrade lowers the barrier to AI‑driven native app creation for both individuals and enterprises.

AI platformAI programmingEnterprise AI

0 likes · 18 min read

How an 8‑Year‑Old Built a Prototype OS and Native App Using Just a Phone and AI

DataFunSummit

Apr 29, 2026 · Industry Insights

Beyond the Data Rear‑view Mirror: Palantir’s Strategic Value and Real‑World Cases

Palantir leverages its Ontology‑driven data integration and AI platforms—Gotham, Foundry, and AIP—to transform fragmented data into actionable intelligence, delivering decision‑making advantages in government, aerospace, food, and energy sectors, while shifting from custom‑heavy services to an open, platform‑based ecosystem.

AI agentsAI platformData Integration

0 likes · 11 min read

Beyond the Data Rear‑view Mirror: Palantir’s Strategic Value and Real‑World Cases

DataFunTalk

Apr 27, 2026 · Artificial Intelligence

Ontology + Large Model: How Knora Tackles Enterprise AI Hallucination and Execution Gaps

The article analyses how Knora 4.0 combines enterprise ontologies with large‑model AI to eliminate hallucinations, provide stable semantic constraints, and enable end‑to‑end autonomous execution across complex business scenarios, illustrated with LED production‑line use cases and a detailed platform architecture.

AI platformAutonomous AgentsEnterprise AI

0 likes · 17 min read

Ontology + Large Model: How Knora Tackles Enterprise AI Hallucination and Execution Gaps

DataFunSummit

Apr 26, 2026 · Industry Insights

Why Palantir AIP Is More Than a Data Platform – The Secret ‘Implementation Orchestration Machine’

The article analyzes how Palantir’s ontology‑driven platforms—Gotham, Foundry, and the 2023 AI Platform (AIP)—break data silos, enable real‑time decision making, and shift the company from custom‑heavy solutions to a low‑code, AI‑agent‑centric ecosystem, illustrated with military, aerospace, and retail case studies.

AI platformAIPData Integration

0 likes · 10 min read

Why Palantir AIP Is More Than a Data Platform – The Secret ‘Implementation Orchestration Machine’

DataFunTalk

Apr 25, 2026 · Artificial Intelligence

How Palantir Ontology Modeling Turns Real Estate Ops into an AI‑Driven Enterprise

Healthpeak, a large medical‑real‑estate REIT, replaced fragmented spreadsheets and manual data entry with Palantir AIP’s ontology‑driven AI operating system, achieving automated billing, voice‑driven workflows, reduced errors, and a scalable, data‑centric operation that frees managers to focus on tenant relationships.

AI platformAutomationEnterprise AI

0 likes · 17 min read

How Palantir Ontology Modeling Turns Real Estate Ops into an AI‑Driven Enterprise

AntData

Apr 17, 2026 · Industry Insights

5 Silver Rules That Made Dataphin‑MCP’s AI Platform Scale to 1M Calls in 9 Days

This article shares the practical lessons learned from building Dataphin‑MCP, an AI‑enabled data‑development platform, by outlining five concrete "silver" rules, illustrating each with real‑world cases, and discussing deeper considerations for building robust AI‑first tools and harnesses.

AI platformAgent DesignConcept modeling

0 likes · 13 min read

5 Silver Rules That Made Dataphin‑MCP’s AI Platform Scale to 1M Calls in 9 Days

AI Explorer

Apr 10, 2026 · Artificial Intelligence

Why Onyx Open‑Source AI Platform Is Redefining Enterprise AI Development

Onyx, an open‑source AI platform that exploded on GitHub, bundles chat, RAG, web search and code execution into a model‑agnostic, self‑hosted solution, offering a one‑command installer, lightweight and full‑feature modes, and targeting developers, enterprises, researchers, and privacy‑focused users.

AI platformLLMOnyx

0 likes · 6 min read

Why Onyx Open‑Source AI Platform Is Redefining Enterprise AI Development

AI Explorer

Apr 5, 2026 · Artificial Intelligence

Onyx Open-Source AI Platform: Full Model Support and One‑Stop Deployable Solution

Onyx is an open‑source AI platform that acts as an application layer for large language models, offering a unified interface for RAG, web search, code execution, multimodal interaction, and customizable agents, with model‑agnostic support, one‑click installation, and flexible deployment options for individuals and enterprises.

AI platformDockerKubernetes

0 likes · 6 min read

Onyx Open-Source AI Platform: Full Model Support and One‑Stop Deployable Solution

AI Engineering

Apr 4, 2026 · Industry Insights

Anthropic Stops Supporting OpenClaw in Subscription Mode

Anthropic announced that, effective noon PT on April 4, Claude's subscription quota will no longer be usable through third‑party tools like OpenClaw, forcing users to switch to a pay‑as‑you‑go model, sparking criticism and highlighting the risks of relying on large AI platforms.

AI platformAnthropicClaude

0 likes · 3 min read

Anthropic Stops Supporting OpenClaw in Subscription Mode

dbaplus Community

Feb 9, 2026 · Artificial Intelligence

How EffectiveGPU Cuts GPU Costs with Fine‑Grained Partitioning and Volcano Scheduling

This article details how SF Tech's EffectiveGPU (EGPU) platform redesigns GPU resource management on Kubernetes, introducing fine‑grained memory and compute partitioning, priority‑based scheduling, Volcano integration, and monitoring pipelines to dramatically improve utilization and reduce hardware costs for AI workloads.

AI platformGPUGPU partitioning

0 likes · 23 min read

How EffectiveGPU Cuts GPU Costs with Fine‑Grained Partitioning and Volcano Scheduling

Fighter's World

Jan 23, 2026 · Artificial Intelligence

Why Most 'Palantir-ization' Fails: a16z Insights on Ontology‑FDE Architecture

The article dissects why many startups that try to emulate Palantir’s “platform‑first” model stumble, highlighting a16z’s five gating questions, the critical role of Ontology and Forward Deployed Engineers as a double‑helix architecture, and a practical matrix for assessing AI‑centric business and technical maturity.

AI platformEnterprise AIForward Deployed Engineer

0 likes · 20 min read

Why Most 'Palantir-ization' Fails: a16z Insights on Ontology‑FDE Architecture

Tech Verticals & Horizontals

Jan 8, 2026 · Artificial Intelligence

ByteDance Agent Practice Manual: Technical Guide and Deployment Strategies (2025)

This comprehensive manual outlines ByteDance's Agent platform, covering its technical foundations, architecture, development workflow, real‑world application scenarios, operational optimization, security compliance, future innovation paths, case studies, team collaboration, risk mitigation, tooling, and global adaptation.

AI platformAgentByteDance

0 likes · 4 min read

ByteDance Agent Practice Manual: Technical Guide and Deployment Strategies (2025)

Architect's Alchemy Furnace

Dec 21, 2025 · Artificial Intelligence

Deploy and Explore Open WebUI: A Feature‑Rich Self‑Hosted AI Platform

Open WebUI is a self‑hosted, extensible AI platform that runs fully offline, supports multiple LLM back‑ends such as Ollama and OpenAI‑compatible APIs, offers built‑in RAG, role‑based access, multi‑model chat, markdown/LaTeX, image generation, and provides detailed Docker, pip, and Kubernetes installation guides with ready‑to‑run commands.

AI platformDockerLLM

0 likes · 11 min read

Deploy and Explore Open WebUI: A Feature‑Rich Self‑Hosted AI Platform

DataFunSummit

Dec 14, 2025 · Artificial Intelligence

How Sina Weibo Scaled Enterprise AI with a Unified Multi‑Agent Platform

Sina Weibo’s engineering team tackled the high technical barriers, low reuse, and long cycles of large‑model AI deployment by building a unified AI application platform that combines a layered architecture, low‑code workflow, multi‑agent orchestration, and knowledge‑base integration, enabling rapid, reliable AI solutions across the company.

AI platformEnterprise AIKnowledge Base

0 likes · 26 min read

How Sina Weibo Scaled Enterprise AI with a Unified Multi‑Agent Platform

Su San Talks Tech

Dec 14, 2025 · Operations

How to Deploy a Lightweight Gitea Git Server with Docker for AI Platforms

This guide walks through deploying the lightweight Gitea Git service with Docker, configuring MySQL, setting up the web interface, creating repositories, generating access tokens, and accessing its REST API, offering a resource‑efficient alternative to GitLab for AI model hosting platforms.

AI platformDockerGit

0 likes · 5 min read

How to Deploy a Lightweight Gitea Git Server with Docker for AI Platforms

Past Memory Big Data

Dec 9, 2025 · Artificial Intelligence

A Decade of Evolution: Inside Pinterest’s AI Platform Journey

Over ten years Pinterest transformed a fragmented machine‑learning stack into a unified AI platform, iterating through stages from early ad‑hoc pipelines to scalable GPU‑accelerated services, while learning that timing, organization alignment, and efficiency are crucial for lasting impact.

AI platformGPU inferenceML Ops

0 likes · 25 min read

A Decade of Evolution: Inside Pinterest’s AI Platform Journey

Old Meng AI Explorer

Dec 5, 2025 · Industry Insights

How Bisheng Turns Enterprise AI Deployment into a Zero‑Code, One‑Stop Process

Bisheng, an open‑source LLM DevOps platform, solves the fragmented, high‑threshold, and compliance‑heavy challenges of enterprise AI by offering a zero‑code visual workflow, all‑in‑one RAG/Agent capabilities, strict security controls, and high‑precision document parsing, enabling rapid, secure AI application rollout.

AI platformLLM DevOpsRAG

0 likes · 11 min read

How Bisheng Turns Enterprise AI Deployment into a Zero‑Code, One‑Stop Process

JD Cloud Developers

Nov 24, 2025 · Artificial Intelligence

JoyAgent: Open‑Source Enterprise‑Grade Multi‑Agent Platform from JD

The 2025 Open Atom Developer Conference highlighted JD's JoyAgent project, an open‑source, 100% enterprise‑grade multi‑agent platform that excels in AI, data governance, and diagnostic analysis, with detailed features, performance metrics, and deployment experiences shared.

AI platformData GovernanceDiagnostic Analysis

0 likes · 7 min read

JoyAgent: Open‑Source Enterprise‑Grade Multi‑Agent Platform from JD

Architect

Nov 17, 2025 · Artificial Intelligence

Comparing Tasking AI and Dify: Architecture, Core Capabilities, and AI Workflow Engines

This article examines the design of LLM‑native AI application platforms Tasking AI and Dify, comparing their LLM integration, plugin management, multi‑tenant isolation, system architecture, and especially Dify’s GraphEngine for complex AI workflow orchestration.

AI platformDifyGraphEngine

0 likes · 22 min read

Comparing Tasking AI and Dify: Architecture, Core Capabilities, and AI Workflow Engines

360 Smart Cloud

Nov 14, 2025 · Artificial Intelligence

How TLM Platform Powers LLM Ops with PPO, GRPO and Reinforcement Evaluators

The article introduces the TLM large‑model development platform, details its fine‑tuning options, explains reinforcement learning fundamentals and key algorithms such as PPO and the newer GRPO, describes the architecture of a reinforcement evaluator, and shows how to configure RL training on the platform.

AI platformGRPOLLMOps

0 likes · 10 min read

How TLM Platform Powers LLM Ops with PPO, GRPO and Reinforcement Evaluators

Alibaba Cloud Big Data AI Platform

Nov 4, 2025 · Artificial Intelligence

How Alibaba Cloud’s PAI Powers Cutting‑Edge LLM Research at EMNLP 2025

EMNLP 2025 in Suzhou will feature Alibaba Cloud’s AI platform PAI presenting four accepted papers on knowledge distillation, small‑model reasoning, distilled reasoning models, and an automated RAG benchmark framework, alongside exhibition demos, networking events, and recruitment opportunities for AI talent.

AI platformEMNLP 2025Knowledge Distillation

0 likes · 10 min read

How Alibaba Cloud’s PAI Powers Cutting‑Edge LLM Research at EMNLP 2025

Tencent Cloud Developer

Oct 29, 2025 · Artificial Intelligence

How Tasking AI and Dify Redefine LLM‑Powered AI Application Development

This article analyzes the architecture, core capabilities, and workflow orchestration of LLM‑native application platforms Tasking AI and Dify, comparing their microservice designs, plugin management, multi‑tenant isolation, and GraphEngine execution to highlight strengths, trade‑offs, and future development trends.

AI platformDifyLLM

0 likes · 21 min read

How Tasking AI and Dify Redefine LLM‑Powered AI Application Development

Raymond Ops

Sep 26, 2025 · Artificial Intelligence

How to Build and Deploy a Dify LLM Application Platform on CentOS

This comprehensive guide walks you through the fundamentals of Dify, an open‑source LLM application platform, its key features and use cases, and provides step‑by‑step instructions for preparing the environment, installing Docker and Docker‑Compose, and deploying Dify on a CentOS 7.9 server.

AI platformDifyDocker

0 likes · 13 min read

How to Build and Deploy a Dify LLM Application Platform on CentOS

Alibaba Cloud Big Data AI Platform

Jul 23, 2025 · Artificial Intelligence

Unlock Efficient LLMs: How Alibaba’s PAI EasyDistill Powers Model Post‑Training

This article explains how Alibaba Cloud's AI platform PAI leverages the EasyDistill framework for post‑training model optimization, covering knowledge distillation concepts, data synthesis techniques, basic and advanced distillation training, the DistilQwen model family, real‑world customer cases, and step‑by‑step practical demos.

AI platformEasyDistillKnowledge Distillation

0 likes · 12 min read

Unlock Efficient LLMs: How Alibaba’s PAI EasyDistill Powers Model Post‑Training

Alibaba Cloud Big Data AI Platform

Jul 9, 2025 · Artificial Intelligence

Boost Large Model Performance with PAI‑ChatLearn: A High‑Performance RL Framework

PAI‑ChatLearn is a flexible, easy‑to‑use, high‑efficiency reinforcement‑learning framework on Alibaba Cloud’s AI platform that addresses usability and performance challenges of post‑training large models through features like Ray‑based scheduling, dynamic batchsize, sequence packing, MoE acceleration, and provides step‑by‑step guidance for deploying RL tasks such as Qwen‑3 on PAI‑DLC.

AI platformPAI-ChatLearnlarge models

0 likes · 11 min read

Boost Large Model Performance with PAI‑ChatLearn: A High‑Performance RL Framework

Alibaba Cloud Big Data AI Platform

Jun 25, 2025 · Artificial Intelligence

Boost Post‑Training Efficiency with Cosmos‑RL, Ray, and VeRL on Alibaba PAI

This article introduces Alibaba Cloud's PAI platform and demonstrates how open‑source reinforcement‑learning frameworks such as Cosmos‑RL, Ray, and VeRL accelerate post‑training for large language models, offering higher throughput, fault‑tolerance, and seamless integration for AI developers.

AI platformOpen Source Frameworksdistributed training

0 likes · 9 min read

Boost Post‑Training Efficiency with Cosmos‑RL, Ray, and VeRL on Alibaba PAI

Huolala Tech

May 29, 2025 · Artificial Intelligence

How LWS Enables Scalable Multi‑Node Large Model Deployment on Kubernetes

The article explains how the Dolphin AI platform tackles large‑model deployment challenges by replacing standard Kubernetes Deployments with LeaderWorkerSet, detailing its architecture, features, installation steps, example configurations, testing, scaling, rolling updates, fault recovery, and future roadmap for AI workloads.

AI platformDistributed InferenceKubernetes

0 likes · 12 min read

How LWS Enables Scalable Multi‑Node Large Model Deployment on Kubernetes

Fighter's World

May 24, 2025 · Artificial Intelligence

Why Glean Leads Enterprise Search: What Makes It So Powerful?

The article examines Glean’s evolution from an enterprise‑search startup to a comprehensive Work AI Platform, detailing its market growth, competitive positioning, technical architecture—including data connectors, knowledge graphs, custom models, and agent reasoning—and the strategic challenges it must overcome to sustain its lead.

AI platformAgentContextual AI

0 likes · 30 min read

Why Glean Leads Enterprise Search: What Makes It So Powerful?

360 Zhihui Cloud Developer

May 15, 2025 · Cloud Native

How 360’s AI Platform Boosted GPU Utilization with Volcano Scheduler

360’s AI platform migrated its GPU clusters to a cloud‑native architecture and adopted the Volcano scheduler, achieving over 45% GPU utilization, less than 7% fragmentation, and more than 1000000 scheduled Pods, while leveraging flexible plugins, hierarchical queues, and resource pooling to optimize AI and big‑data workloads.

AI platformGPU schedulingKubernetes

0 likes · 13 min read

How 360’s AI Platform Boosted GPU Utilization with Volcano Scheduler

DeWu Technology

May 9, 2025 · Artificial Intelligence

Growth Story of a Technical Lead: Building a One‑Stop Large‑Model Training and Inference Platform at Dewu

Meng, a former Tencent and Alibaba engineer, led Dewu’s one‑stop large‑model training and inference platform, cutting integration costs, creating a shared GPU pool and CI/CD pipeline, building a Milvus vector‑database, and driving self‑directed learning that boosted business value, user experience, and set a roadmap for future RAG and cloud‑native optimizations.

AI platformMLOpsVector Database

0 likes · 18 min read

Growth Story of a Technical Lead: Building a One‑Stop Large‑Model Training and Inference Platform at Dewu

Go Programming World

Apr 22, 2025 · Artificial Intelligence

Design and Implementation of an Enterprise‑Grade LLMOPS Platform (EasyAI)

This article presents a comprehensive overview of building an enterprise‑level LLMOPS platform—including concept definitions, the relationship between LLMOPS, MLOps and intelligent agent platforms, four development tiers, architecture layers, core technical concerns, deployment options, and the benefits of cloud‑native AI development.

AI platformCloud NativeGo

0 likes · 15 min read

Design and Implementation of an Enterprise‑Grade LLMOPS Platform (EasyAI)

MaGe Linux Operations

Apr 3, 2025 · Artificial Intelligence

How to Build and Deploy a Dify LLM Application Platform on CentOS

This guide explains what Dify is, outlines its key features and application scenarios, and provides step‑by‑step instructions for preparing the environment, installing Docker and Docker‑Compose, and deploying Dify on a CentOS 7.9 system, including verification of a successful setup.

AI platformDifyDocker

0 likes · 9 min read

Baidu Geek Talk

Mar 19, 2025 · Artificial Intelligence

Inside Baidu’s New Wenxin 4.5 & X1: Multimodal Breakthroughs and Tool‑Enabled AI

Baidu officially launched the Wenxin 4.5 and X1 large language models, showcasing native multimodal foundations, advanced attention masks, heterogeneous expert extensions, and tool‑calling capabilities, while offering low‑cost API access on the Qianfan platform and outlining the underlying technical innovations that drive their performance gains.

AI platformBaiduMultimodal AI

0 likes · 8 min read

Inside Baidu’s New Wenxin 4.5 & X1: Multimodal Breakthroughs and Tool‑Enabled AI

JD Tech Talk

Feb 18, 2025 · Artificial Intelligence

Agent Applications in Advertising at JD.com: Technical Implementation and Platform Architecture

This article explores how JD.com's advertising team leverages Agent technology to enhance advertising operations through AI-driven automation, covering technical implementations like RAG, Function Call capabilities, and platform architecture for scalable AI solutions.

AI platformAgentJD.com

0 likes · 23 min read

Agent Applications in Advertising at JD.com: Technical Implementation and Platform Architecture

DataFunSummit

Feb 14, 2025 · Artificial Intelligence

Building Large‑Scale Recommendation Systems with Big Data and Large Language Models on Alibaba Cloud AI Platform

This presentation details how Alibaba Cloud's AI platform integrates big‑data pipelines, feature‑store services, and large language model capabilities to construct high‑performance search‑recommendation architectures, covering system design, training and inference optimizations, LLM‑driven use cases, and open‑source RAG tooling.

AI platformBig DataFeature Store

0 likes · 17 min read

Building Large‑Scale Recommendation Systems with Big Data and Large Language Models on Alibaba Cloud AI Platform

Python Crawling & Data Mining

Feb 7, 2025 · Artificial Intelligence

Unlocking DeepSeek: How to Use AI-Powered Python Scraping for Baidu Hot Topics

This article introduces DeepSeek, an AI platform with strong NLP and multimodal capabilities, outlines its core features and products, and provides a step‑by‑step Python tutorial—including a complete requests‑BeautifulSoup script—to scrape Baidu’s homepage hot‑topic titles, plus usage tips and precautions.

AI platformBaidu hot topicsPython web scraping

0 likes · 8 min read

Unlocking DeepSeek: How to Use AI-Powered Python Scraping for Baidu Hot Topics

Baidu Geek Talk

Feb 5, 2025 · Artificial Intelligence

How to Unlock Full GPU Efficiency for Enterprise AI Platforms

This article analyzes common GPU efficiency problems in enterprise AI compute platforms—such as low utilization, long fault‑resolution times, and limited performance gains—and presents three practical solutions: dynamic resource allocation, systematic fault‑tolerance, and system‑level tuning, illustrated with real‑world case studies.

AI platformGPU UtilizationResource Scheduling

0 likes · 11 min read

How to Unlock Full GPU Efficiency for Enterprise AI Platforms

DataFunTalk

Jan 26, 2025 · Artificial Intelligence

58.com’s LingXi Large Language Model Platform: Development, Deployment, and Performance Optimizations

Since the launch of ChatGPT, 58.com has built a Model‑as‑a‑Service platform called LingXi that trains and serves domain‑specific large language models, supports over a hundred internal scenarios with daily inference exceeding ten million calls, and continuously improves performance through quantization, GPU optimization, model miniaturization, and advanced AI applications such as interview assistants, voice agents, and RAG‑enabled agents.

AI ApplicationsAI platformInference Optimization

0 likes · 9 min read

58.com’s LingXi Large Language Model Platform: Development, Deployment, and Performance Optimizations

Alibaba Cloud Big Data AI Platform

Jan 3, 2025 · Artificial Intelligence

Build an Education‑Focused Retrieval‑Augmented Generation (RAG) Solution with Alibaba PAI

This guide walks you through creating a RAG‑enhanced AI solution for education using Alibaba PAI, covering prerequisite setup, knowledge‑base construction with PAI‑Designer, model deployment, connection configuration, workflow assembly, and a side‑by‑side comparison of RAG versus non‑RAG answers.

AI platformLLMMilvus

0 likes · 16 min read

Build an Education‑Focused Retrieval‑Augmented Generation (RAG) Solution with Alibaba PAI

DataFunSummit

Dec 31, 2024 · Artificial Intelligence

How Momo Leverages Large Model Technology to Transform Business and R&D Processes

This article explains how Momo utilizes large language model technologies to revamp its AI application paradigm, achieve efficient inference through quantization and prefix caching, build a workflow‑based model platform, and outline future plans for framework optimization and multimodal support.

AI platformInference OptimizationLarge Language Models

0 likes · 16 min read

How Momo Leverages Large Model Technology to Transform Business and R&D Processes

Alibaba Cloud Infrastructure

Dec 4, 2024 · Artificial Intelligence

Implementation of a Cloud‑Native AI‑Powered Quantitative Research Platform Using Alibaba Cloud ACK

The article details how the Juqian intelligent investment research platform leverages Alibaba Cloud's ACK cloud‑native AI suite, Kubernetes, and various cloud services to build a high‑efficiency, scalable AI‑driven quantitative finance solution, improving resource utilization, reducing costs, and accelerating research workflows.

.aiACKAI platform

0 likes · 5 min read

Implementation of a Cloud‑Native AI‑Powered Quantitative Research Platform Using Alibaba Cloud ACK

Tencent Tech

Nov 19, 2024 · Artificial Intelligence

How Tencent’s Angel Platform Secured the 2024 World Internet Conference Leading Technology Award

Tencent’s Angel machine learning platform, recognized for breakthroughs in trillion‑scale model training, inference, and deployment, won the 2024 World Internet Conference Leading Technology Award, highlighting its self‑developed hardware‑software stack, high‑performance networking, and extensive real‑world AI applications.

AI platformAngelTechnology Award

0 likes · 6 min read

How Tencent’s Angel Platform Secured the 2024 World Internet Conference Leading Technology Award

WeChat Backend Team

Oct 23, 2024 · Artificial Intelligence

How We Scaled AI Computing in WeChat with Ray: From Challenges to AstraRay

This article details the AI computing challenges faced by WeChat, explains why the Ray distributed engine was chosen, and describes the design and large‑scale deployment of the AstraRay platform—including scheduling, resource management, and multi‑model support—to achieve low‑cost, high‑efficiency AI services.

AI platformAstraRayDistributed Computing

0 likes · 20 min read

How We Scaled AI Computing in WeChat with Ray: From Challenges to AstraRay

58 Tech

Aug 7, 2024 · Artificial Intelligence

Bridging Compute and Applications: 58.com AI Lab’s Large‑Model Platform and AI Agent Solutions

In this article, 58.com AI Lab senior director Zhan Kunlin explains how the company built a multi‑layer AI platform, created a vertical large‑language model called LingXi, and developed an AI Agent system with RAG capabilities to accelerate practical AI applications across various business scenarios.

AI agentsAI platformModel Deployment

0 likes · 10 min read

Bridging Compute and Applications: 58.com AI Lab’s Large‑Model Platform and AI Agent Solutions

NewBeeNLP

Aug 5, 2024 · Industry Insights

How Alibaba Cloud Scales Search Recommendations with Big Data, AI, and LLMs

This article details Alibaba Cloud's end‑to‑end architecture for search and advertising recommendation, covering the data platform, AI services, feature‑store design, training and inference optimizations, and the integration of large language models for new recommendation scenarios.

AI platformAlibaba CloudBig Data

0 likes · 17 min read

How Alibaba Cloud Scales Search Recommendations with Big Data, AI, and LLMs

DataFunTalk

Aug 2, 2024 · Artificial Intelligence

From Big Data to Large Models: Alibaba Cloud AI Platform Architecture and Practices for Search Recommendation

This presentation details Alibaba Cloud's AI platform, covering the end‑to‑end pipeline from big‑data processing and feature engineering to large‑model training, inference optimization, recommendation system architecture, and RAG applications, highlighting practical engineering solutions and performance gains.

AI platformBig DataFeature Store

0 likes · 18 min read

From Big Data to Large Models: Alibaba Cloud AI Platform Architecture and Practices for Search Recommendation

Architects' Tech Alliance

Jul 28, 2024 · Artificial Intelligence

Design and Optimization Practices for Intelligent Computing Platforms in the Era of Large Models

The article examines the new characteristics, challenges, and technical practices of intelligent computing platforms required for large‑model AI workloads, covering infrastructure adaptation, heterogeneous scheduling, application acceleration, operation reliability, and future directions for simplifying GPU usage and connecting heterogeneous resources.

AI platformPerformance OptimizationScheduling

0 likes · 6 min read

Design and Optimization Practices for Intelligent Computing Platforms in the Era of Large Models

DataFunTalk

Jun 21, 2024 · Artificial Intelligence

Fine‑tuning Large Language Models with Alibaba Cloud PAI: Practices, Techniques, and Deployment

This article introduces the Alibaba Cloud PAI platform for large language model (LLM) fine‑tuning, covering model‑training pipelines, performance‑cost trade‑offs, retrieval‑augmented generation, fine‑tuning methods such as full‑parameter, LoRA and QLoRA, model selection, data preparation, evaluation, and real‑world deployment examples.

AI platformLLMModel Deployment

0 likes · 20 min read

Fine‑tuning Large Language Models with Alibaba Cloud PAI: Practices, Techniques, and Deployment

DataFunSummit

Jun 17, 2024 · Artificial Intelligence

Strategies for Reducing Cost and Improving Efficiency in Recommendation Systems with Alibaba Cloud PAI‑Rec

This article discusses how Alibaba Cloud’s AI platform PAI‑Rec reduces recommendation system costs and boosts efficiency by optimizing training resources, leveraging FeatureStore, EasyRec and TorchEasyRec frameworks, detailing workflow stages, feature consistency, GPU acceleration, componentized model configuration, and practical deployment timelines.

AI platformFeature StoreGPU Acceleration

0 likes · 14 min read

Strategies for Reducing Cost and Improving Efficiency in Recommendation Systems with Alibaba Cloud PAI‑Rec

58UXD

May 27, 2024 · Artificial Intelligence

How to Build AI Chatbots with Coze: Free ChatGPT‑4 Powered Platform

This guide introduces Coze, ByteDance's AI chatbot and app‑development platform that offers free ChatGPT‑4 access, outlines its core features such as plugins, knowledge bases, workflows and multi‑platform integration, and provides a step‑by‑step tutorial for creating and publishing your own AI bot.

AI ChatbotAI platformChatGPT-4

0 likes · 8 min read

How to Build AI Chatbots with Coze: Free ChatGPT‑4 Powered Platform

Eric Tech Circle

May 22, 2024 · Artificial Intelligence

Deploy and Build AI Apps with Dify: A Complete Open‑Source Guide

This article introduces Dify, an open‑source LLM application platform, outlines its core features such as workflows, model support, RAG pipelines, agents, and observability, compares it with alternatives, and provides step‑by‑step deployment instructions using Docker Compose and Helm for local and Kubernetes environments.

AI platformDockerKubernetes

0 likes · 7 min read

Deploy and Build AI Apps with Dify: A Complete Open‑Source Guide

AI Large Model Application Practice

Apr 5, 2024 · Artificial Intelligence

Hands‑On Comparison of Baidu AppBuilder, Alibaba Bailei, and ByteDance Coze LLM Platforms

This article provides a practical, side‑by‑side review of three major large‑model application development platforms—Baidu AppBuilder, Alibaba Bailei, and ByteDance Coze—detailing their creation workflows, configuration options, SDK capabilities, plugin ecosystems, workflow orchestration, and overall strengths and limitations for building AI agents.

AI platformAppBuilderComparison

0 likes · 18 min read

Hands‑On Comparison of Baidu AppBuilder, Alibaba Bailei, and ByteDance Coze LLM Platforms

dbaplus Community

Apr 1, 2024 · Cloud Native

Scaling Cloud‑Native Containers at DeWu: Multi‑Cluster Management and Cost Optimization

This article details DeWu's cloud‑native transformation since August 2021, covering multi‑cluster federation, application profiling, custom scheduling plugins, resource pre‑reservation, co‑location of online and offline workloads, cost‑saving hardware choices, multi‑cloud strategy, and the development of the KubeAI platform for AI scenarios.

AI platformMulti-Clustercontainer scheduling

0 likes · 24 min read

Scaling Cloud‑Native Containers at DeWu: Multi‑Cluster Management and Cost Optimization

DataFunSummit

Feb 25, 2024 · Artificial Intelligence

Tencent FinTech AI Development Platform: Architecture, Challenges, and Solutions

This article introduces Tencent FinTech’s AI development platform, outlining its business background and goals, the technical challenges encountered in feature engineering, model training, and inference stability, and the comprehensive solutions—including a unified feature engine, distributed training framework, optimized deployment, and future plans for large‑scale graph training and AutoML.

AI platformFinTechModel Deployment

0 likes · 13 min read

Tencent FinTech AI Development Platform: Architecture, Challenges, and Solutions

Alibaba Cloud Big Data AI Platform

Jan 11, 2024 · Artificial Intelligence

Deploy and Fine‑Tune Alibaba’s Qwen‑72B‑Chat on PAI‑QuickStart

This guide explains how to meet runtime requirements, deploy Qwen‑72B‑Chat via the Alibaba Cloud PAI console, invoke it with cURL or Python SDK, and perform full‑parameter fine‑tuning using Megatron‑LM, providing a complete end‑to‑end workflow for large language model development.

AI platformPAI-QuickStartQwen-72B

0 likes · 12 min read

Deploy and Fine‑Tune Alibaba’s Qwen‑72B‑Chat on PAI‑QuickStart

Baidu Geek Talk

Dec 20, 2023 · Artificial Intelligence

A Unified Platform for Prompt Development, Evaluation, and Iteration in Large Language Model Applications

The proposed unified platform centralizes prompt creation, evaluation, and iteration for large‑model applications, offering one‑stop hosting, metric‑driven testing, seamless resource integration, model switching, fine‑grained traffic control, and an automated data‑flywheel with QEP scoring, cutting optimization cycles from weeks to days while paving the way for advanced fine‑tuning techniques.

AI platformAutomationData Flywheel

0 likes · 17 min read

A Unified Platform for Prompt Development, Evaluation, and Iteration in Large Language Model Applications

DataFunTalk

Nov 9, 2023 · Artificial Intelligence

Coeus: Bilibili's Cloud‑Native AI Platform and the PyTorch Training Performance Tuning Handbook

The article introduces Coeus, Bilibili's cloud‑native AI platform built on Kubernetes with Alluxio integration, explains how it solves major data and compute challenges, improves training performance, and promotes a free PyTorch performance‑tuning guide for engineers.

AI platformAlluxioKubernetes

0 likes · 4 min read

Coeus: Bilibili's Cloud‑Native AI Platform and the PyTorch Training Performance Tuning Handbook

DataFunTalk

Oct 19, 2023 · Artificial Intelligence

Multimodal Large Model Platform: History, Architecture, and Practice by Nine Chapters Cloud Extreme DataCanvas

This article presents Nine Chapters Cloud Extreme DataCanvas's insights and practices on multimodal large model platforms, covering their historical development, platform components such as AI Foundation Software and Prompt Manager, practical implementations like memory-augmented models and ETL pipelines, and future prospects for enterprise knowledge bases and agents.

AI platformKnowledge BaseMultimodal AI

0 likes · 13 min read

Multimodal Large Model Platform: History, Architecture, and Practice by Nine Chapters Cloud Extreme DataCanvas

Baidu Geek Talk

Oct 11, 2023 · Artificial Intelligence

How Baidu’s Qianfan 2.0 Supercharges Large‑Model Development and Deployment

The article reviews Baidu Cloud’s Qianfan 2.0 platform, detailing its expanded model catalog, dataset library, Chinese‑language enhancements, compression and speed gains, robust AI infrastructure, application templates, and end‑to‑end data‑labeling pipeline that together lower cost and accelerate large‑model adoption across industries.

AI platformLarge Language ModelsModel Deployment

0 likes · 14 min read

How Baidu’s Qianfan 2.0 Supercharges Large‑Model Development and Deployment

HelloTech

Sep 13, 2023 · Artificial Intelligence

AI Platform‑Powered Automated Ticket Routing: Modeling Workflow, Feature Engineering, and Intent Recognition

The Haro AI platform automates customer‑service ticket routing by applying a four‑step pipeline—feature processing, model training, evaluation, and deployment—using BERT/ALBERT‑based intent recognition, configurable feature storage, AutoML or expert modes, and Faas‑style deployment, as demonstrated in the Universal Ticket System case study, dramatically improving accuracy and efficiency.

AI platformALBERTBERT

0 likes · 11 min read

AI Platform‑Powered Automated Ticket Routing: Modeling Workflow, Feature Engineering, and Intent Recognition

HelloTech

Aug 22, 2023 · Artificial Intelligence

AI Platform Architecture and Automation in Machine Learning

An end‑to‑end AI platform integrates feature processing, model training, deployment, and decision orchestration across offline and online layers, leveraging automated pipelines such as AutoML (feature engineering, hyper‑parameter optimization, neural architecture search) built on Ray Tune and NNI, which have already boosted CTR in real‑world advertising and aim to make every user an algorithm engineer.

AI platformAutoMLDeep Learning

0 likes · 8 min read

AI Platform Architecture and Automation in Machine Learning

HelloTech

Aug 9, 2023 · Artificial Intelligence

AutoML in Hello's AI Platform and Quarkc: Building the Next‑Generation Front‑End Component Engine

At the 2023 SECon Global Software Engineering Innovation Summit in Shanghai, Hello’s technology team will showcase how its AI platform leverages AutoML to streamline model development across intelligent mobility services, and how the Quarkc engine uses Web Components to create cross‑stack, framework‑agnostic front‑end components.

AI platformAutoMLFront-end components

0 likes · 4 min read

AutoML in Hello's AI Platform and Quarkc: Building the Next‑Generation Front‑End Component Engine

DataFunTalk

Jul 11, 2023 · Artificial Intelligence

Sunshine Insurance Group's Zhèngyán Large Model Open Platform: Architecture, Tools, and Business Applications

The article describes Sunshine Insurance Group's Zhèngyán Large Model Open Platform, detailing its three‑layer architecture, AutoTrain tool, self‑developed LLM, smart routing, plugin marketplace, intelligent review, and how these capabilities empower insurance marketing, sales, service, and management through AI‑driven solutions.

AI platformInsurance TechnologyModel Deployment

0 likes · 13 min read

Sunshine Insurance Group's Zhèngyán Large Model Open Platform: Architecture, Tools, and Business Applications

DataFunSummit

Jul 1, 2023 · Artificial Intelligence

Alibaba Cloud Native Deep Learning Platform PAI‑DLC: Architecture, Features, and Future Outlook

This article introduces Alibaba Cloud's PAI‑DLC, a cloud‑native deep learning platform that integrates machine‑learning capabilities, containerized services, AI‑aware scheduling, GPU virtualization, elastic training with EasyScale, data access, and observability, and discusses its architecture, key features, and future directions.

AI platformCloud NativeDeep Learning

0 likes · 16 min read

Alibaba Cloud Native Deep Learning Platform PAI‑DLC: Architecture, Features, and Future Outlook

58 Tech

May 11, 2023 · Artificial Intelligence

Stella Data Annotation Platform: Design, Architecture, and AI‑Assisted Labeling

The article details the design and implementation of the Stella data annotation SaaS platform at 58.com, covering its background, evolution, modular architecture, annotation capabilities across text, image, audio, and video, AI‑assisted labeling, storage solutions, quality and efficiency management, as well as localization and licensing considerations.

AI platformActive LearningLocalization

0 likes · 21 min read

Stella Data Annotation Platform: Design, Architecture, and AI‑Assisted Labeling

HelloTech

May 8, 2023 · Artificial Intelligence

One‑Stop AI Platform for Cloud, Edge, Mobile, Flink, and Application Intelligence: Architecture, Challenges, and Solutions

The article presents a comprehensive one‑stop AI platform that unifies training, model, feature, and decision services across cloud, edge, mobile, Flink, and application environments, detailing its architecture, the limitations of cloud‑centric inference, the advantages of localized inference, and the challenges and solutions for model and feature localization, SDK design, and future AutoML enhancements.

AI platformFlinkdistributed systems

0 likes · 17 min read

One‑Stop AI Platform for Cloud, Edge, Mobile, Flink, and Application Intelligence: Architecture, Challenges, and Solutions

HelloTech

Apr 19, 2023 · Cloud Native

How FaaS Transforms AI Platforms: Lessons from Haro’s Cloud‑Native Journey

The article analyzes the operational, stability, and cost challenges of Haro’s AI platform, explains why a serverless FaaS architecture—specifically Knative—was selected, and details the implementation steps, performance gains, and future scenarios for AI workloads.

AI platformCloud NativeFaaS

0 likes · 8 min read

How FaaS Transforms AI Platforms: Lessons from Haro’s Cloud‑Native Journey

Huolala Tech

Mar 23, 2023 · Cloud Native

How Huolala Built a Cloud‑Native One‑Stop AI Platform on Kubernetes

Huolala’s Big Data Intelligent Platform team describes how they built a cloud‑native, one‑stop AI solution on Kubernetes, integrating Flink‑based feature engineering, a multi‑tenant Zeppelin notebook, GPU‑aware training, and a unified model‑serving platform, while addressing resource isolation, storage persistence, and cross‑cloud deployment.

AI platformCloud NativeGPU scheduling

0 likes · 17 min read

How Huolala Built a Cloud‑Native One‑Stop AI Platform on Kubernetes

Baidu Intelligent Cloud Tech Hub

Jan 18, 2023 · Artificial Intelligence

How Baidu’s AI Cloud Powers Scalable Autonomous Driving Solutions

This article outlines Baidu Intelligent Cloud’s end‑to‑end autonomous driving platform, detailing its AI foundation, massive cloud‑based data and compute requirements, flexible deployment strategies for various manufacturers, and comprehensive toolchains for data collection, annotation, training, simulation, and compliance.

AI platformBaiduCloud Computing

0 likes · 12 min read

How Baidu’s AI Cloud Powers Scalable Autonomous Driving Solutions

DataFunTalk

Dec 20, 2022 · Artificial Intelligence

Baidu Smart Cloud Digital Human Platform: Development, Architecture, and Solution Overview

This article provides a comprehensive overview of Baidu's Smart Cloud Digital Human platform, detailing its evolution since 2019, core AI-driven architecture, platform components such as persona management and business orchestration, various industry solutions, and technical Q&A on rendering, latency, and deployment.

AI platformBaiducloud rendering

0 likes · 13 min read

Baidu Smart Cloud Digital Human Platform: Development, Architecture, and Solution Overview

Laiye Technology Team

Dec 16, 2022 · Artificial Intelligence

Efficient Production of Scene-specific OCR Models Using an AI Platform

This article explains how a unified AI platform enables rapid, data‑driven creation, training, deployment, and evaluation of OCR models for visually distinct text regions such as seals, meter readings, license plates, and VIN codes, while minimizing hardware and annotation costs.

AI platformKubeflowModel Training

0 likes · 7 min read

Efficient Production of Scene-specific OCR Models Using an AI Platform

ByteDance Terminal Technology

Jul 29, 2022 · Artificial Intelligence

Pitaya: ByteDance’s End‑Side AI Engineering Platform Overview

Pitaya, built by ByteDance’s Client AI and MLX teams, is a comprehensive end‑side AI engineering platform that provides a full workflow from model development and data preparation to deployment, monitoring, and federated learning, supporting large‑scale commercial scenarios across multiple apps.

AI platformedge AIfederated learning

0 likes · 14 min read

Pitaya: ByteDance’s End‑Side AI Engineering Platform Overview

DataFunTalk

Jul 29, 2022 · Artificial Intelligence

Tencent Music Cloud‑Native One‑Stop Machine Learning Platform: Features and Future Roadmap

This article introduces Tencent Music's cloud‑native, one‑stop machine learning platform, detailing its engineering workflow, distributed acceleration, inference closed‑loop, edge computing capabilities, and future plans, while highlighting challenges of traditional ML pipelines and the platform's solutions for resource orchestration, storage, scheduling, and GPU utilization.

AI platformResource Managementcloud-native

0 likes · 17 min read

Tencent Music Cloud‑Native One‑Stop Machine Learning Platform: Features and Future Roadmap

Cloud Native Technology Community

Jul 21, 2022 · Cloud Native

Simplify Kubeflow Deployment with kubeflow-chart: A Step‑by‑Step Guide

This article analyzes the difficulties of using vanilla Kubeflow for MLOps, introduces the kubeflow‑chart Helm chart that streamlines deployment and integrates tools like SQLFlow and kfpdist, and provides detailed installation commands and a roadmap of upcoming components for a full cloud‑native AI platform.

AI platformCloud NativeKubeflow

0 likes · 12 min read

Simplify Kubeflow Deployment with kubeflow-chart: A Step‑by‑Step Guide

HelloTech

May 26, 2022 · Artificial Intelligence

Hello's Automated Growth Algorithm Loop: C‑Side Scenarios, Challenges, and Active Growth Strategies

Hello’s automated C‑side growth algorithm loop integrates diverse traffic sources, semi‑supervised PU‑learning, graph‑based look‑alike targeting, causal uplift models for smart subsidies, and adaptive copy and external ad optimization, dramatically boosting ride‑hailing and lifestyle service revenue while minimizing engineering duplication.

AI platformRecommendation SystemsUplift Modeling

0 likes · 20 min read

Hello's Automated Growth Algorithm Loop: C‑Side Scenarios, Challenges, and Active Growth Strategies

ITPUB

Apr 27, 2022 · Artificial Intelligence

How 58’s WPAI Platform Boosted AI Resource Utilization by Over 50%

This article details the design and optimization of 58.com’s WPAI machine learning platform, covering background, training‑task scheduling, elastic inference scaling, offline‑online resource mixing, and model‑inference acceleration, and shows how these techniques collectively raised GPU usage by 51% and CPU usage by 38% while cutting costs.

AI platformElastic ScalingGPU Utilization

0 likes · 26 min read

How 58’s WPAI Platform Boosted AI Resource Utilization by Over 50%

DataFunSummit

Apr 26, 2022 · Artificial Intelligence

Elastic Distributed Training at Huya: Design, Implementation, and Results

This talk describes Huya’s elastic distributed training system, covering the motivation behind elasticity, its design using Kubernetes and ETCD for dynamic node registration and scaling, implementation details of the EFDL framework, performance evaluations on ResNet‑50, and the resulting benefits and future directions.

AI platformGPU schedulingHuya

0 likes · 11 min read

Elastic Distributed Training at Huya: Design, Implementation, and Results

DataFunTalk

Apr 23, 2022 · Artificial Intelligence

Elastic Distributed Training at Huya: Design, Implementation, and Results

This article describes Huya's elastic distributed training system, explaining why elasticity is needed, the architectural design using Kubernetes and ETCD, the dynamic scaling process, performance evaluations on ResNet‑50, and future improvements for more efficient and reliable AI model training.

AI platformGPU schedulingKubernetes

0 likes · 10 min read

Kuaishou Tech

Nov 29, 2021 · Artificial Intelligence

Starry Vector Retrieval Platform: Architecture, Features, and Performance

The article describes the design, challenges, architecture, key features, algorithm optimizations, and future roadmap of Kuaishou's Starry vector retrieval platform, which delivers high‑performance, high‑reliability, and easy‑to‑use large‑scale ANN search for diverse business scenarios.

AI platformANNPerformance Optimization

0 likes · 14 min read

Starry Vector Retrieval Platform: Architecture, Features, and Performance

Alibaba Cloud Native

Oct 16, 2021 · Cloud Native

How Vivo Built a Hybrid‑Cloud AI Platform with Kubernetes and ACK

This article details how vivo AI's research institute created a hybrid‑cloud AI computing platform by integrating on‑premise bare‑metal servers with Alibaba Cloud ACK, using Kubernetes, Calico, and Terway to achieve elastic GPU resources, advanced storage features, and cost‑effective scaling for deep‑learning workloads.

AI platformCloud NativeHybrid Cloud

0 likes · 10 min read

How Vivo Built a Hybrid‑Cloud AI Platform with Kubernetes and ACK

ITFLY8 Architecture Home

Feb 23, 2021 · Artificial Intelligence

How Meituan Built a One‑Stop Machine Learning Platform for Delivery Optimization

This article explains how Meituan’s delivery business has transitioned from data online to AI‑driven decision making by building a comprehensive, one‑stop machine learning platform that includes model management, data graph, feature store, AB testing, and a machine‑learning definition language to accelerate algorithm iteration and reduce operational costs.

AB testingAI platformDelivery Logistics

0 likes · 5 min read

How Meituan Built a One‑Stop Machine Learning Platform for Delivery Optimization

DataFunTalk

Jan 13, 2021 · Artificial Intelligence

Building Graph Algorithm Tasks on Tencent Cloud TI-ONE with Angel

This article introduces Tencent Cloud's TI-ONE AI platform, explains its built‑in Angel algorithm support, demonstrates how to visually construct a graph‑algorithm workflow such as GraphSage, and outlines the resource configuration, execution, and result retrieval process for developers.

AI platformAngelTI-ONE

0 likes · 8 min read

Building Graph Algorithm Tasks on Tencent Cloud TI-ONE with Angel

58 Tech

Nov 20, 2020 · Artificial Intelligence

Evolution and Practice of the 58.com AI Algorithm Platform (WPAI)

The article details the development, architecture, and optimization of 58.com’s AI algorithm platform (WPAI), covering its background, overall design, large‑scale distributed machine learning, deep‑learning platform features, inference performance enhancements, GPU resource scheduling improvements, and future directions.

AI platformGPU schedulingInference Optimization

0 likes · 15 min read

Evolution and Practice of the 58.com AI Algorithm Platform (WPAI)

JD Retail Technology

Oct 21, 2020 · Artificial Intelligence

Galileo: A Distributed Graph Deep Learning Framework for Large‑Scale Industrial Scenarios

The article introduces Galileo, JD Retail's distributed graph deep‑learning platform that supports heterogeneous and dynamic graphs, ultra‑large scale training, flexible model customization, and seamless integration with TensorFlow and PyTorch, highlighting its architecture, core challenges, built‑in algorithms, and upcoming open‑source release.

AI platformGraph Neural Networksdistributed training

0 likes · 11 min read

Galileo: A Distributed Graph Deep Learning Framework for Large‑Scale Industrial Scenarios

58 Tech

Aug 7, 2020 · Artificial Intelligence

Technical Overview of 58.com Intelligent Voice Analysis Platform

The article presents a comprehensive technical overview of 58.com’s intelligent voice analysis platform, detailing its business background, system architecture, speech and NLP technologies, speaker diarization methods, model performance, data labeling workflow, and practical applications in call‑center quality inspection and user profiling.

AI platformdata labelingnatural language processing

0 likes · 11 min read

Technical Overview of 58.com Intelligent Voice Analysis Platform

Meituan Technology Team

Jul 16, 2020 · Artificial Intelligence

Augur: An Online Model Inference Framework and Poker Platform for Meituan Search

Meituan’s AI‑driven search combines the Augur online inference framework—offering stateless, distributed feature operators, transformers, and a DSL for rapid, high‑throughput model scoring—with the Poker platform for model training, versioning, and experimentation, together accelerating iteration, improving performance, and enabling advanced model‑as‑feature ensembles.

AI platformSearch Enginefeature engineering

0 likes · 26 min read

Augur: An Online Model Inference Framework and Poker Platform for Meituan Search

Youzan Coder

Jun 17, 2020 · Artificial Intelligence

Sunfish: An Integrated AI Platform for Model Training and Online Service Deployment at Youzan

Sunfish is Youzan’s integrated AI platform that unifies visual drag‑and‑drop model training, notebook‑based algorithm development, automated model management and one‑click publishing with a low‑latency, high‑availability “small‑box” inference service, enabling end‑to‑end deep‑learning workflows from data exploration to online recommendation and risk‑control deployment.

AI platformMLOpsModel Training

0 likes · 17 min read

Sunfish: An Integrated AI Platform for Model Training and Online Service Deployment at Youzan

iQIYI Technical Product Team

Jun 12, 2020 · Artificial Intelligence

Deepthought: An End‑to‑End Machine Learning Platform at iQIYI

Deepthought is iQIYI’s end‑to‑end machine‑learning platform that unifies distributed frameworks, decouples pipeline stages, integrates with Tongtian Tower, and offers visual drag‑and‑drop configuration, evolving from a fraud‑detection prototype to a generic system with real‑time inference, automated hyper‑parameter optimization, and support for large‑scale data across anti‑fraud, recommendation, and analytics workloads.

AI platformAutoMLData Engineering

0 likes · 13 min read

Deepthought: An End‑to‑End Machine Learning Platform at iQIYI

JD Tech Talk

Apr 24, 2020 · Artificial Intelligence

Automated Machine Learning System Architecture and Hyper‑Parameter Optimization Process

This article presents a comprehensive automated machine‑learning platform that abstracts task design, hyper‑parameter search space management, optimization engines, algorithm repositories, training/evaluation engines, model repositories and monitoring panels, offering both expert‑assisted and code‑free modes to accelerate model building while reducing reliance on specialist knowledge.

AI platformAutoMLModel Management

0 likes · 17 min read

Automated Machine Learning System Architecture and Hyper‑Parameter Optimization Process

Didi Tech

Apr 2, 2020 · Artificial Intelligence

Interview: Didi AI’s DELTA – A Unified Framework for NLP and Speech Model Development

In this interview, Didi AI Labs’ Han Kun explains how the DELTA platform unifies TensorFlow‑based NLP and speech models—supporting tasks from text classification to voice emotion recognition—through a modular, easily deployable architecture, accelerating development, powering Didi products, and now open‑sourced for broader AI collaboration.

AI platformDeltaNLP

0 likes · 14 min read

Interview: Didi AI’s DELTA – A Unified Framework for NLP and Speech Model Development

Alibaba Cloud Developer

Dec 10, 2019 · Artificial Intelligence

Why GNNs Matter: Inside Alibaba’s AliGraph Platform for Scalable Graph AI

The article introduces AliGraph, Alibaba’s comprehensive Graph Neural Network platform showcased at NeurIPS 2019, explaining its layered architecture, scalable graph engine, extensible operators, and real‑world applications across e‑commerce, security and cloud services, while highlighting performance gains, supported algorithms, and the strategic focus on GNN research and development.

AI platformAlibabaGraph Neural Networks

0 likes · 14 min read

Why GNNs Matter: Inside Alibaba’s AliGraph Platform for Scalable Graph AI

58 Tech

Nov 6, 2019 · Artificial Intelligence

TensorRT Acceleration and Integration Design for the 58 AI Platform (WPAI)

This article explains how the 58 AI platform leverages NVIDIA TensorRT to accelerate deep‑learning inference on GPUs, describes three integration approaches, details the TF‑TRT implementation and Kubernetes deployment, and presents performance gains for ResNet‑50 and OCR models.

AI platformGPU inferenceKubernetes deployment

0 likes · 7 min read

TensorRT Acceleration and Integration Design for the 58 AI Platform (WPAI)