Tag

AI Platform

0 views collected around this technical thread.

360 Zhihui Cloud Developer
360 Zhihui Cloud Developer
May 15, 2025 · Cloud Native

How 360’s AI Platform Boosted GPU Utilization with Volcano Scheduler

360’s AI platform migrated its GPU clusters to a cloud‑native architecture and adopted the Volcano scheduler, achieving over 45% GPU utilization, less than 7% fragmentation, and more than 1000000 scheduled Pods, while leveraging flexible plugins, hierarchical queues, and resource pooling to optimize AI and big‑data workloads.

AI PlatformGPU schedulingKubernetes
0 likes · 13 min read
How 360’s AI Platform Boosted GPU Utilization with Volcano Scheduler
DeWu Technology
DeWu Technology
May 9, 2025 · Artificial Intelligence

Growth Story of a Technical Lead: Building a One‑Stop Large‑Model Training and Inference Platform at Dewu

Meng, a former Tencent and Alibaba engineer, led Dewu’s one‑stop large‑model training and inference platform, cutting integration costs, creating a shared GPU pool and CI/CD pipeline, building a Milvus vector‑database, and driving self‑directed learning that boosted business value, user experience, and set a roadmap for future RAG and cloud‑native optimizations.

AI PlatformMLOpsPerformance Optimization
0 likes · 18 min read
Growth Story of a Technical Lead: Building a One‑Stop Large‑Model Training and Inference Platform at Dewu
Go Programming World
Go Programming World
Apr 22, 2025 · Artificial Intelligence

Design and Implementation of an Enterprise‑Grade LLMOPS Platform (EasyAI)

This article presents a comprehensive overview of building an enterprise‑level LLMOPS platform—including concept definitions, the relationship between LLMOPS, MLOps and intelligent agent platforms, four development tiers, architecture layers, core technical concerns, deployment options, and the benefits of cloud‑native AI development.

AI PlatformDevOpsKubernetes
0 likes · 15 min read
Design and Implementation of an Enterprise‑Grade LLMOPS Platform (EasyAI)
JD Tech Talk
JD Tech Talk
Feb 18, 2025 · Artificial Intelligence

Agent Applications in Advertising at JD.com: Technical Implementation and Platform Architecture

This article explores how JD.com's advertising team leverages Agent technology to enhance advertising operations through AI-driven automation, covering technical implementations like RAG, Function Call capabilities, and platform architecture for scalable AI solutions.

AI PlatformArtificial IntelligenceFunction Call
0 likes · 23 min read
Agent Applications in Advertising at JD.com: Technical Implementation and Platform Architecture
DataFunSummit
DataFunSummit
Feb 14, 2025 · Artificial Intelligence

Building Large‑Scale Recommendation Systems with Big Data and Large Language Models on Alibaba Cloud AI Platform

This presentation details how Alibaba Cloud's AI platform integrates big‑data pipelines, feature‑store services, and large language model capabilities to construct high‑performance search‑recommendation architectures, covering system design, training and inference optimizations, LLM‑driven use cases, and open‑source RAG tooling.

AI PlatformFeature StoreRAG
0 likes · 17 min read
Building Large‑Scale Recommendation Systems with Big Data and Large Language Models on Alibaba Cloud AI Platform
DataFunTalk
DataFunTalk
Jan 26, 2025 · Artificial Intelligence

58.com’s LingXi Large Language Model Platform: Development, Deployment, and Performance Optimizations

Since the launch of ChatGPT, 58.com has built a Model‑as‑a‑Service platform called LingXi that trains and serves domain‑specific large language models, supports over a hundred internal scenarios with daily inference exceeding ten million calls, and continuously improves performance through quantization, GPU optimization, model miniaturization, and advanced AI applications such as interview assistants, voice agents, and RAG‑enabled agents.

AI PlatformAI applicationsLLM
0 likes · 9 min read
58.com’s LingXi Large Language Model Platform: Development, Deployment, and Performance Optimizations
DataFunSummit
DataFunSummit
Dec 31, 2024 · Artificial Intelligence

How Momo Leverages Large Model Technology to Transform Business and R&D Processes

This article explains how Momo utilizes large language model technologies to revamp its AI application paradigm, achieve efficient inference through quantization and prefix caching, build a workflow‑based model platform, and outline future plans for framework optimization and multimodal support.

AI PlatformMOMOinference optimization
0 likes · 16 min read
How Momo Leverages Large Model Technology to Transform Business and R&D Processes
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Dec 4, 2024 · Artificial Intelligence

Implementation of a Cloud‑Native AI‑Powered Quantitative Research Platform Using Alibaba Cloud ACK

The article details how the Juqian intelligent investment research platform leverages Alibaba Cloud's ACK cloud‑native AI suite, Kubernetes, and various cloud services to build a high‑efficiency, scalable AI‑driven quantitative finance solution, improving resource utilization, reducing costs, and accelerating research workflows.

AIAI PlatformAck
0 likes · 5 min read
Implementation of a Cloud‑Native AI‑Powered Quantitative Research Platform Using Alibaba Cloud ACK
Tencent Tech
Tencent Tech
Nov 19, 2024 · Artificial Intelligence

How Tencent’s Angel Platform Secured the 2024 World Internet Conference Leading Technology Award

Tencent’s Angel machine learning platform, recognized for breakthroughs in trillion‑scale model training, inference, and deployment, won the 2024 World Internet Conference Leading Technology Award, highlighting its self‑developed hardware‑software stack, high‑performance networking, and extensive real‑world AI applications.

AI PlatformAngelLarge Models
0 likes · 6 min read
How Tencent’s Angel Platform Secured the 2024 World Internet Conference Leading Technology Award
58 Tech
58 Tech
Aug 7, 2024 · Artificial Intelligence

Bridging Compute and Applications: 58.com AI Lab’s Large‑Model Platform and AI Agent Solutions

In this article, 58.com AI Lab senior director Zhan Kunlin explains how the company built a multi‑layer AI platform, created a vertical large‑language model called LingXi, and developed an AI Agent system with RAG capabilities to accelerate practical AI applications across various business scenarios.

AI PlatformAI agentsModel Deployment
0 likes · 10 min read
Bridging Compute and Applications: 58.com AI Lab’s Large‑Model Platform and AI Agent Solutions
DataFunTalk
DataFunTalk
Aug 2, 2024 · Artificial Intelligence

From Big Data to Large Models: Alibaba Cloud AI Platform Architecture and Practices for Search Recommendation

This presentation details Alibaba Cloud's AI platform, covering the end‑to‑end pipeline from big‑data processing and feature engineering to large‑model training, inference optimization, recommendation system architecture, and RAG applications, highlighting practical engineering solutions and performance gains.

AI PlatformFeature StoreLarge Models
0 likes · 18 min read
From Big Data to Large Models: Alibaba Cloud AI Platform Architecture and Practices for Search Recommendation
Architects' Tech Alliance
Architects' Tech Alliance
Jul 28, 2024 · Artificial Intelligence

Design and Optimization Practices for Intelligent Computing Platforms in the Era of Large Models

The article examines the new characteristics, challenges, and technical practices of intelligent computing platforms required for large‑model AI workloads, covering infrastructure adaptation, heterogeneous scheduling, application acceleration, operation reliability, and future directions for simplifying GPU usage and connecting heterogeneous resources.

AI PlatformLarge ModelsPerformance Optimization
0 likes · 6 min read
Design and Optimization Practices for Intelligent Computing Platforms in the Era of Large Models
DataFunTalk
DataFunTalk
Jun 21, 2024 · Artificial Intelligence

Fine‑tuning Large Language Models with Alibaba Cloud PAI: Practices, Techniques, and Deployment

This article introduces the Alibaba Cloud PAI platform for large language model (LLM) fine‑tuning, covering model‑training pipelines, performance‑cost trade‑offs, retrieval‑augmented generation, fine‑tuning methods such as full‑parameter, LoRA and QLoRA, model selection, data preparation, evaluation, and real‑world deployment examples.

AI PlatformFine-tuningLLM
0 likes · 20 min read
Fine‑tuning Large Language Models with Alibaba Cloud PAI: Practices, Techniques, and Deployment
DataFunSummit
DataFunSummit
Jun 17, 2024 · Artificial Intelligence

Strategies for Reducing Cost and Improving Efficiency in Recommendation Systems with Alibaba Cloud PAI‑Rec

This article discusses how Alibaba Cloud’s AI platform PAI‑Rec reduces recommendation system costs and boosts efficiency by optimizing training resources, leveraging FeatureStore, EasyRec and TorchEasyRec frameworks, detailing workflow stages, feature consistency, GPU acceleration, componentized model configuration, and practical deployment timelines.

AI PlatformFeature StoreGPU Acceleration
0 likes · 14 min read
Strategies for Reducing Cost and Improving Efficiency in Recommendation Systems with Alibaba Cloud PAI‑Rec
DataFunSummit
DataFunSummit
Feb 25, 2024 · Artificial Intelligence

Tencent FinTech AI Development Platform: Architecture, Challenges, and Solutions

This article introduces Tencent FinTech’s AI development platform, outlining its business background and goals, the technical challenges encountered in feature engineering, model training, and inference stability, and the comprehensive solutions—including a unified feature engine, distributed training framework, optimized deployment, and future plans for large‑scale graph training and AutoML.

AI PlatformFinTechModel Deployment
0 likes · 13 min read
Tencent FinTech AI Development Platform: Architecture, Challenges, and Solutions
Baidu Geek Talk
Baidu Geek Talk
Dec 20, 2023 · Artificial Intelligence

A Unified Platform for Prompt Development, Evaluation, and Iteration in Large Language Model Applications

The proposed unified platform centralizes prompt creation, evaluation, and iteration for large‑model applications, offering one‑stop hosting, metric‑driven testing, seamless resource integration, model switching, fine‑grained traffic control, and an automated data‑flywheel with QEP scoring, cutting optimization cycles from weeks to days while paving the way for advanced fine‑tuning techniques.

AI PlatformAutomationData Flywheel
0 likes · 17 min read
A Unified Platform for Prompt Development, Evaluation, and Iteration in Large Language Model Applications
DataFunTalk
DataFunTalk
Oct 19, 2023 · Artificial Intelligence

Multimodal Large Model Platform: History, Architecture, and Practice by Nine Chapters Cloud Extreme DataCanvas

This article presents Nine Chapters Cloud Extreme DataCanvas's insights and practices on multimodal large model platforms, covering their historical development, platform components such as AI Foundation Software and Prompt Manager, practical implementations like memory-augmented models and ETL pipelines, and future prospects for enterprise knowledge bases and agents.

AI PlatformLarge Modelsknowledge base
0 likes · 13 min read
Multimodal Large Model Platform: History, Architecture, and Practice by Nine Chapters Cloud Extreme DataCanvas
HelloTech
HelloTech
Sep 13, 2023 · Artificial Intelligence

AI Platform‑Powered Automated Ticket Routing: Modeling Workflow, Feature Engineering, and Intent Recognition

The Haro AI platform automates customer‑service ticket routing by applying a four‑step pipeline—feature processing, model training, evaluation, and deployment—using BERT/ALBERT‑based intent recognition, configurable feature storage, AutoML or expert modes, and Faas‑style deployment, as demonstrated in the Universal Ticket System case study, dramatically improving accuracy and efficiency.

AI PlatformALBERTBERT
0 likes · 11 min read
AI Platform‑Powered Automated Ticket Routing: Modeling Workflow, Feature Engineering, and Intent Recognition
HelloTech
HelloTech
Aug 22, 2023 · Artificial Intelligence

AI Platform Architecture and Automation in Machine Learning

An end‑to‑end AI platform integrates feature processing, model training, deployment, and decision orchestration across offline and online layers, leveraging automated pipelines such as AutoML (feature engineering, hyper‑parameter optimization, neural architecture search) built on Ray Tune and NNI, which have already boosted CTR in real‑world advertising and aim to make every user an algorithm engineer.

AI PlatformAutomationHPO
0 likes · 8 min read
AI Platform Architecture and Automation in Machine Learning