Tagged articles
52 articles
Page 1 of 1
Data Party THU
Data Party THU
May 14, 2026 · Artificial Intelligence

Explore TimechoAI: The New Timer Time‑Series Large Model Cloud Service Now Open for Beta

TimechoAI, the cloud service built on the Timer time‑series large model, offers the latest SOTA model (Timer‑3.5) alongside classic baselines, supports multiple data input methods, covariate integration, and API/SDK access, and invites industrial and IoT teams to test its predictive maintenance, production optimization, energy load forecasting, and anomaly detection capabilities through a simple invitation process.

AI Cloud ServiceIndustrial IoTLarge Model
0 likes · 13 min read
Explore TimechoAI: The New Timer Time‑Series Large Model Cloud Service Now Open for Beta
Data Party THU
Data Party THU
May 12, 2026 · Artificial Intelligence

Time Series Large Models Explained: What They Are and Why They Matter

The article introduces time‑series data, its ubiquitous examples, the challenges of traditional small models, and proposes a universal time‑series large model that simplifies data preparation and model building, ultimately enabling more efficient and stable industrial AI solutions, now offered as a cloud service.

AIARIMACRISP-DM
0 likes · 6 min read
Time Series Large Models Explained: What They Are and Why They Matter
PMTalk Product Manager Community
PMTalk Product Manager Community
Apr 30, 2026 · Artificial Intelligence

10 Essential Large‑Model Fine‑Tuning Techniques for AI Product Managers

This article systematically presents ten large‑model training and fine‑tuning methods—from full‑parameter finetuning to parameter‑efficient PEFT—detailing their principles, suitable scenarios, step‑by‑step workflows, code examples, and practical selection guidance for AI product managers.

AdapterFine-tuningLarge Model
0 likes · 13 min read
10 Essential Large‑Model Fine‑Tuning Techniques for AI Product Managers
PMTalk Product Manager Community
PMTalk Product Manager Community
Apr 30, 2026 · Artificial Intelligence

How a Large AI Model Is Trained: Insights from a High‑Earning AI Product Manager

The article walks through model training, validation, ensemble learning, and deployment from an AI product manager’s viewpoint, using a churn‑prediction case to illustrate decision boundaries, metric choices, industry‑specific algorithm trade‑offs, cost considerations, and practical serving options.

AI product managementLarge ModelModel Deployment
0 likes · 6 min read
How a Large AI Model Is Trained: Insights from a High‑Earning AI Product Manager
DataFunSummit
DataFunSummit
Apr 23, 2026 · Artificial Intelligence

Ontology + Large Model: How Knora Solves Hallucination and Execution Gaps in Enterprise AI

The article details how Knora 4.0 integrates ontology with large‑model AI to create a reusable, extensible enterprise AI platform that mitigates hallucination, stabilises output, and enables autonomous end‑to‑end execution, illustrated with LED production line case studies, architectural breakdowns, and a roadmap for future intelligent agents.

Autonomous AgentsEnterprise AIKnowledge Graph
0 likes · 17 min read
Ontology + Large Model: How Knora Solves Hallucination and Execution Gaps in Enterprise AI
ZhiKe AI
ZhiKe AI
Apr 19, 2026 · Artificial Intelligence

What Is an AI Agent? A 3‑Minute Beginner’s Guide

An AI Agent is a large‑model system that can perceive its environment, plan steps, invoke tools, and remember past interactions to autonomously achieve user‑specified goals, distinguishing it from simple chatbots that only answer questions.

AI AgentAutomationLarge Model
0 likes · 6 min read
What Is an AI Agent? A 3‑Minute Beginner’s Guide
Fun with Large Models
Fun with Large Models
Apr 17, 2026 · Artificial Intelligence

Mastering Large Model Training: Practical Parameter Tuning from Beginner to Pro

This guide walks you through interpreting training logs and loss curves, diagnosing common issues such as oscillation, under‑fitting, and over‑fitting, and applying concrete adjustments to learning rate, LoRA settings, batch size, and epochs, with scenario‑specific strategies to turn a novice into a tuning expert.

AI trainingLarge ModelLoRA
0 likes · 23 min read
Mastering Large Model Training: Practical Parameter Tuning from Beginner to Pro
Machine Heart
Machine Heart
Apr 2, 2026 · Artificial Intelligence

LongCat-Next: Turning Images, Audio, and Text into Tokens – What’s Next?

LongCat-Next is a 68.5‑billion‑parameter discrete‑native autoregressive multimodal model that tokenizes images, audio and text, challenges the belief that visual tokenization loses detail, matches specialized models on fine‑grained tasks, and demonstrates that joint understanding‑generation training can even improve generation quality.

Audio SynthesisLarge ModelLongCat-Next
0 likes · 21 min read
LongCat-Next: Turning Images, Audio, and Text into Tokens – What’s Next?
Didi Tech
Didi Tech
Mar 12, 2026 · Artificial Intelligence

How STAPO Improves Large‑Model Fine‑Tuning by Silencing Spurious Tokens

The STAPO (Spurious‑Token‑Aware Policy Optimization) algorithm, introduced by Tsinghua University's iDLab and Didi's Deep Sea Lab, tackles policy‑entropy instability and performance oscillation in reinforcement‑learning fine‑tuning of large models by mathematically analyzing token collision probability, defining spurious tokens, and applying a Silencing Spurious Tokens mechanism that yields state‑of‑the‑art results on multiple math‑reasoning benchmarks.

AI SafetyFine-tuningLarge Model
0 likes · 7 min read
How STAPO Improves Large‑Model Fine‑Tuning by Silencing Spurious Tokens
JD Tech Talk
JD Tech Talk
Mar 2, 2026 · Artificial Intelligence

How AI Agents Are Revolutionizing Insurance: Methodology, Economics, and Technical Blueprint

This article presents a comprehensive methodology for selecting AI agent scenarios, explains the economic benefits of agent deployment, details the technical architecture—including domain large models, knowledge bases, planning strategies, and RL‑based scheduling—and illustrates how these components are applied to insurance product design, pricing, fulfillment, and risk control to drive scale and profit.

AgentInsuranceLarge Model
0 likes · 42 min read
How AI Agents Are Revolutionizing Insurance: Methodology, Economics, and Technical Blueprint
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
Feb 18, 2026 · Artificial Intelligence

Microsoft’s 671B LLM Unifies Offline Ad Tasks—Can It Cut Compute Costs?

Microsoft’s AdNanny replaces a forest of specialized offline models with a single 671 B LLM, using a three‑stage data factory to generate reasoning‑rich corpora, dynamic task re‑weighting, RL‑based metric alignment, and a hybrid 31‑pipeline‑parallel architecture that halves compute cost while boosting performance on core ad‑ranking tasks.

AdNannyLLMLarge Model
0 likes · 9 min read
Microsoft’s 671B LLM Unifies Offline Ad Tasks—Can It Cut Compute Costs?
360 Smart Cloud
360 Smart Cloud
Dec 1, 2025 · Artificial Intelligence

How to Build Real‑Time Streaming Speech Recognition with a Large‑Model API (Go & Python)

This guide explains the background of speech‑to‑text technology, introduces the large‑model streaming speech recognition API, walks through obtaining an API key, and provides detailed Go and Python code for establishing a WebSocket connection, sending full‑client and audio‑only requests, and parsing server responses.

AIGolangLarge Model
0 likes · 12 min read
How to Build Real‑Time Streaming Speech Recognition with a Large‑Model API (Go & Python)
Architecture and Beyond
Architecture and Beyond
Nov 9, 2025 · Artificial Intelligence

Why Most AI Agents Use Workflows and How to Design Effective Ones

The article examines why most AI agents operate as workflow‑driven systems due to the reliability, accuracy, execution and cost limits of large models, and offers practical guidance on designing, evaluating, and iterating effective workflow agents while acknowledging their constraints and future prospects.

AI AgentLarge ModelProduct Design
0 likes · 12 min read
Why Most AI Agents Use Workflows and How to Design Effective Ones
Fun with Large Models
Fun with Large Models
Oct 15, 2025 · Artificial Intelligence

Low-Code Agent Framework Guide Part 4: Best Practices for Coze Model and Plugin Settings

This guide walks through Coze's large‑model configuration—including model selection, generation diversity, input/output parameters, and persona templates—demonstrates a complete e‑commerce chatbot setup, and details two plugin integration methods with three concise best‑practice rules for effective agent development.

CozeLarge ModelPlugin Integration
0 likes · 14 min read
Low-Code Agent Framework Guide Part 4: Best Practices for Coze Model and Plugin Settings
JD Cloud Developers
JD Cloud Developers
Aug 22, 2025 · Artificial Intelligence

Unlocking AI Productivity: Inside JD’s Large‑Model Tool Innovations

JD’s internal technical salon reveals the rapid evolution of large‑model AI tools, outlining design principles, current capabilities like JoyAgent, OxyGent and JoyCode, real‑world applications across office, code review, logistics, and future policy and multi‑agent visions, highlighting their impact on modern workplace productivity.

AI toolsEnterprise AIJD.com
0 likes · 13 min read
Unlocking AI Productivity: Inside JD’s Large‑Model Tool Innovations
JD Tech Talk
JD Tech Talk
Aug 22, 2025 · Artificial Intelligence

How JD’s Large‑Model Tools Are Transforming AI Workflows and Future Enterprises

JD’s internal technical salon reveals the rapid evolution of large‑model AI tools, outlining design principles, current capabilities like JoyAgent, OxyGent and JoyCode, real‑world applications across office, code review, logistics, and policy outlook, while highlighting future multi‑agent and embodied intelligence trends.

AI applicationsAI toolsEnterprise AI
0 likes · 13 min read
How JD’s Large‑Model Tools Are Transforming AI Workflows and Future Enterprises
Data Party THU
Data Party THU
Aug 2, 2025 · Artificial Intelligence

Timer 3.0: Generative Time‑Series Large Model Breaks Prediction Limits

The article summarizes Professor Long Mingsheng’s presentation on the Timer series of time‑series large models, detailing the three core challenges of industrial time‑series analysis, the evolution from statistical methods to generative models, and the technical breakthroughs of Timer 1.0, 2.0 and 3.0 that enable multi‑task, long‑context, and trillion‑scale forecasting for industrial digital transformation.

Industrial AIIoTDBLarge Model
0 likes · 14 min read
Timer 3.0: Generative Time‑Series Large Model Breaks Prediction Limits
JD Retail Technology
JD Retail Technology
Jul 31, 2025 · Artificial Intelligence

How AI is Revolutionizing Supply Chains: JD.com’s Billion‑Parameter Time‑Series Model

At the 2025 AI Innovation & Entrepreneurship Conference in Hangzhou, JD.com’s chief scientists unveiled a billion‑parameter time‑series large model and end‑to‑end inventory algorithms that dramatically boost demand forecasting, dynamic allocation, and overall supply‑chain efficiency, illustrating how AI can transform global logistics networks.

AIData‑Driven Decision MakingLarge Model
0 likes · 10 min read
How AI is Revolutionizing Supply Chains: JD.com’s Billion‑Parameter Time‑Series Model
DataFunTalk
DataFunTalk
Jul 21, 2025 · Artificial Intelligence

Top AI & Knowledge Graph Resources: A Curated Guide to Emerging Research

This article presents a curated list of cutting‑edge resources covering multimodal GraphRAG, knowledge‑graph‑driven large‑model applications in finance, healthcare, automotive, and more, offering insights into the evolving synergy between AI and knowledge graphs.

AIKnowledge GraphLarge Model
0 likes · 2 min read
Top AI & Knowledge Graph Resources: A Curated Guide to Emerging Research
Data Thinking Notes
Data Thinking Notes
Jun 29, 2025 · Artificial Intelligence

How to Build Effective Large Model Platforms: Challenges, Strategies, and Real‑World Cases

This report examines large model platforms as the engineering foundation for turning base models into business applications, outlines the full lifecycle toolchain, discusses key challenges such as technology selection and secure management, and provides practical case studies and strategic guidance for scalable, value‑driven deployment across industries.

AICase StudyDeployment
0 likes · 5 min read
How to Build Effective Large Model Platforms: Challenges, Strategies, and Real‑World Cases
JD Cloud Developers
JD Cloud Developers
Jun 24, 2025 · Artificial Intelligence

How JD Retail’s xLLM Architecture Revolutionizes AI Inference for E‑Commerce

At GAITC2025, JD Retail’s AI Infra lead Zhang Ke detailed the challenges of e‑commerce AI inference and introduced the xLLM edge‑cloud unified large‑model architecture, highlighting adaptive scheduling, offline unified scheduling, multi‑layer pipelines, and agent collaboration that boost performance, cut costs, and pave the way for future AI advancements.

AI inferenceLarge ModelModel Optimization
0 likes · 6 min read
How JD Retail’s xLLM Architecture Revolutionizes AI Inference for E‑Commerce
JD Tech
JD Tech
May 15, 2025 · Artificial Intelligence

How JD’s Omniforce Cuts Large‑Model Training Cost by 70% and Boosts Inference Speed 30%

The paper "Omniforce" from JD Exploration Research Institute presents a cloud‑edge collaborative AutoML system that uses model distillation, data governance, Bayesian training optimization, and cloud‑edge cooperation to reduce large‑model training costs by 70% and improve inference efficiency by an average of 30%, offering a reusable technical paradigm for scalable AI deployment.

AI efficiencyJoyBuildLarge Model
0 likes · 6 min read
How JD’s Omniforce Cuts Large‑Model Training Cost by 70% and Boosts Inference Speed 30%
DeWu Technology
DeWu Technology
May 9, 2025 · Artificial Intelligence

Growth Story of a Technical Lead: Building a One‑Stop Large‑Model Training and Inference Platform at Dewu

Meng, a former Tencent and Alibaba engineer, led Dewu’s one‑stop large‑model training and inference platform, cutting integration costs, creating a shared GPU pool and CI/CD pipeline, building a Milvus vector‑database, and driving self‑directed learning that boosted business value, user experience, and set a roadmap for future RAG and cloud‑native optimizations.

AI PlatformCareer DevelopmentLarge Model
0 likes · 18 min read
Growth Story of a Technical Lead: Building a One‑Stop Large‑Model Training and Inference Platform at Dewu
Baidu Geek Talk
Baidu Geek Talk
Apr 9, 2025 · Artificial Intelligence

Baidu's Wenxin X1 Large Model Officially Launches on Qianfan Platform

On April 2, Baidu released its Wenxin X1 large model on the Qianfan platform, offering enterprise users and developers a multimodal, deep‑thinking AI with superior math, coding, and reasoning scores, low token‑price API access, batch inference, one‑click distillation, and rapid RAG/Agent application building.

AIAPI ServiceBaidu
0 likes · 4 min read
Baidu's Wenxin X1 Large Model Officially Launches on Qianfan Platform
Alibaba Cloud Native
Alibaba Cloud Native
Mar 21, 2025 · Artificial Intelligence

Why Large‑Model Token Costs Explode and How to Tame Them

Deploying large‑model applications can lead to unpredictable token consumption far beyond traditional web services, driven by factors such as model type, input/output length, caching, chain‑of‑thought, prompt design, and ecosystem dependencies, requiring comprehensive monitoring, preventive controls, and post‑incident optimizations.

Large Modeltoken consumption
0 likes · 15 min read
Why Large‑Model Token Costs Explode and How to Tame Them
AsiaInfo Technology: New Tech Exploration
AsiaInfo Technology: New Tech Exploration
Mar 21, 2025 · Industry Insights

How DeepSeek’s Large Model is Revolutionizing Digital Twin Simulations

This article analyzes how DeepSeek’s multimodal large model overcomes traditional digital‑twin simulation bottlenecks through dynamic modeling, generative data augmentation, and low‑cost open‑source architecture, enabling smarter city traffic, industrial design, and water‑resource management while reshaping the industry’s AI‑driven simulation landscape.

AIDeepSeekDigital Twin
0 likes · 22 min read
How DeepSeek’s Large Model is Revolutionizing Digital Twin Simulations
Huolala Tech
Huolala Tech
Feb 27, 2025 · Artificial Intelligence

How Huolala’s Wukong Platform Solves Large‑Model Deployment Challenges

Huolala’s Wukong platform tackles the common “technology hype, implementation difficulty” dilemma of generative AI by unifying multimodal enterprise knowledge, enabling dynamic multi‑agent workflows, and providing low‑code tools, observability, and stable deployment across dozens of business scenarios.

AI workflowEnterprise AILarge Model
0 likes · 11 min read
How Huolala’s Wukong Platform Solves Large‑Model Deployment Challenges
DataFunSummit
DataFunSummit
Dec 24, 2024 · Artificial Intelligence

Considerations and Practices for Domesticating Large‑Model Inference Engines

This article examines the importance of domestic large‑model inference engines, compares Chinese and international chips, evaluates four architectural approaches, discusses practical challenges such as performance loss and model support, and outlines future expectations for high‑performance, heterogeneous‑chip inference solutions.

Domestic ChipInference EngineLarge Model
0 likes · 9 min read
Considerations and Practices for Domesticating Large‑Model Inference Engines
Architects' Tech Alliance
Architects' Tech Alliance
Nov 24, 2024 · Industry Insights

What’s Driving the Next Wave of Large‑Model Compute Infrastructure?

As AI accelerates, large‑model compute infrastructure becomes a cornerstone of digital transformation, with specialized accelerators, heterogeneous architectures, massive distributed clusters, intelligent scheduling, soaring costs, energy concerns, software‑hardware co‑design challenges, and data‑privacy issues shaping its future development.

AI hardwareCompute infrastructureFuture Trends
0 likes · 9 min read
What’s Driving the Next Wave of Large‑Model Compute Infrastructure?
AntTech
AntTech
Nov 13, 2024 · Information Security

Ant Group’s Large‑Model‑Based Security Parallel Plane and Intelligent Threat Detection System

The article details Ant Group’s AI‑driven security parallel plane and intelligent threat detection system, its DKCF‑based architecture, key modules for data correlation, unknown threat discovery, alarm reduction, and knowledge‑graph integration, and its recognition in the 2024 AI Pioneer Case Collection.

Ant GroupDKCFKnowledge Graph
0 likes · 5 min read
Ant Group’s Large‑Model‑Based Security Parallel Plane and Intelligent Threat Detection System
Fighter's World
Fighter's World
Sep 22, 2024 · Artificial Intelligence

How Large-Model AI Transforms Smart Customer Service – Alibaba Cloud Insights

The talk outlines the evolution of intelligent customer service over three decades, explains how generative large-model AI like ChatGPT has raised service expectations, and presents Alibaba Cloud’s four-stage implementation—experience, efficiency, capability, and insight—through three concrete cases and a roadmap for SMEs to build their own smart service systems.

AI agentsAlibaba-CloudLarge Model
0 likes · 12 min read
How Large-Model AI Transforms Smart Customer Service – Alibaba Cloud Insights
NewBeeNLP
NewBeeNLP
Jul 24, 2024 · Industry Insights

From Black Iron to Silver: The Evolution of Large Model Infrastructure (2019‑2024)

The article traces the evolution of large‑model training and inference infrastructure from the early “black‑iron” era (2019‑2021) through the “golden” boom (2022‑2023) to the emerging “silver” phase (2024‑), highlighting key research breakthroughs, open‑source frameworks, hardware trends, market dynamics, and practical challenges for engineers entering the field.

AI InfrastructureInferenceLarge Model
0 likes · 22 min read
From Black Iron to Silver: The Evolution of Large Model Infrastructure (2019‑2024)
AntTech
AntTech
Jul 9, 2024 · Artificial Intelligence

2024 Large Model Security Practice Whitepaper Unveiled at the World AI Conference

The jointly authored 2024 Large Model Security Practice whitepaper, released at the World AI Conference, outlines a comprehensive safety framework covering security, reliability, and controllability, presents industry case studies, and proposes a five‑dimensional governance model to guide high‑quality development of large AI models.

AI SafetyLarge Modelindustry practice
0 likes · 7 min read
2024 Large Model Security Practice Whitepaper Unveiled at the World AI Conference
DataFunTalk
DataFunTalk
Jul 7, 2024 · Artificial Intelligence

Large Model Application Development: Architecture, Lifecycle, and Prompt Engineering

This article presents a comprehensive knowledge map for developing large‑model applications, covering a four‑layer technical architecture, the full development lifecycle, core elements such as prompt engineering and model fine‑tuning, evaluation methods, and practical case studies, offering guidance for both enterprises and startups.

AI application developmentLarge ModelPrompt engineering
0 likes · 15 min read
Large Model Application Development: Architecture, Lifecycle, and Prompt Engineering
AntTech
AntTech
Jun 21, 2024 · Artificial Intelligence

Ant Group’s Security Parallel Aspect Fusion AI: A Case Study Selected at the 2024 World Intelligent Industry Expo

The article details Ant Group’s security parallel aspect fusion AI solution, selected as an exemplary case at the 2024 World Intelligent Industry Expo, explaining its multi‑dimensional data collection, large‑model integration, baseline construction, knowledge‑graph generation, and superior threat‑detection performance.

AIAnt GroupCase Study
0 likes · 4 min read
Ant Group’s Security Parallel Aspect Fusion AI: A Case Study Selected at the 2024 World Intelligent Industry Expo
G7 EasyFlow Tech Circle
G7 EasyFlow Tech Circle
May 29, 2024 · Artificial Intelligence

Engineering Large Model Enterprise Applications: Best Practices

This article outlines the key characteristics of large‑model enterprise applications, compares them with consumer use cases, and presents a comprehensive engineering roadmap—including model selection, knowledge‑base integration, tool implementation, intent recognition, output control, high‑availability deployment, and ongoing optimization—to help practitioners effectively harness AI models in real‑world business environments.

AI EngineeringLarge ModelRAG
0 likes · 12 min read
Engineering Large Model Enterprise Applications: Best Practices
Baidu Tech Salon
Baidu Tech Salon
May 21, 2024 · Artificial Intelligence

Baidu Comate Code Knowledge Enhancement 2.0: First Domestic Intelligent Code Assistant with Real-time Retrieval

At the iTechClub summit, Baidu unveiled Comate Code Knowledge Enhancement 2.0—the nation’s first AI‑driven code assistant with real‑time web retrieval, over 10,000 built‑in GitHub repositories, API‑call generation, test case creation, and deep private‑doc understanding, now producing 27% of Baidu’s daily new code and topping VSCode and JetBrains plugin ratings.

AI code assistantBaiduCode Generation
0 likes · 5 min read
Baidu Comate Code Knowledge Enhancement 2.0: First Domestic Intelligent Code Assistant with Real-time Retrieval
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
May 15, 2024 · Artificial Intelligence

How Baidu’s AIAK‑LLM Supercharges Large‑Model Training and Inference

The article explains the scaling challenges of ever‑larger LLMs, introduces the MFU performance metric, surveys industry parallelism and memory‑saving techniques, and details Baidu’s AIAK‑LLM suite—including resource, component and acceleration layers—as well as concrete training and inference optimizations that raise MFU by 30‑60% and cut deployment costs.

AI InfrastructureLarge ModelMFU
0 likes · 25 min read
How Baidu’s AIAK‑LLM Supercharges Large‑Model Training and Inference
Efficient Ops
Efficient Ops
May 14, 2024 · Artificial Intelligence

How Large‑Model Agents Are Revolutionizing AIOps and Modern Operations

This article explores why large‑model Agent technology is essential for AIOps, explains single‑ and multi‑Agent architectures, memory and tool integration, and demonstrates practical applications such as anomaly detection, fault diagnosis, automated remediation, ChatOps, and future directions for intelligent, autonomous operations.

AI agentsLLMLarge Model
0 likes · 14 min read
How Large‑Model Agents Are Revolutionizing AIOps and Modern Operations
DataFunTalk
DataFunTalk
Mar 11, 2024 · Artificial Intelligence

Challenges and Future Directions for Knowledge Graph Construction in the Era of Large Models

The article examines the high construction cost and lack of unified standards in knowledge graphs, explains why large language models cannot fully solve core issues such as hallucination and multi‑hop reasoning, and argues that a new, unified semantic framework integrating large models is essential for future progress.

AIKnowledge GraphLarge Model
0 likes · 5 min read
Challenges and Future Directions for Knowledge Graph Construction in the Era of Large Models
Huawei Cloud Developer Alliance
Huawei Cloud Developer Alliance
Mar 1, 2024 · Artificial Intelligence

How Huawei’s Pangu AI Model Revolutionizes 7‑Day Global Weather Forecasts

Huawei Cloud’s Pangu weather large model leverages a 3‑D neural network and AI‑driven temporal aggregation to deliver seven‑day global forecasts up to ten thousand times faster than traditional methods, improving accuracy and lead time over established ECMWF systems.

Large ModelNumerical Weather Predictionartificial intelligence
0 likes · 4 min read
How Huawei’s Pangu AI Model Revolutionizes 7‑Day Global Weather Forecasts
Baobao Algorithm Notes
Baobao Algorithm Notes
Dec 11, 2023 · Artificial Intelligence

Boost Large‑Model Fine‑Tuning with Low‑Cost Data Selection and Construction

The article explains practical techniques for choosing and constructing fine‑tuning data for large language models, covering data diversity through similarity‑based clustering, semi‑supervised filtering with binary classifiers, and uncertainty‑driven sampling using perplexity or reward models to build an efficient, low‑cost pipeline.

Large ModelReward modelactive learning
0 likes · 9 min read
Boost Large‑Model Fine‑Tuning with Low‑Cost Data Selection and Construction
Advanced AI Application Practice
Advanced AI Application Practice
Nov 24, 2023 · Artificial Intelligence

Boost Report Testing Efficiency with an AI Large Model

The article demonstrates how Tencent's Hunyuan large model can generate Python scripts to automatically compare Excel‑based reports, highlight differences, and handle multiple files, turning a tedious manual regression test into a fast, reliable automated process.

AIAutomationLarge Model
0 likes · 4 min read
Boost Report Testing Efficiency with an AI Large Model
Tencent Tech
Tencent Tech
Oct 26, 2023 · Artificial Intelligence

Unlocking Tencent Hunyuan Text‑to‑Image: A Complete Guide and Prompt Tips

This guide introduces Tencent Hunyuan's upgraded text‑to‑image model, explains its technical innovations, provides detailed prompt engineering advice, showcases example prompts and generated images across various styles, and highlights real‑world applications and performance metrics for developers and creators.

AI GenerationLarge ModelPrompt engineering
0 likes · 12 min read
Unlocking Tencent Hunyuan Text‑to‑Image: A Complete Guide and Prompt Tips
ByteDance Cloud Native
ByteDance Cloud Native
Jun 13, 2023 · Artificial Intelligence

How Ray and Cloud‑Native Tech Supercharge Large‑Model Offline Inference

This article explains the challenges of large‑model offline (batch) inference, such as GPU memory limits and distributed scheduling, and shows how Ray’s cloud‑native architecture, model partitioning, and Ray Datasets can be used to build efficient, elastic inference frameworks deployed with KubeRay.

GPU MemoryLarge ModelRay
0 likes · 18 min read
How Ray and Cloud‑Native Tech Supercharge Large‑Model Offline Inference
Huawei Cloud Developer Alliance
Huawei Cloud Developer Alliance
Mar 18, 2023 · Artificial Intelligence

Unveiling NetEase’s ‘YuZhi’ Multimodal Model: Boosting Personalized Recommendations

NetEase’s Fuxi team developed the multimodal ‘YuZhi’ model, a large‑scale image‑text dual‑tower system optimized with the EET inference framework, which powers personalized recommendations in NetEase News and Cloud Music, while a partnership with Huawei Ascend AI and MindSpore enables further model acceleration, compression, and the new ‘YuZhi‑Wukong’ model that improves video recommendation metrics by about 5%.

Huawei Ascend AILarge ModelMindSpore
0 likes · 5 min read
Unveiling NetEase’s ‘YuZhi’ Multimodal Model: Boosting Personalized Recommendations
DataFunSummit
DataFunSummit
Dec 19, 2022 · Artificial Intelligence

Multimodal Large‑Model Driven Virtual Digital Humans: Background, Methods, and Applications

This article introduces the rapid development of multimodal digital humans powered by large AI models, covering their background, current challenges, NeRF‑GAN based modeling methods, multimodal dialogue capabilities, and real‑world application cases such as virtual assistants, tourism guides, and sign‑language avatars.

AIGCHuman-Computer InteractionLarge Model
0 likes · 14 min read
Multimodal Large‑Model Driven Virtual Digital Humans: Background, Methods, and Applications
ITPUB
ITPUB
Jun 25, 2021 · Artificial Intelligence

How Alibaba’s Low‑Carbon M6 Model Trains a Trillion‑Parameter AI with 80% Less Energy

Alibaba’s DAMO Academy unveiled the low‑carbon M6 multimodal model, a trillion‑parameter AI trained on just 480 V100 GPUs, achieving over 80% energy reduction and 11‑fold speedup compared to prior trillion‑parameter efforts, and already powering e‑commerce and manufacturing design tools.

GPU efficiencyLarge ModelM6
0 likes · 5 min read
How Alibaba’s Low‑Carbon M6 Model Trains a Trillion‑Parameter AI with 80% Less Energy