Tagged articles
269 articles
Page 3 of 3
Architects' Tech Alliance
Architects' Tech Alliance
Jun 10, 2024 · Industry Insights

What China Can Learn from Domestic Large‑Model Compute: Challenges and Strategic Recommendations

The article analyzes the computing characteristics and system challenges of domestically trained large AI models, outlines the shortcomings of current Chinese efforts, and proposes six strategic actions—including scaling compute, improving data management, building a national R&D team, and boosting funding and policy support—to accelerate China’s transition from following to leading in AI.

AIChinaCompute infrastructure
0 likes · 5 min read
What China Can Learn from Domestic Large‑Model Compute: Challenges and Strategic Recommendations
DevOpsClub
DevOpsClub
Jun 3, 2024 · R&D Management

Scaling One‑Stop R&D Efficiency Platforms for Ten‑Thousand Engineers in the Large‑Model Era

Zhang Le’s IDCF talk reviews the latest design principles and real‑world implementation of one‑stop R&D efficiency platforms in large enterprises, comparing leading Chinese internet firms, outlining strategies for integrating large‑model technology, and detailing the five‑stage evolution of Tencent’s platform that boosts productivity across massive development teams.

Enterprise DevelopmentR&D managementSoftware Efficiency
0 likes · 3 min read
Scaling One‑Stop R&D Efficiency Platforms for Ten‑Thousand Engineers in the Large‑Model Era
Baidu Tech Salon
Baidu Tech Salon
May 27, 2024 · Industry Insights

How Large Language Models Are Transforming Industries

This report examines the technical evolution of large AI models, outlines industry demand, explains how these models add value across sectors, and discusses future challenges and considerations for large‑model‑driven transformation in a wide range of businesses.

AI adoptionartificial intelligenceindustry applications
0 likes · 2 min read
How Large Language Models Are Transforming Industries
Architects' Tech Alliance
Architects' Tech Alliance
May 18, 2024 · Industry Insights

Why Kimi Is Redefining China’s AI Large‑Model Landscape

The article analyzes how Kimi’s superior long‑context capabilities have propelled it to the top of user traffic in China, reshaping the competitive dynamics among domestic and international AI large models and driving a rapid surge in compute demand across both C‑end and B‑end applications.

AIChinaKimi
0 likes · 6 min read
Why Kimi Is Redefining China’s AI Large‑Model Landscape
Architects' Tech Alliance
Architects' Tech Alliance
Apr 25, 2024 · Industry Insights

What China’s AI Labs Learned from Scaling Domestic Large‑Model Training

The article analyzes the computational characteristics and system challenges of training large AI models on domestic platforms, examines framework parallelism and future algorithms, and proposes six strategic measures—including scaling compute, improving data management, building a national R&D team, and boosting AI‑chip investment—to accelerate China’s AI leadership.

AI InfrastructureModel Trainingdomestic AI
0 likes · 5 min read
What China’s AI Labs Learned from Scaling Domestic Large‑Model Training
Baidu Geek Talk
Baidu Geek Talk
Apr 24, 2024 · Industry Insights

How Baidu’s New AI OS “WanYuan” Redefines Intelligent Computing

At the Create 2024 Baidu AI Developer Conference, Baidu unveiled its next‑generation intelligent computing operating system WanYuan, detailing its cluster‑scale management, GPU‑centric performance, integrated large‑model services, and a layered architecture that aims to simplify AI‑native application development and accelerate the AI era.

AIBaiduCluster Management
0 likes · 12 min read
How Baidu’s New AI OS “WanYuan” Redefines Intelligent Computing
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Apr 24, 2024 · Artificial Intelligence

Evolution and Challenges of AI Infrastructure: Scaling Large Models on Cloud GPUs

In this talk from the 2024 China Generative AI Conference, Li Peng outlines the escalating computational demands of large‑model training and inference, identifies power, memory and communication walls, and presents Alibaba Cloud’s DeepGPU solutions and best‑practice strategies for scaling AI workloads on cloud GPUs.

DeepGPUGPU performancecloud computing
0 likes · 13 min read
Evolution and Challenges of AI Infrastructure: Scaling Large Models on Cloud GPUs
Baidu Tech Salon
Baidu Tech Salon
Apr 24, 2024 · Industry Insights

How Baidu’s Wenxin Model and PaddlePaddle Accelerate AI in the Intelligent Era

At the 3rd China International Software Development Conference, Baidu’s CTO Wang Haifeng highlighted the role of operating systems in the AI era, showcasing PaddlePaddle’s open‑source platform, the Wenxin large model’s efficiency gains, and the emergence of intelligent agents and code assistants.

AIChina technologyIntelligent agents
0 likes · 4 min read
How Baidu’s Wenxin Model and PaddlePaddle Accelerate AI in the Intelligent Era
DevOps
DevOps
Apr 18, 2024 · Artificial Intelligence

Expert Round‑Table on AIGC: Technology vs. Market Beliefs, Domestic Model Challenges, and Enterprise Deployment in China

The article presents a 2024 AIGC round‑table where Chinese experts discuss whether to follow a technology‑first or market‑first approach, the challenges of compute, algorithms and data, domestic versus foreign large‑model strategies, multi‑model deployment in enterprises, and criteria for evaluating successful AIGC applications.

AI deploymentAIGCChina AI
0 likes · 14 min read
Expert Round‑Table on AIGC: Technology vs. Market Beliefs, Domestic Model Challenges, and Enterprise Deployment in China
Baidu Tech Salon
Baidu Tech Salon
Apr 16, 2024 · Artificial Intelligence

How Baidu’s AI Tools Turn Everyone Into a Developer – Key Takeaways from Li Yanhong’s Speech

In his Create 2024 AI Developer Conference keynote, Li Yanhong outlines Baidu’s latest large‑model series, AI‑native development platforms (AgentBuilder, AppBuilder, ModelBuilder), performance breakthroughs, real‑world case studies, and the strategic vision that makes AI development accessible to all developers and enterprises.

AIdeveloper toolsindustry trends
0 likes · 28 min read
How Baidu’s AI Tools Turn Everyone Into a Developer – Key Takeaways from Li Yanhong’s Speech
Baidu Tech Salon
Baidu Tech Salon
Apr 16, 2024 · Artificial Intelligence

Baidu Launches Second 'Wenxin Cup' AI Startup Competition with Up to 50 Million RMB Investment

Baidu has launched the second Wenxin Cup AI startup competition, inviting global and university teams to develop AI‑native applications and compete for cash prizes—including a new Special Award of up to 50 million RMB—while providing technical guidance, ecosystem resources, and mentorship to accelerate the AI entrepreneurship ecosystem.

AI DevelopmentAI competitionBaidu
0 likes · 6 min read
Baidu Launches Second 'Wenxin Cup' AI Startup Competition with Up to 50 Million RMB Investment
DataFunSummit
DataFunSummit
Apr 12, 2024 · Artificial Intelligence

Exploring the Application of AI Large Models in the Automotive Industry

This article provides a comprehensive overview of AI large‑model development, defines what constitutes a large model, discusses current challenges such as cost, privacy and safety, and examines how these models can improve efficiency across automotive marketing, sales, service, data management, infrastructure building, and future automation stages.

AIData Managementautomotive
0 likes · 13 min read
Exploring the Application of AI Large Models in the Automotive Industry
DataFunSummit
DataFunSummit
Mar 29, 2024 · Artificial Intelligence

Large Language Model (LLM) Revolution in Recommendation Systems: Overview, Techniques, and Future Directions

This article reviews how the rapid rise of large language models, exemplified by ChatGPT, is transforming recommendation systems by addressing traditional ID‑centric limitations, introducing prompt‑based and ID‑free representations, discussing recent research advances, practical challenges, and future research directions.

AILLMRecommendation Systems
0 likes · 18 min read
Large Language Model (LLM) Revolution in Recommendation Systems: Overview, Techniques, and Future Directions
Baidu Geek Talk
Baidu Geek Talk
Mar 27, 2024 · Industry Insights

How Baidu’s Qianfan Platform Is Accelerating Enterprise AI Adoption

The article reviews Baidu’s Qianfan AI platform, highlighting rapid large‑model advances, enterprise challenges, new AppBuilder features, lightweight model releases, and cost‑effective model routing that together aim to boost AI adoption across industries.

AIEnterprise AIModel Optimization
0 likes · 16 min read
How Baidu’s Qianfan Platform Is Accelerating Enterprise AI Adoption
JD Retail Technology
JD Retail Technology
Mar 12, 2024 · Artificial Intelligence

Multimodal Large Models: Recent Advances, Industry Impact, and Challenges – An Expert Interview

In a detailed interview, Tsinghua researcher Zhao Sicheng and JD Retail senior director Peng Changping discuss the latest progress in multimodal large models, their practical applications in advertising and e‑commerce, persistent challenges such as hallucinations and data alignment, and the skills engineers need to thrive in the emerging AI era.

AI researchMultimodal AIe‑commerce
0 likes · 19 min read
Multimodal Large Models: Recent Advances, Industry Impact, and Challenges – An Expert Interview
Kuaishou Tech
Kuaishou Tech
Mar 9, 2024 · Artificial Intelligence

Kuaishou Launches Future High-Tech Video Intelligence Innovation Center Showcasing AI, Codec, and Transmission Advances

On March 1, Kuaishou hosted the launch of the Future High-Tech Video Intelligence Innovation Center in Beijing, highlighting collaborations with leading universities, breakthroughs in video codecs, a new transmission protocol, 6DoF video, digital humans, and large AI models that aim to drive industry-wide digital and intelligent transformation.

Innovation CenterKuaishouVideo AI
0 likes · 10 min read
Kuaishou Launches Future High-Tech Video Intelligence Innovation Center Showcasing AI, Codec, and Transmission Advances
Alibaba Cloud Developer
Alibaba Cloud Developer
Mar 7, 2024 · Cloud Computing

How Cloud Price Cuts and Hybrid Strategies Transform Cost Efficiency

In this round‑table discussion, industry experts analyze Alibaba Cloud's historic price reductions, compare self‑built IDC versus public cloud economics, share iReader's phased migration experience, explore the broader value of cloud services beyond raw resources, and examine how large‑model AI workloads reshape cost and innovation strategies.

Cost Optimizationcloud computingcloud migration
0 likes · 26 min read
How Cloud Price Cuts and Hybrid Strategies Transform Cost Efficiency
Architects' Tech Alliance
Architects' Tech Alliance
Mar 6, 2024 · Industry Insights

Why AI Large Models Are Shaping the Next Industrial Revolution

The article examines the current state and future trajectory of AI large models, highlighting GPT's leadership, domestic Chinese developments, emerging multimodal applications, evolving revenue models, and market size forecasts that signal a transformative shift in AI-driven industry.

AGIAIIndustry analysis
0 likes · 6 min read
Why AI Large Models Are Shaping the Next Industrial Revolution
JD Retail Technology
JD Retail Technology
Feb 1, 2024 · Artificial Intelligence

Evolution and Optimization of JD Retail Advertising Online Model System: From Deep Learning to Distributed Graph Computing and Power Collaboration

The article details JD Retail Advertising's three‑stage evolution of its online model system—deep‑learning era, large‑model era, and power‑collaboration era—highlighting heterogeneous computing optimizations, platform and system capabilities, distributed graph computing, online learning, and dynamic power allocation to dramatically improve algorithm iteration speed and model performance.

AIAdvertisingdistributed graph
0 likes · 13 min read
Evolution and Optimization of JD Retail Advertising Online Model System: From Deep Learning to Distributed Graph Computing and Power Collaboration
DataFunSummit
DataFunSummit
Jan 22, 2024 · Artificial Intelligence

Improving Efficiency of Large‑Scale AI Model Training, Fine‑tuning, and Deployment with Colossal‑AI

This article introduces Colossal‑AI, an open‑source platform that tackles the challenges of training, fine‑tuning, and deploying massive AI models by leveraging efficient memory management, N‑dimensional parallelism, and high‑performance inference to dramatically reduce cost and improve scalability across thousands of GPUs.

AI InfrastructureColossal-AIDistributed Training
0 likes · 21 min read
Improving Efficiency of Large‑Scale AI Model Training, Fine‑tuning, and Deployment with Colossal‑AI
AntTech
AntTech
Jan 9, 2024 · Artificial Intelligence

ATorch: Ant Group’s Open‑Source Distributed Training Acceleration Library for Large‑Scale AI Models

Ant Group’s newly open‑sourced ATorch library extends PyTorch with a layered architecture and automated resource‑aware strategies, boosting large‑model training efficiency up to 60% utilization, enhancing stability, and delivering significant throughput gains across multi‑node, multi‑GPU deployments.

AI accelerationDistributed TrainingPyTorch
0 likes · 6 min read
ATorch: Ant Group’s Open‑Source Distributed Training Acceleration Library for Large‑Scale AI Models
Huawei Cloud Developer Alliance
Huawei Cloud Developer Alliance
Jan 9, 2024 · Artificial Intelligence

How AI Large Models Are Revolutionizing Enterprises – Key Insights from Huawei Cloud

This article explores the rapid evolution of AI large models, their practical applications in intelligent customer service, text generation, digital humans, and healthcare, while addressing challenges such as compute costs, data quality, security, and model hallucinations, and offering expert strategies for successful enterprise adoption.

AIEnterprise AIdigital humans
0 likes · 12 min read
How AI Large Models Are Revolutionizing Enterprises – Key Insights from Huawei Cloud
DataFunSummit
DataFunSummit
Jan 5, 2024 · Artificial Intelligence

Multimodal Large Model Platform: History, Architecture, Practices, and Future Outlook by Jiuzhang Yunji DataCanvas

This article reviews the evolution of multimodal large models, introduces Jiuzhang Yunji DataCanvas' multimodal model platform—including AI foundation software, model tools, serving, and prompt management—shares practical building methods, memory‑augmented models, ETL pipelines, knowledge‑base applications, and offers a forward‑looking perspective on enterprise data management and intelligent agents.

AI Foundation SoftwareKnowledge BaseMultimodal AI
0 likes · 14 min read
Multimodal Large Model Platform: History, Architecture, Practices, and Future Outlook by Jiuzhang Yunji DataCanvas
NetEase Smart Enterprise Tech+
NetEase Smart Enterprise Tech+
Jan 4, 2024 · Artificial Intelligence

How to Strengthen AIGC Content Safety with Multimodal Data and Model Upgrades

The article examines the security challenges introduced by large‑model AIGC, outlines three technical upgrade paths—richer training data, few‑shot model fine‑tuning, and multimodal fusion—and demonstrates practical implementations that dramatically improve detection efficiency, accuracy, and scalability.

AI securityAIGCContent Safety
0 likes · 24 min read
How to Strengthen AIGC Content Safety with Multimodal Data and Model Upgrades
DataFunTalk
DataFunTalk
Jan 2, 2024 · Artificial Intelligence

Mid‑Stage Reflections on Large‑Model Technology and Its Industry Impact

This article offers a comprehensive mid‑stage analysis of large‑model technology, discussing its rapid development, emerging challenges such as cost and hallucinations, positioning, scenario applications, cost‑value trade‑offs, and strategic pathways for future research and deployment.

AIApplicationsCost
0 likes · 21 min read
Mid‑Stage Reflections on Large‑Model Technology and Its Industry Impact
Meituan Technology Team
Meituan Technology Team
Dec 28, 2023 · Artificial Intelligence

Key Insights from the Meituan Robotics Institute Academic Annual Meeting on Embodied Intelligence and AI‑Driven Robotics

At the Meituan Robotics Institute’s 2023 academic annual meeting in Shenzhen, leading scholars highlighted embodied intelligence as a learning “child,” emphasized large‑scale models and big data as catalysts for advanced perception and control, identified logistics, manufacturing, elder‑care and scientific research as near‑term markets, and called for deeper industry‑academia collaboration to accelerate robot deployment and explore new applications.

Embodied IntelligenceIndustry-Academia CollaborationLogistics
0 likes · 30 min read
Key Insights from the Meituan Robotics Institute Academic Annual Meeting on Embodied Intelligence and AI‑Driven Robotics
AntTech
AntTech
Dec 26, 2023 · Artificial Intelligence

Key Insights from Wang Weiqiang’s Speech on Large‑Model Security at the AI Innovation and Governance Conference

Wang Weiqiang, chief scientist of Ant Group’s Security Lab, highlighted the urgent need for both rapid detection and long‑term trustworthy safeguards for large AI models, outlining Ant’s data‑detox, guard‑rail, and detection platforms as core solutions to emerging risks such as hallucinations, bias, and data leakage.

AI GovernanceAnt Grouplarge models
0 likes · 10 min read
Key Insights from Wang Weiqiang’s Speech on Large‑Model Security at the AI Innovation and Governance Conference
DataFunSummit
DataFunSummit
Dec 16, 2023 · Artificial Intelligence

Enterprise Large Model Deployment: Data Governance, Fine‑Tuning Strategies, and Cost Economics

The article examines how enterprises can adopt domain‑specific large models by addressing talent and cost challenges, outlining self‑supervised pre‑training, instruction fine‑tuning, data governance for unstructured data, dataset balance, model‑type selection, and integrated product solutions to achieve efficient, high‑performance AI deployments.

AI deploymentData GovernanceEnterprise AI
0 likes · 13 min read
Enterprise Large Model Deployment: Data Governance, Fine‑Tuning Strategies, and Cost Economics
Baidu Geek Talk
Baidu Geek Talk
Dec 6, 2023 · Industry Insights

From MLOps to LMOps: Challenges and Solutions for Large‑Model Operations

This article reviews the evolution from MLOps to LMOps, outlines the core concepts, challenges, and key technologies such as large‑model inference optimization, prompt engineering, and context‑length extension, and offers a forward‑looking perspective on the future of AI operations.

AI OperationsLMOpsMLOps
0 likes · 23 min read
From MLOps to LMOps: Challenges and Solutions for Large‑Model Operations
DaTaobao Tech
DaTaobao Tech
Nov 22, 2023 · Artificial Intelligence

AI Integration in E-commerce: Taobao's Double 11 Experience

Taobao’s Double 11 showcased AI assistants that deliver personalized recommendations, virtual try‑ons, and product comparisons while empowering merchants with AI‑generated content and large‑language‑model tools, illustrating how retrieval‑augmented generation and LoRA address scaling challenges and model hallucinations to create a seamless, data‑driven shopping experience.

AIAIGCDouble 11
0 likes · 11 min read
AI Integration in E-commerce: Taobao's Double 11 Experience
Baidu Geek Talk
Baidu Geek Talk
Nov 7, 2023 · Artificial Intelligence

Interview on AI Image Generation (Text-to-Image) Technology and Baidu Search Applications

In a recent InfoQ Geek Talk, Baidu Search chief architect Tianbao discussed the rapid evolution of AI text‑to‑image technology—highlighting Chinese‑language data preparation, prompt‑engineering challenges, evaluation methods combining human feedback and metrics, and future video‑generation prospects—while announcing openings for visual algorithm engineers.

AI image generationAIGCBaidu
0 likes · 24 min read
Interview on AI Image Generation (Text-to-Image) Technology and Baidu Search Applications
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Nov 6, 2023 · Artificial Intelligence

Large Models and Recommendation Systems: Challenges, Opportunities, and Future Directions

At CNCC 2023, leading researchers and industry experts convened to examine how large language models can transform recommendation systems, outlining four core challenges—model integration, fluency versus intelligence, hallucination versus deception, and user understanding—while highlighting opportunities such as multimodal content, cold‑start solutions, zero‑shot ranking, instruction‑driven algorithms, and responsible, interactive recommendation pipelines.

AICNCC 2023LLM applications
0 likes · 16 min read
Large Models and Recommendation Systems: Challenges, Opportunities, and Future Directions
Meituan Technology Team
Meituan Technology Team
Oct 26, 2023 · Artificial Intelligence

First Tsinghua-MeiTuan Digital Life Joint Research Institute Academic Forum

The inaugural Tsinghua‑Meituan Digital Life Joint Research Institute Academic Forum, scheduled for November 1 2023, will convene Academician Zheng Weimin, Tsinghua scholars, and Meituan technical experts to discuss intelligent unmanned systems in the era of large‑scale models, sharing cutting‑edge research and addressing emerging challenges.

AI technologyResearch Instituteacademic forum
0 likes · 1 min read
First Tsinghua-MeiTuan Digital Life Joint Research Institute Academic Forum
Alimama Tech
Alimama Tech
Sep 20, 2023 · Artificial Intelligence

CCF C³ Forum: AI Technology Driving Business Transformation

The 23rd CCF C³ Forum, organized by Alibaba’s Alimama and the CCF CTO Club, examined how large‑model AI is reshaping intelligent business technology, from data‑driven to knowledge‑driven approaches, enhancing e‑commerce with smarter search, personalized recommendations, content creation, and guiding merchants on future AI‑native strategies.

AI technologyAI-native businessData Intelligence
0 likes · 8 min read
CCF C³ Forum: AI Technology Driving Business Transformation
AntTech
AntTech
Sep 18, 2023 · Artificial Intelligence

Green Computing in the Era of Large Models: Highlights from the 2023 Inclusion·Bund Conference

The 2023 Inclusion·Bund Conference examined the rapid growth of large‑model AI, the resulting GPU shortage, and presented multi‑sector strategies—including policy guidance, ESG standards, AI‑driven optimization, and collaborative standards—to achieve sustainable, energy‑efficient computing across the entire ecosystem.

AIData CentersSustainability
0 likes · 7 min read
Green Computing in the Era of Large Models: Highlights from the 2023 Inclusion·Bund Conference
DataFunSummit
DataFunSummit
Sep 13, 2023 · Artificial Intelligence

Data Engineering, Automated Evaluation, and Knowledge Graph Integration in Large Model Development

This article presents a comprehensive overview of data engineering practices for large model training, reviews current model scales and pre‑training data sources, discusses automated evaluation techniques, and explores how knowledge graphs can be integrated throughout the model lifecycle to improve quality and applicability.

AIautomated evaluationdata engineering
0 likes · 29 min read
Data Engineering, Automated Evaluation, and Knowledge Graph Integration in Large Model Development
DataFunSummit
DataFunSummit
Sep 11, 2023 · Artificial Intelligence

Challenges and Insights for Deploying Large Models on Edge with MNN

The talk presents an overview of the MNN inference engine, outlines the end‑to‑end workflow for deploying large language models on mobile devices, discusses technical challenges and practical solutions, and concludes with future directions for edge AI deployment.

AIInference EngineMNN
0 likes · 2 min read
Challenges and Insights for Deploying Large Models on Edge with MNN
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Aug 25, 2023 · Artificial Intelligence

DataFunSummit 2023: Recommendation Systems Online Summit

The DataFunSummit 2023 online summit (August 26‑27) will explore eight recommendation‑system topics—including core and engineering architecture, model training/inference, large models, graphs, cold start, and multi‑task scenarios—featuring Xiaohongshu leaders who will present on graph‑based business architecture, integrated training‑inference pipelines, and user/content cold‑start strategies.

AI EngineeringRecommendation Systemsarchitecture
0 likes · 6 min read
DataFunSummit 2023: Recommendation Systems Online Summit
Baidu Geek Talk
Baidu Geek Talk
Aug 10, 2023 · Industry Insights

How Baidu’s AIGC Competition Is Shaping the Future of Commercial AI

The article examines Baidu’s inaugural Commercial AI Innovation Competition, highlighting its focus on AIGC commercial applications such as conversion behavior prediction and inference performance optimization, and explores how large‑scale models like PaddleBox and AI‑native tools are poised to transform content creation, marketing, and enterprise operations.

AI industryAIGCBaidu
0 likes · 10 min read
How Baidu’s AIGC Competition Is Shaping the Future of Commercial AI
Huawei Cloud Developer Alliance
Huawei Cloud Developer Alliance
Jul 17, 2023 · Artificial Intelligence

How MindSpore’s Auto Parallel Tech Simplifies Large-Model Training

During a livestream titled “Solving the ‘Development Difficulty’ of Large Models with MindSpore Auto Parallel”, Huawei’s MindSpore experts explained how the framework’s distributed training techniques—including data, model, and pipeline parallelism as well as memory‑saving strategies—enable efficient pre‑training of trillion‑parameter models across diverse AI domains.

Data ParallelDistributed TrainingMemory Optimization
0 likes · 6 min read
How MindSpore’s Auto Parallel Tech Simplifies Large-Model Training
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
Jul 10, 2023 · Artificial Intelligence

How Object Storage Accelerates Large AI Model Training and Inference

This article examines the storage challenges posed by large AI models, analyzes the full workflow from data ingestion to inference, compares HDFS and object‑storage data lakes, and presents Baidu's cloud‑native storage‑acceleration solutions—including RapidFS and PFS—that dramatically improve training speed, checkpoint handling, and model deployment throughput.

AICloud Nativelarge models
0 likes · 19 min read
How Object Storage Accelerates Large AI Model Training and Inference
360 Tech Engineering
360 Tech Engineering
Jul 6, 2023 · Artificial Intelligence

CSIG Enterprise Visit to Qihoo 360: Multimodal and Cross‑Modal Learning in the Era of Large Models

The CSIG‑hosted "Enterprise Visit – Into Qihoo 360" event on June 29, 2023 gathered over a thousand participants to explore multimodal and cross‑modal learning in the large‑model era, featuring keynote speeches from leading university researchers and Qihoo 360 AI experts, a tour of the company's facilities, and discussions on future AI research directions.

CSIGCross-modalQihoo360
0 likes · 8 min read
CSIG Enterprise Visit to Qihoo 360: Multimodal and Cross‑Modal Learning in the Era of Large Models
DevOps
DevOps
Jul 6, 2023 · Cloud Computing

How AI Chips and Strategic Investments Are Redefining the Cloud Computing Landscape

The article analyzes how the rise of large‑language models, the scarcity of AI‑optimized compute chips, and aggressive strategic investments by cloud giants such as Microsoft, Amazon, Google, and Oracle are reshaping competition in the cloud computing market and determining the next wave of growth.

AI chipsAmazonGoogle
0 likes · 12 min read
How AI Chips and Strategic Investments Are Redefining the Cloud Computing Landscape
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
Jun 21, 2023 · Artificial Intelligence

How Baidu’s AIPod Network Powers Massive AI Model Training

This article explains the design and engineering of Baidu's AIPod high‑performance network, detailing the massive bandwidth, scalability, stability, and low‑latency requirements of large‑scale AI model training and the practical tools used to monitor and troubleshoot such workloads.

AIAIPodDistributed Training
0 likes · 22 min read
How Baidu’s AIPod Network Powers Massive AI Model Training
Baidu Tech Salon
Baidu Tech Salon
May 26, 2023 · Industry Insights

How Large Models Are Redefining AI and Shaping the Next Industrial Revolution

In a 2023 Zhi Guan Cun Forum speech, Baidu CEO Robin Li explains how large AI models are compressing human knowledge, transforming human‑computer interaction, redefining marketing and customer service, spawning AI‑native applications, and reshaping the entire technology stack, ultimately driving a new era of industrial growth.

AI-native applicationsHuman-Computer InteractionTechnology Stack
0 likes · 10 min read
How Large Models Are Redefining AI and Shaping the Next Industrial Revolution
OPPO Amber Lab
OPPO Amber Lab
May 16, 2023 · Artificial Intelligence

Key Insights from the 2022 China AI Industry Conference on Security

At the 2022 China AI Industry Conference, leading scientists and industry experts presented cutting‑edge research on AI security, trustworthy mobile devices, adversarial model defenses, data‑free trojan detection, and large‑model challenges, highlighting certifications, practical implementations, and the need for continued collaborative innovation.

Adversarial Machine LearningMobile Securityartificial intelligence
0 likes · 7 min read
Key Insights from the 2022 China AI Industry Conference on Security
DataFunSummit
DataFunSummit
Apr 20, 2023 · Artificial Intelligence

SenseTime Unveils Multimodal ‘SenseNova’ Large Model System and Its Industry Applications

SenseTime introduced its visual‑centric multimodal large‑model platform SenseNova, detailing model scaling, extensive AI infrastructure, diverse industry deployments such as autonomous driving and generative content, and the challenges of compute efficiency and data acquisition in the race for advanced AI.

AI InfrastructureComputer Visionlarge models
0 likes · 13 min read
SenseTime Unveils Multimodal ‘SenseNova’ Large Model System and Its Industry Applications
Alibaba Cloud Developer
Alibaba Cloud Developer
Apr 14, 2023 · Artificial Intelligence

Why Large Models Are Revolutionizing AI: From Foundations to AIGC

This article explores the concept and evolution of large foundation models, their transformative impact on AI-generated content, the underlying technologies such as transformers, diffusion, and CLIP, and discusses the challenges, emerging abilities, and future prospects of these models across multiple modalities.

AIGCGPTdiffusion
0 likes · 32 min read
Why Large Models Are Revolutionizing AI: From Foundations to AIGC
Architects' Tech Alliance
Architects' Tech Alliance
Apr 1, 2023 · Industry Insights

Why GPUs Lag Behind Big AI Models and How In‑Memory Computing Helps

The article examines the growing bottlenecks of large‑scale AI model training caused by the separation of storage and compute, analyzes why conventional GPU architectures cannot keep pace with exponential model growth, and presents in‑memory and near‑memory computing, as well as storage‑compute integration, as promising solutions to boost performance, energy efficiency, and scalability for cloud and edge deployments.

AI computeGPU bottleneckcloud computing
0 likes · 10 min read
Why GPUs Lag Behind Big AI Models and How In‑Memory Computing Helps
AsiaInfo Technology: New Tech Exploration
AsiaInfo Technology: New Tech Exploration
Feb 20, 2023 · Industry Insights

Why Pre‑trained Large Models Are the New Infrastructure for AI Applications

Pre‑trained large models are emerging as the foundational infrastructure for AI across industries; this article analyzes their technical advantages, application trends in NLP, CV and multimodal domains, presents a telecom customer‑service case study with performance benchmarks, and outlines future deployment challenges and research directions.

Computer VisionNLPPrompt Tuning
0 likes · 23 min read
Why Pre‑trained Large Models Are the New Infrastructure for AI Applications
DataFunTalk
DataFunTalk
Feb 14, 2023 · Artificial Intelligence

Cost Optimization and Mixed‑Resource Deployment in Tencent Taiji Machine Learning Platform for Large‑Scale AI Models

The article describes how Tencent's Taiji machine learning platform leverages cloud‑native mixed‑resource strategies—including online idle, tidal, and compute resources—to reduce training costs, improve stability, and support large‑scale AI model training for advertising and other services.

AICloud NativeMachine Learning Platform
0 likes · 17 min read
Cost Optimization and Mixed‑Resource Deployment in Tencent Taiji Machine Learning Platform for Large‑Scale AI Models
DataFunSummit
DataFunSummit
Feb 3, 2023 · Artificial Intelligence

The Evolution of Artificial Intelligence: From Deep Blue to Generative AI and Large Models

From Deep Blue’s 1997 chess victory to today’s generative AI breakthroughs like GPT‑3, DALL‑E, and large‑scale models, this article traces the rapid rise of artificial intelligence, highlighting key milestones, the impact of massive compute and data, and the societal implications of AI’s expanding capabilities.

AI historyartificial intelligencegenerative AI
0 likes · 15 min read
The Evolution of Artificial Intelligence: From Deep Blue to Generative AI and Large Models
DataFunSummit
DataFunSummit
Jan 5, 2023 · Artificial Intelligence

GPU Acceleration Techniques for Large AI Models: Parallelism, Fusion, and Simplification

These notes explain how GPUs address the massive data, serial dependencies, and high computational complexity of modern AI by employing three acceleration strategies—parallelism, operator fusion, and simplification—illustrated with Megatron-LM, MoE models, and practical compression techniques such as quantization, distillation, and pruning.

AIGPUMegatron
0 likes · 16 min read
GPU Acceleration Techniques for Large AI Models: Parallelism, Fusion, and Simplification
DataFunTalk
DataFunTalk
Jan 4, 2023 · Artificial Intelligence

GPU Acceleration Techniques for Large AI Models: Parallelism, Fusion, and Simplification

This article explains how GPUs address the massive data, serial dependencies, and high computational complexity of modern AI by employing three acceleration strategies—parallelism, operator fusion, and simplification—detailing methods such as model, pipeline, and tensor parallelism, Megatron framework, MoE models, and various model compression techniques.

AIGPUMegatron
0 likes · 17 min read
GPU Acceleration Techniques for Large AI Models: Parallelism, Fusion, and Simplification
NetEase Smart Enterprise Tech+
NetEase Smart Enterprise Tech+
Dec 20, 2022 · Artificial Intelligence

How AI Powers SaaS Service Marketing: Real-World Strategies and Future Trends

At QCon 2022, AI expert Feng Minwei shared practical insights on overcoming AI adoption challenges in SaaS service marketing, detailing cost‑reduction, industry expansion, and overseas deployment strategies, while showcasing real‑world cases of intelligent call‑center automation, knowledge‑base enrichment, and the impact of large‑model and generative AI.

AISaaSService Marketing
0 likes · 19 min read
How AI Powers SaaS Service Marketing: Real-World Strategies and Future Trends
Zuoyebang Tech Team
Zuoyebang Tech Team
Sep 23, 2022 · Artificial Intelligence

How AI Powers K‑12 Education: Insights from a Chief Algorithm Expert

In this interview, the chief algorithm expert at Zuoyebang discusses how AI technologies such as NLP, speech recognition, large‑model pre‑training, and knowledge‑graph construction are applied to K‑12 education, covering practical challenges, deployment strategies, and future research directions.

AIEducation TechnologyKnowledge Graph
0 likes · 27 min read
How AI Powers K‑12 Education: Insights from a Chief Algorithm Expert
Baidu Geek Talk
Baidu Geek Talk
Aug 31, 2022 · Artificial Intelligence

Baidu Intelligent Cloud Launches Cloud-native AI 2.0 to Accelerate AI Engineering

Baidu Intelligent Cloud’s new Cloud‑native AI 2.0 platform tackles AI engineering bottlenecks by offering hybrid‑parallel large‑model training, flexible GPU virtualization, and an AI Accelerate Kit that boosts training efficiency over 50 % and cuts inference latency up to 63 %, raising GPU utilization from ~13 % to about 50 %.

AIAI accelerationGPU virtualization
0 likes · 15 min read
Baidu Intelligent Cloud Launches Cloud-native AI 2.0 to Accelerate AI Engineering
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Jul 12, 2022 · Artificial Intelligence

How Whale Enables Efficient Giant Model Training on Heterogeneous GPUs

The article introduces Whale, an open‑source distributed training framework that unifies multiple parallelism strategies, uses hardware‑aware load balancing to accelerate giant models like BERT‑Large and the trillion‑parameter M6 on heterogeneous GPU clusters, and details its architecture, planning, and real‑world performance gains.

Deep LearningParallelismhardware-aware scheduling
0 likes · 11 min read
How Whale Enables Efficient Giant Model Training on Heterogeneous GPUs
Baidu Geek Talk
Baidu Geek Talk
Jul 6, 2022 · Artificial Intelligence

Why Training Massive AI Models Demands New Cluster Architectures and Parallelism Strategies

The article examines the industry trend toward ever‑larger AI models, compares their parameter scale to the human brain, outlines the computational and memory challenges of training such models, and details advanced parallelism techniques and Baidu's high‑performance cluster solutions that enable efficient, stable large‑scale model training.

AI InfrastructureBaiduCluster Computing
0 likes · 17 min read
Why Training Massive AI Models Demands New Cluster Architectures and Parallelism Strategies
Baidu Tech Salon
Baidu Tech Salon
Jan 25, 2022 · Artificial Intelligence

What AI Trends Will Shape 2022? Insights from Baidu Research

Amid post‑pandemic uncertainty, Baidu Research outlines 2022 AI breakthroughs—from large‑model advancements and cross‑modal knowledge enhancement to AI‑driven scientific discovery, privacy computing, quantum hardware, autonomous driving, green AI, and inclusive technologies—highlighting how these trends will reshape industries and society.

2022AIGreen AI
0 likes · 10 min read
What AI Trends Will Shape 2022? Insights from Baidu Research
Alibaba Cloud Developer
Alibaba Cloud Developer
Aug 17, 2021 · Artificial Intelligence

How Alibaba’s Whale Framework Cuts Large‑Model Training Costs by 80%

Alibaba Cloud’s PAI team and the DAMO Academy introduced the low‑carbon M6 trillion‑parameter multimodal model, demonstrating that their self‑developed Whale framework can train such massive models on just 480 V100 GPUs, reducing energy consumption by over 80% and boosting training efficiency nearly eleven‑fold.

AIDistributed TrainingGPU Optimization
0 likes · 12 min read
How Alibaba’s Whale Framework Cuts Large‑Model Training Costs by 80%