Tagged articles

large models

282 articles · Page 2 of 3

Jun 12, 2025 · Artificial Intelligence

How Alibaba Cloud’s AI Search Evolves with Agentic RAG and Multi‑Model Innovations

This article details Alibaba Cloud AI Search’s development journey, covering its dual product lines, the evolution of Agentic RAG technology, multi‑agent architectures, vector retrieval breakthroughs, GPU‑accelerated indexing, NL2SQL capabilities, deployment models, and future directions for AI‑driven search solutions.

AI Search · GPU Acceleration · OpenSearch

0 likes · 33 min read

Big Data Tech Team

Jun 11, 2025 · Industry Insights

How AI Large Models Will Revolutionize Data Governance in 2025

This whitepaper examines the accelerating growth of enterprise data, the limitations of traditional rule‑based governance, and how multimodal AI large models—combined with privacy‑preserving techniques—can create a four‑layer, six‑domain architecture that automates metadata management, quality control, and compliance, delivering measurable efficiency gains across finance, manufacturing, and retail sectors.

AI · large models

0 likes · 9 min read

Data Thinking Notes

Jun 8, 2025 · Artificial Intelligence

Explore the Complete AI Large Model Technology Landscape: Architecture Diagrams Across Industries

This article presents a panoramic view of AI large‑model technologies, showcasing a series of architecture diagrams that illustrate general model frameworks, RAG knowledge‑base structures, agricultural and retail applications, IoT integration, compliance and risk‑management setups, agent platforms, and CRM‑enhanced solutions.

AI · RAG · architecture

0 likes · 3 min read

ITFLY8 Architecture Home

Jun 5, 2025 · Artificial Intelligence

Why Large Models Are Redefining Software: The Four AI Tech Drivers

The article explains how rapid AI advances and the AI Agent architecture are reshaping software development, outlines four key technical drivers—embedding, Transformer scaling laws, scenario Moore's law, and LLM OS—and discusses the security, professionalism, and responsibility challenges enterprises face when deploying AI‑native applications.

AI Architecture · Embedding · Enterprise AI

0 likes · 6 min read

Baobao Algorithm Notes

Jun 3, 2025 · Artificial Intelligence

How to Train a 671B‑Scale Model with RL: Insights from a verl Internship

This article shares a detailed, first‑hand analysis of the technical challenges, framework choices, memory management, weight conversion, precision alignment, and efficiency optimizations encountered while building reinforcement‑learning pipelines for a 671‑billion‑parameter model using the verl ecosystem.

GPU Memory Management · Megatron · RL Training

0 likes · 16 min read

DataFunSummit

Jun 2, 2025 · Artificial Intelligence

Enterprise Knowledge Brain Powered by Large Models and Knowledge Graphs

This article explains how the rapid development of large language models and knowledge graph technologies creates new opportunities for enterprise knowledge management, outlines the challenges of massive unstructured data, describes the architecture and core data flow of a corporate knowledge brain, and showcases key technologies and real‑world applications.

AI Architecture · Data Integration · Enterprise AI

0 likes · 13 min read

Huolala Tech

May 29, 2025 · Artificial Intelligence

How LWS Enables Scalable Multi‑Node Large Model Deployment on Kubernetes

The article explains how the Dolphin AI platform tackles large‑model deployment challenges by replacing standard Kubernetes Deployments with LeaderWorkerSet, detailing its architecture, features, installation steps, example configurations, testing, scaling, rolling updates, fault recovery, and future roadmap for AI workloads.

AI platform · Distributed Inference · LeaderWorkerSet

0 likes · 12 min read

AntTech

May 22, 2025 · Artificial Intelligence

How Massive Data Shapes the AGI Era: Challenges and Opportunities

In his OceanBase developer conference keynote, Ant Group CTO He Zhengyu analyzes how the explosion of data fuels AGI progress, outlines four key data challenges—cost, scarcity, multimodality, and quality assessment—and argues that overcoming them will turn data companies into AI leaders.

AGI · artificial-intelligence · data challenges

0 likes · 8 min read

JD Retail Technology

May 19, 2025 · Artificial Intelligence

How JD’s Omniforce Boosts Large Model Efficiency with Cloud‑Edge Collaboration

The JD Exploration Institute paper introduces Omniforce, a human‑centered, cloud‑edge collaborative AutoML system that uses model distillation, dynamic data governance, Bayesian‑optimized training, and edge deployment to cut large‑model training costs by 70% and improve inference speed by 30%, powering the JoyBuild platform for broader AI adoption.

AI efficiency · AutoML · JoyBuild

0 likes · 6 min read

Baidu Geek Talk

May 14, 2025 · Industry Insights

How RapidFS Boosts AI Model Training with 10 TiB/s Throughput

The article explains how large‑scale AI model training and inference require massive data handling, describes the RapidFS storage acceleration cluster deployed on a 30,000‑card Kunlun chip system with hundreds of domestic CPU servers, and presents performance tests showing linear throughput scaling up to over 1 TiB/s, demonstrating the impact of high‑performance storage on compute efficiency.

AI training · High-performance computing · RapidFS

0 likes · 5 min read

AntTech

May 12, 2025 · Industry Insights

How AI Large Models Are Revolutionizing Multimodal Content Safety

An award‑winning joint project by Shanghai Jiao Tong University and Ant Group unveils a multimodal foundation model and advanced detection techniques that dramatically improve AI‑driven content risk governance across massive online services.

AI · Ant Group · Content Safety

0 likes · 3 min read

Baobao Algorithm Notes

May 12, 2025 · Artificial Intelligence

Why Dropout Is Dropped in Large‑Scale Model Training: Effects, Efficiency, Stability

Training massive AI models now commonly omits dropout because its original scaling trick fails to match training and inference distributions, leading to poorer performance, higher computational cost, and instability, while alternative regularization like normalization remains useful, as illustrated by practical observations and historical tricks.

AI stability · Dropout · Regularization

0 likes · 6 min read

JD Retail Technology

May 7, 2025 · Artificial Intelligence

Solving Technical Challenges with Large AI Models at JD Retail: Reward Modeling, Query Expansion, and Model Pruning

JD Retail’s engineering team tackles hard AI problems by replacing a monolithic reward model with specialized small models for ad‑image generation, deploying an LLM‑driven query‑expansion pipeline that lifts conversion rates, and pruning text‑to‑image transformers using FFT and RDP to boost throughput 40% without loss, while building comprehensive evaluation tools and a semantic smart‑assistant.

AI · Model Pruning · Query Expansion

0 likes · 14 min read

DevOps

Apr 27, 2025 · Artificial Intelligence

Large Model Technologies: RAG, AI Agents, Multimodal Applications, and Future Trends

This article examines how Retrieval‑Augmented Generation (RAG), AI agents, and multimodal large‑model techniques are reshaping AI‑industry integration, discusses their technical challenges and practical implementations, and outlines future development directions across algorithms, products, and domain‑specific applications.

AI agents · Multimodal · RAG

0 likes · 14 min read

Meituan Technology Team

Apr 24, 2025 · Artificial Intelligence

Meituan AI Recruitment: Join Our Advanced Technology Teams

Meituan's AI recruitment page showcases diverse opportunities across AI infrastructure, intelligent interaction, visual intelligence, and intelligent products, featuring roles from algorithm engineers to product managers working on cutting-edge technologies including large models, intelligent agents, and multimodal systems.

AI recruitment · Intelligent agents · Multimodal AI

0 likes · 5 min read

Tencent Cloud Developer

Apr 24, 2025 · Industry Insights

How RAG, AI Agents, and Multimodal Models Are Reshaping Industry – Trends, Challenges, and Real‑World Cases

The article analyzes the rapid evolution of large‑model technologies—Retrieval‑Augmented Generation, autonomous agents, and multimodal AI—detailing their technical foundations, practical challenges, industry applications such as unified multimodal tasks, open‑world detection, and video moderation, and forecasting future development directions.

AI agents · Industry Trends · Multimodal AI

0 likes · 15 min read

Architects' Tech Alliance

Apr 15, 2025 · Industry Insights

Why DeepSeek’s Private Deployment Is Fueling the AI Model Appliance Market

The article analyzes DeepSeek’s private‑deployment solutions, detailing selection criteria, deployment forms, service models, hardware‑software cost breakdown, technical innovations that lower model and compute barriers, and their impact on government and enterprise AI adoption.

AI · DeepSeek · Hardware Requirements

0 likes · 11 min read

Beijing SF i-TECH City Technology Team

Apr 8, 2025 · Artificial Intelligence

Automatic Algorithm Design for Operations Optimization Using Large Language Models and Evolutionary Techniques

This document outlines how large language models can be combined with evolutionary algorithms such as genetic algorithms to automatically generate, evaluate, and iteratively improve operations‑optimization code for logistics, resource allocation, and staffing scenarios, reducing development cycles, enhancing adaptability, and achieving higher solution quality.

ai-optimization · automated-code-generation · genetic algorithm

0 likes · 21 min read

Architects' Tech Alliance

Apr 3, 2025 · Artificial Intelligence

Why NVLink and NVSwitch Are Essential for Training Massive AI Models

Training today's massive AI foundation models demands extensive GPU resources and sophisticated multi‑GPU communication, making technologies like NVLink and NVSwitch crucial for efficient distributed training, while data‑parallel and model‑parallel strategies together optimize performance across large‑scale hardware clusters.

AI · GPU · NVLink

0 likes · 8 min read

Baidu Tech Salon

Apr 2, 2025 · Artificial Intelligence

PaddlePaddle Framework 3.0 Released: Five Core Innovations for Large Models and Scientific Computing

PaddlePaddle 3.0, launched on April 1, 2025, introduces five core innovations—including dynamic‑static unified automatic parallelism, a training‑inference integrated PIR, high‑order automatic differentiation for scientific computing, a one‑stage CINN compiler, and heterogeneous multi‑chip adaptation—that dramatically reduce distributed‑training code, boost performance up to four‑fold, and extend the framework to aerospace, automotive, meteorology and life‑science applications while remaining fully compatible with the 2.0 API.

PaddlePaddle · Scientific Computing · automatic parallelism

0 likes · 21 min read

DataFunTalk

Apr 2, 2025 · Artificial Intelligence

Trends, Applications, and Future Directions of Large Models and Inference Acceleration

This article examines the current state and future prospects of large AI models and inference acceleration, covering technology trends, diverse application scenarios from research to industry, and the challenges and opportunities that lie ahead for intelligent data governance, multimodal agents, and AGI.

AGI · AI · Data Governance

0 likes · 11 min read

Volcano Engine Developer Services

Apr 1, 2025 · Artificial Intelligence

Taming High Cardinality in AI Model & Autonomous Driving Monitoring with Prometheus

This article explores how high cardinality in Prometheus metrics impacts AI large‑model and autonomous‑driving observability, explains the underlying concepts, outlines the performance and cost challenges, and presents practical design, collection, and query‑side solutions—including metric modeling, pre‑aggregation, and remote‑read pushdown—to keep monitoring efficient and scalable.

AI monitoring · Cardinality · Observability

0 likes · 12 min read

AntTech

Apr 1, 2025 · Artificial Intelligence

AReaL‑boba: Open‑Source Reinforcement Learning Training Framework v0.2 with SOTA Performance

The Ant Research Institute and Tsinghua University's Wu Yi team released AReaL‑boba 0.2, an open‑source reinforcement‑learning training framework that dramatically speeds up large‑scale model training, achieves state‑of‑the‑art mathematical reasoning results, and provides all code, data, and scripts for reproducible research.

AI · Training Framework · large models

0 likes · 5 min read

Qborfy AI

Mar 25, 2025 · Artificial Intelligence

How to Start Learning AI: A Structured Roadmap for Beginners

This guide explains why programmers should embrace AI, outlines a four‑stage learning roadmap covering model fundamentals, practical development skills, advanced project work, and continuous community engagement, and lists mainstream large models, frameworks, and API platforms to get started.

AI learning · API · Python

0 likes · 7 min read

Baidu Geek Talk

Mar 17, 2025 · Industry Insights

From Manual Restarts to Automated Fault Tolerance: The Evolution of AI Training Stability

This article traces the decade‑long evolution of AI training stability—from early small‑model manual operations to large‑scale, multi‑thousand‑GPU clusters—detailing metrics like invalid training time, fault‑tolerance architectures, eBPF‑based hidden‑fault detection, BCCL enhancements, multi‑level restart strategies, and trigger‑based checkpointing that together shrink downtime from minutes to seconds.

AI training · distributed systems · eBPF

0 likes · 22 min read

Nightwalker Tech

Mar 15, 2025 · Artificial Intelligence

Guide to Accessing International AI Large Models via Aggregation Tools, APIs, and Python Code

This article introduces major international and domestic AI large models, recommends desktop aggregation tools and APIs such as POE, Monica, and OpenRouter, and provides complete Python code examples for synchronous and streaming text and multimodal conversations, along with additional API and compute‑rental resources.

AI · API · Model integration

0 likes · 11 min read

DevOps

Mar 13, 2025 · Artificial Intelligence

Large Model Commercialization Reshapes Cloud AI Competition: Capital Spending, Strategic Paths, and Multi‑Model Ecosystems

The article analyzes how the commercialization of large AI models is redefining cloud providers' competitive dynamics, highlighting Amazon Bedrock's DeepSeek‑R1 launch, IDC forecasts on model usage, major vendors' capital expenditures, and the shift toward flexible, cost‑effective multi‑model ecosystems for enterprise AI.

AI · Cloud Computing · Enterprise AI

0 likes · 14 min read

JD Tech

Mar 12, 2025 · Artificial Intelligence

From Low‑Resource Large Model Training to Dynamic Margin Selection: A JD Engineer’s Journey

The article recounts a JD retail engineer’s rapid growth through tackling low‑resource large‑model training, developing a margin‑based dynamic data selection method (DynaMS) that earned an ICLR paper, and sharing practical insights on aligning business needs with cutting‑edge AI research.

AI research · Data Efficiency · ICLR

0 likes · 11 min read

DaTaobao Tech

Mar 12, 2025 · Artificial Intelligence

Multimodal Automatic Layout Generation for E-commerce

The project develops a multimodal automatic layout generation system for e‑commerce by fine‑tuning the qwen‑vl‑7b vision‑language model with LoRA on poster and Taobao image‑layout data, employing diffusion‑based image generation and coordinate‑prediction methods to produce structured layouts that power poster, marketing image, and video‑cover creation with over 90% adoption, while exploring multi‑image, style‑aware, and iterative refinement extensions.

LLM · Multimodal AI · diffusion

0 likes · 12 min read

AI Frontier Lectures

Mar 10, 2025 · Industry Insights

Why DeepSeek’s Rise Is Shaking China’s AGI Landscape

The article analyzes how DeepSeek’s unexpected success has triggered a strategic rethink across Chinese AI firms, prompting shifts from product‑centric growth to foundational model research, reshaping talent structures at Tencent and ByteDance, and questioning where the true barriers to AGI lie.

AGI · China AI · DeepSeek

0 likes · 13 min read

Cognitive Technology Team

Mar 9, 2025 · Artificial Intelligence

AGI Learning Framework and Practical AI Application Guide

This article outlines a systematic AGI learning framework across five capability levels, recommends key papers and books, and provides practical steps for engineers to combine study with hands‑on large‑model projects, identify suitable use‑cases, and stay competitive in the evolving AI landscape.

AGI · AI Applications · Learning Framework

0 likes · 7 min read

AntData

Mar 7, 2025 · Artificial Intelligence

Design and Implementation of a Cloud‑Native AI Storage Acceleration System (PCache) for Large‑Scale Model Training

This article examines the challenges of AI storage for massive models, describes Ant Group's multi‑cloud, high‑availability PCache architecture, and details its GPU‑mixed deployment, metadata services, data‑link optimizations, and performance results that enable petabyte‑scale training with low cost and high stability.

AI storage · Multi-Cloud · PCache

0 likes · 19 min read

DataFunSummit

Mar 3, 2025 · Artificial Intelligence

DeepSeek Open Source Week: Seven Core Technologies Reshaping Large‑Model Training

The DeepSeek open‑source week introduced seven breakthrough technologies—FlashMLA, DeepGEMM, DeepEP, DualPipe, EPLB, 3FS, and Smallpond—that together overhaul data flow, algorithmic complexity, hardware utilization, MoE communication, and resource balancing, dramatically improving large‑model training efficiency and lowering entry barriers for the AI industry.

AI hardware · DeepSeek · data pipelines

0 likes · 17 min read

JD Tech Talk

Mar 3, 2025 · Artificial Intelligence

AI Engine Technology Based on Domestic Chips for JD Retail

This article describes JD Retail's AI engine built on domestic NPU chips, covering challenges, heterogeneous GPU‑NPU scheduling, high‑performance training and inference engines, extensive model support, real‑world deployment cases, and future plans for large‑scale chip clusters and ecosystem development.

AI · GPU · NPU

0 likes · 20 min read

DaTaobao Tech

Mar 3, 2025 · Artificial Intelligence

How Taobao’s “Faxiang” AI Model Revolutionizes E‑Commerce Video Generation

Taobao’s AIGC video generation platform, built on a large‑scale “Faxiang” model that evolved from UNet to DiT, leverages over 2 billion curated e‑commerce videos, expert alignment, LoRA fine‑tuning, and multi‑control capabilities to deliver diverse, high‑quality product videos that dramatically boost conversion metrics across the marketplace.

AI video generation · AIGC · Multimodal

0 likes · 11 min read

Data Thinking Notes

Mar 2, 2025 · Artificial Intelligence

How DeepSeek’s Open‑Source Week Accelerates AI with Cutting‑Edge GPU and Storage Innovations

During DeepSeek’s Open‑Source Week (Feb 24‑28), five production‑tested projects were released, spanning GPU‑optimized MLA kernels, MoE communication libraries, high‑performance FP8 GEMM, dual‑pipeline parallelism, and an AI‑focused distributed file system, each delivering significant performance and efficiency gains for large‑scale AI workloads.

AI · GPU Optimization · distributed training

0 likes · 13 min read

ZhongAn Tech Team

Feb 22, 2025 · Artificial Intelligence

How SkyReels, DeepSeek NSA, Grok‑3, and KG²RAG Are Shaping the Next AI Wave

This issue reviews China's first open‑source short‑film model SkyReels‑V1, DeepSeek's Native Sparse Attention breakthrough, xAI's massive Grok‑3 deployment on 200k H100 GPUs, and a knowledge‑graph‑guided RAG framework, highlighting their performance gains, architectural innovations, and industry impact.

AI · Industry Trends · Knowledge Graph

0 likes · 15 min read

Software Engineering 3.0 Era

Feb 21, 2025 · Artificial Intelligence

How NSA and MoE Are Shaping the Future of Large‑Model Development

The article examines Native Sparse Attention (NSA) and Mixture‑of‑Experts (MoE) as complementary innovations that improve data quality, model architecture, and inference efficiency for large models, while also discussing their challenges and potential research directions.

Mixture of Experts · Model Optimization · Native Sparse Attention

0 likes · 11 min read

DataFunTalk

Feb 19, 2025 · Artificial Intelligence

Large Models: Concepts, Principles, Classifications and Applications

This report provides a comprehensive overview of large-scale AI models, explaining their definition, massive parameter and data requirements, underlying transformer architecture, classification into language, vision and multimodal models, notable examples such as DeepSeek, and a survey of popular AIGC tools and practical use cases.

AIGC tools · Multimodal AI · deep learning

0 likes · 9 min read

Xiaohongshu Tech REDtech

Feb 17, 2025 · Artificial Intelligence

WorldSense: A New Benchmark for Evaluating Multimodal Large Models in Real‑World Scenarios

WorldSense, a new benchmark of 1,662 real‑world video‑audio clips and 3,172 QA pairs across 26 cognitive tasks, reveals that current multimodal large models achieve only 25%–48% accuracy, highlighting the crucial role of combined visual‑audio input and the difficulty of audio‑ and emotion‑related reasoning.

Multimodal AI · benchmark dataset · large models

0 likes · 12 min read

Lao Guo's Learning Space

Feb 14, 2025 · Artificial Intelligence

Key AI Concepts Explained: Definition, Large‑Model Role, and Future Implications

The article defines Artificial Intelligence, explains how large models enable computers to mimic human intelligence for tasks and learning, and presents a personal view that machines may eventually surpass humans and evolve into silicon‑based intelligent life with autonomous will.

AI Fundamentals · artificial-intelligence · future of AI

0 likes · 2 min read

DevOps

Feb 13, 2025 · Artificial Intelligence

60 Thoughts of DeepSeek Founder Liang Wenfeng on AGI, Large Models, and Innovation

The article presents DeepSeek founder Liang Wenfeng’s 60 reflections on artificial general intelligence, large‑model research, open‑source culture, talent strategy, and the broader AI ecosystem, while also highlighting his vision for democratizing AI and upcoming AI‑coding events in Beijing.

AGI · DeepSeek · artificial-intelligence

0 likes · 21 min read

Architects' Tech Alliance

Feb 10, 2025 · Industry Insights

What Makes DeepSeek’s New V3 Model Rival GPT‑4o? A Deep Dive into Large‑Scale AI

This article explains what defines a large AI model, compares parameter scales of GPT‑3, GPT‑4 and M6, and analyzes DeepSeek’s recent releases—V3, R1, and Janus‑Pro—highlighting their benchmark performance, reinforcement‑learning techniques, and cost efficiency versus leading proprietary models.

AI benchmark · DeepSeek · Model Scaling

0 likes · 5 min read

Alibaba Cloud Developer

Feb 10, 2025 · Artificial Intelligence

Understanding the AI Wave: A Deep Dive into Large Models and Their Impact

This article offers a comprehensive overview of large models, covering their historical evolution, technical foundations, the current "hundred‑model" competition, practical use cases across industries, and future challenges such as safety, controllability, and efficient deployment.

Scaling Law · large models · retrieval‑augmented generation

0 likes · 33 min read

Architect's Alchemy Furnace

Feb 8, 2025 · Artificial Intelligence

How to Choose the Right Hardware for AI Models from 1.5B to 671B

This guide outlines the hardware requirements for AI models ranging from lightweight 1.5 B parameters to massive 671 B models, detailing CPU cores, memory, GPU recommendations, storage needs, optimization tips, deployment suggestions, and suitable application scenarios.

AI hardware · DeepSeek · GPU Optimization

0 likes · 5 min read

AIWalker

Feb 6, 2025 · Artificial Intelligence

FluxSR: The First 12B‑Parameter Single‑Step Diffusion Model for Real‑World Super‑Resolution

FluxSR introduces a novel single‑step diffusion approach for real‑world image super‑resolution built on the 12‑billion‑parameter FLUX.1‑dev model, employing Flow‑Trajectory Distillation, TV‑LPIPS and attention‑diversity losses to achieve high fidelity, reduced artifacts, and lower memory and compute costs.

Flow Distillation · diffusion · image restoration

0 likes · 16 min read

JD Tech Talk

Jan 26, 2025 · Operations

Evolution of Operations and the Application of Large Models in Modern IT Ops

This article reviews the transformation of IT operations from manual processes to automation, AIOps, and ChatOps, and examines how large language models enhance intelligent assistance, automated diagnosis, and log analysis to improve efficiency, reliability, and rapid incident resolution.

AIOps · Automation · ChatOps

0 likes · 7 min read

AI Code to Success

Jan 23, 2025 · Industry Insights

Core Tech vs Application Optimization: Where’s the Real Battleground in the AI Large‑Model Race?

The article analyzes the 2025 AI large‑model landscape, contrasting slowing foundational breakthroughs with fierce application competition, highlighting MiniMax’s low‑cost linear‑attention models, multimodal advances, and the strategic shift from price wars to sustainable, technology‑driven growth.

AI · Industry Analysis · Multimodal

0 likes · 7 min read

Huawei Cloud Developer Alliance

Jan 22, 2025 · Artificial Intelligence

How Huawei’s AI Large‑Model Teacher Training Empowered Educators in Henan

From January 16 to 18, Huawei Cloud hosted a three‑day AI large‑model teacher training at Henan Information Statistics Vocational College, gathering over 40 educators from 18 schools; the program covered model fundamentals, prompt engineering, and industry‑education integration, boosting teachers’ AI expertise and fostering future AI talent development.

AI · artificial-intelligence · education

0 likes · 5 min read

Baidu Geek Talk

Jan 20, 2025 · Industry Insights

How Baidu’s Qianfan AppBuilder Is Redefining AI‑Native App Development

The interview explores how Baidu Cloud's Qianfan AppBuilder platform evolves from traditional coding to AI‑native low‑code development, detailing the impact of large‑model agents, Retrieval‑Augmented Generation, security, multimodal support, and future roadmap on enterprise productivity and digital transformation.

AI agents · AI native apps · Enterprise AI

0 likes · 18 min read

DataFunSummit

Jan 19, 2025 · Artificial Intelligence

Understanding MLOps and LMOps: Evolution, Engineering Practices, and Future Trends for Large Models

This article reviews the development of MLOps, introduces the emerging LMOps framework for large‑model engineering, outlines key architectural components, discusses current challenges and industry trends, and presents future directions and standardization efforts in AI operations.

AI Engineering · AI Ops · LMOps

0 likes · 18 min read

Baidu Tech Salon

Jan 10, 2025 · Industry Insights

How Baidu’s PaddlePaddle Fuels AI Ecosystem Growth in Wuhan – Key Takeaways from the Wenxin China Tour

The Wenxin China Tour’s Wuhan stop showcased the launch of Baidu’s PaddlePaddle AI Empowerment Center, presented industry‑wide AI ecosystem data, highlighted regional collaborations with universities and enterprises, and featured workshops and award ceremonies that illustrate the rapid adoption of large‑model technologies in Wuhan’s emerging AI market.

AI · AI Education · PaddlePaddle

0 likes · 10 min read

Baidu Geek Talk

Jan 8, 2025 · Artificial Intelligence

Evolution of Video Search Ranking Architecture Towards an End‑to‑End Large‑Model Framework

The article outlines how video search ranking has shifted from a tightly‑coupled multi‑stage cascade to an extensible, end‑to‑end, model‑centric framework called Rankflow, leveraging large‑model inference, decoupled recall, fine‑grained parallelism, and elastic compute allocation to boost performance, flexibility, and maintainability while paving the way for future retrieval‑augmented generation integration.

AI · elastic resources · large models

0 likes · 11 min read

Architects' Tech Alliance

Jan 2, 2025 · Industry Insights

Why AI Compute Demand Is Poised to Surge 500× Over the Next Decade

The article outlines how AI compute requirements have exploded since the early large‑model era, predicts a 500‑fold increase in the next ten years, presents market size and financing data, and notes a bundled collection of 44 technical PDFs for architects.

AI · Industry Trends · Market Analysis

0 likes · 4 min read

Data Thinking Notes

Jan 2, 2025 · Artificial Intelligence

How AI Large Models Are Revolutionizing China’s Banking and State Enterprises

This article examines the rapid rise of AI large‑model technology across China’s financial sector and state‑owned enterprises, highlighting over 200 models deployed by 2023, detailed banking use‑cases, a growing portfolio of central‑enterprise projects worth millions, and the future shift from internal efficiency gains to outward customer‑facing innovation.

AI · State‑owned enterprises · banking

0 likes · 14 min read

AI Product Manager Community

Dec 31, 2024 · Industry Insights

How Microsoft’s 2024 AI Strategy Redefines Enterprise Productivity

The article analyzes Microsoft’s 2024 AI initiatives—including a revamped business architecture, expanded Copilot capabilities, new Phi‑3.5 models, and ecosystem partnerships—to show how the company aims to boost enterprise efficiency and shape the future of AI‑driven productivity.

AI Strategy · Copilot · Enterprise Productivity

0 likes · 8 min read

Architects' Tech Alliance

Dec 25, 2024 · Artificial Intelligence

Performance Analysis of NVIDIA H20 and L20 AI Inference Chips

This article evaluates NVIDIA's China‑specific H20 and L20 inference chips, comparing their compute and memory‑bandwidth characteristics against A100, H100 and H200, and shows how they achieve superior throughput in large‑model inference despite reduced specifications.

AI · GPU · H20

0 likes · 6 min read

ZhongAn Tech Team

Dec 22, 2024 · Industry Insights

What’s Driving the AI Boom? New Models, Data Limits, and the Rise of Forgetting

This issue reviews the latest AI breakthroughs—including OpenAI’s o3 and o1 models, pricing cuts, new features in ChatGPT, product launches like Pika 2.0 and Gemini 2.0, a heated debate on pre‑training data bottlenecks sparked by Ilya Sutskever, a novel black‑box forgetting method, and DeepMind’s Genie 2 3D world generator—highlighting how industry dynamics and research directions are reshaping the field.

3D generation · AI · Industry Trends

0 likes · 12 min read

DataFunSummit

Nov 26, 2024 · Information Security

AI‑Driven Security Operations (AISECOPS): Architecture, Practices, and Evaluation

This article explains how large‑model AI can be integrated into security operations (AISECOPS) to simplify application integration, improve fault detection, and automate protection across complex north‑south and east‑west network layers, while addressing challenges such as data quality, cost control, model selection, and safety frameworks.

AISECOPS · Embedding · Security Operations

0 likes · 22 min read

Tencent Tech

Nov 19, 2024 · Artificial Intelligence

How Tencent’s Angel Platform Secured the 2024 World Internet Conference Leading Technology Award

Tencent’s Angel machine learning platform, recognized for breakthroughs in trillion‑scale model training, inference, and deployment, won the 2024 World Internet Conference Leading Technology Award, highlighting its self‑developed hardware‑software stack, high‑performance networking, and extensive real‑world AI applications.

AI platform · Angel · Technology Award

0 likes · 6 min read

DataFunTalk

Nov 17, 2024 · Artificial Intelligence

Federated Learning and Data Security in the Era of Large Models: Research Overview and the FLAIR Platform

This presentation reviews recent research on data security and utilization in the large‑model era, covering privacy‑preserving federated learning, knowledge‑transfer techniques, prototype‑based modeling, multi‑model fusion methods such as FuseGen, and introduces the federated knowledge computing platform FLAIR for both horizontal and vertical federated scenarios.

Data Security · FLAIR · Knowledge Transfer

0 likes · 19 min read

360 Tech Engineering

Nov 15, 2024 · Artificial Intelligence

Advances in Multimodal Large Models and Document Understanding Presented at the 2024 Global Machine Learning Conference (Beijing)

At the 2024 Global Machine Learning Conference in Beijing, 360 AI Research Institute showcased cutting‑edge multimodal large‑model research, fine‑grained open‑world object detection, and document understanding technologies, highlighting open‑source releases, real‑world deployments, and competitive achievements in AI competitions.

AI research · Knowledge Graph · Multimodal AI

0 likes · 7 min read

Alimama Tech

Nov 13, 2024 · Artificial Intelligence

DeepString: Alibaba's Anti‑Fraud Platform Using Large Models for Real‑Time Traffic Detection

Alibaba's anti-fraud platform DeepString uses large unsupervised models to detect abnormal traffic in real time across multiple advertising products, combining a foundation model for event mining, anomaly measurement, and an alignment model for online filtering, reducing reliance on manual labeling and domain expertise.

Risk Management · algorithm framework · anti-fraud

0 likes · 19 min read

Baidu Tech Salon

Nov 13, 2024 · Industry Insights

Baidu’s iRAG and “Miaoda”: Solving AI Hallucinations and Powering the No‑Code Revolution

At Baidu World 2024, CEO Robin Li unveiled the iRAG retrieval‑augmented image generation model that dramatically reduces hallucinations and introduced the no‑code platform “Miaoda,” showcasing intelligent agents as the next mainstream AI application while highlighting explosive growth in daily model usage.

AI · Industry Trends · Intelligent agents

0 likes · 11 min read

Architects' Tech Alliance

Nov 10, 2024 · Industry Insights

AI Compute Infrastructure: Trends, Scaling Laws, and the Rise of Massive Clusters

The article analyzes the development of AI compute infrastructure, detailing the three‑level architecture from chip to cluster, the scaling law linking model parameters to compute demand, the rapid growth of massive “ten‑thousand‑card” clusters worldwide, and the emerging demand for inference workloads driving new deployment and scheduling strategies.

AI compute · Industry Trends · Inference Demand

0 likes · 15 min read

JD Tech Talk

Nov 8, 2024 · Artificial Intelligence

Exploring UI Design‑to‑Code Automation: Practices from Meituan, Xianyu, Microsoft and Large‑Model Flutter Generation

This article surveys recent advances in automatically converting UI design drafts into code, reviewing solutions from Meituan, Xianyu, Microsoft, a range of design‑to‑code tools, Flutter‑specific generators, JD's Ling platform, and practical experiments with large language models for Flutter code generation.

Design Automation · Flutter · UI2Code

0 likes · 7 min read

Baidu Tech Salon

Nov 6, 2024 · Industry Insights

How Large AI Models Are Powering the Next Industrial Revolution

At the 7th China International Import Expo forum, Baidu's AI chief explained how large models, with their superior performance, strong generalization, and standardized development processes, are driving a new wave of industrial transformation across sectors such as transportation, finance, and scientific research.

AI Applications · Baidu · artificial-intelligence

0 likes · 5 min read

DataFunSummit

Nov 2, 2024 · Artificial Intelligence

AI Large Models in Finance: Applications, Case Studies, and Future Challenges

This article explores how AI large models are transforming the financial sector through intelligent advisory, automated strategy generation, risk prediction, asset allocation, and other applications, presenting detailed implementations, real-world case studies, and discussing future opportunities and challenges such as data privacy, model transparency, and regulatory compliance.

AI · finance · investment

0 likes · 16 min read

DataFunSummit

Oct 29, 2024 · Artificial Intelligence

Technical Maturity Curve of User Profiling and Tag Systems in the Large‑Model Era

This article explains the concept of a technology maturity curve, why it should be evaluated, and how user profiling and tag systems evolve under the influence of large‑model AI, detailing seven key assessment dimensions and a comprehensive architecture that guides enterprises in strategic decision‑making.

AI · Technology Maturity · large models

0 likes · 21 min read

AsiaInfo Technology: New Tech Exploration

Oct 23, 2024 · Artificial Intelligence

How to Optimize Distributed Training for Massive AI Models: Strategies & Performance Insights

This article examines the challenges of scaling large AI models across multiple GPUs, explores data, pipeline, and tensor parallelism, analyzes collective communication patterns and data‑channel technologies such as PCIe, NVLink and RDMA, and offers concrete optimization recommendations to boost training efficiency.

GPU communication · collective communication · distributed training

0 likes · 21 min read

JD Retail Technology

Oct 15, 2024 · Artificial Intelligence

Large‑Model‑Driven Evolution of E‑commerce Search and Recommendation at JD Retail

The article examines how large language models are reshaping JD Retail's e‑commerce search and recommendation pipelines, detailing industry evolution, technical challenges such as knowledge hallucination, intent understanding, personalization, cost, and safety, and presenting JD's end‑to‑end AIGC architecture, data preprocessing, alignment, evaluation, and next‑generation AI search solutions.

AI · Knowledge Graph · Multimodal

0 likes · 36 min read

Alibaba Cloud Big Data AI Platform

Oct 12, 2024 · Operations

How GitOps Powers AI‑Driven Large‑Scale Cloud‑Native Operations

The article summarizes Alibaba Cloud's 2024 conference talks on AI‑enhanced observability, presenting a cloud‑native GitOps solution for massive clusters and showcasing large‑model applications in intelligent Q&A and diagnosis to improve operational stability, cost, and efficiency.

AIOps · Cloud Native · GitOps

0 likes · 6 min read

DataFunSummit

Oct 10, 2024 · Artificial Intelligence

AIGC‑Assisted Marketing Material Generation at Shujia Technology

This article describes Shujia Technology's use of artificial intelligence to generate marketing images and videos, outlining the background, challenges of high-volume content production, detailed solutions for image and video assets—including layout models, diffusion models, and digital human synthesis—and future research directions.

AIGC · Marketing · digital human

0 likes · 12 min read

Baidu Geek Talk

Oct 9, 2024 · Artificial Intelligence

How Baidu’s Baige 4.0 Architecture Redefines AI Compute Efficiency

This article analyzes Baidu's Baige 4.0 AI infrastructure, detailing its four‑layer architecture, XMAN 5.0 hardware, HPN network, BCCL communication library, and AIAK inference upgrades, and explains how these innovations address large‑model training and inference challenges while boosting performance, utilization, and cost efficiency.

AI Infrastructure · GPU Acceleration · High-performance computing

0 likes · 16 min read

Java Tech Enthusiast

Sep 30, 2024 · Artificial Intelligence

The AI Smile Curve: Profit Distribution and Future Outlook

The AI industry’s profit landscape mirrors a smile curve, with upstream GPU manufacturers and downstream application developers capturing most returns while costly large‑model R&D yields low margins, prompting predictions of GPU valuation corrections, a push for consumer‑facing killer apps, and massive application turnover through creative destruction.

AI · GPU · Industry Analysis

0 likes · 11 min read

Alibaba Cloud Big Data AI Platform

Sep 26, 2024 · Artificial Intelligence

How Alibaba Cloud’s PAI Tackles Large‑Model Training and Inference Challenges in 2024

At the 2024 Yunqi Conference, Alibaba Cloud’s AI Infra experts detailed the latest challenges of large‑model deployment—such as hardware costs, resource management, and software‑hardware coordination—and introduced PAI’s new capabilities, including stability tools, automated distributed training, reinforcement‑learning frameworks, inference optimizations, and integrated big‑data AI solutions.

AI Infra · Inference Optimization · big data integration

0 likes · 14 min read

Data Thinking Notes

Sep 24, 2024 · Artificial Intelligence

Leveraging Large Models to Transform Data Governance: Quality, Cost, Efficiency

This article explains how large language models enhance data governance by improving data quality, reducing implementation costs, and increasing operational efficiency through knowledge bases and interactive prompt libraries, and it also outlines practical empowerment pathways for organizations seeking to leverage AI-driven analytics.

AI · Data Governance · Efficiency

0 likes · 3 min read

JD Tech Talk

Sep 23, 2024 · Artificial Intelligence

JD Advertising R&D: AI‑Driven Solutions for Traffic Valuation, Multimodal Understanding, Auction Mechanisms, Generative Recommendation, and Large‑Model Engineering

The JD Advertising R&D team applies cutting‑edge AI techniques—including query intent models, multimodal representation pipelines, reinforcement‑learning‑based auction mechanisms, generative recommendation with quantized product tokens, and large‑model infrastructure—to boost traffic valuation, ad relevance, revenue, and creative generation across the platform.

AI · Advertising · Graph Neural Networks

0 likes · 19 min read

JD Cloud Developers

Sep 23, 2024 · Artificial Intelligence

How JD’s Advertising Lab Leverages Large‑Scale AI to Transform E‑Commerce Ads

JD's advertising research team combines deep learning, multimodal modeling, reinforcement‑learning auctions, and generative recommendation to boost ad relevance, improve long‑tail product exposure, and overcome large‑model inference challenges in a high‑traffic e‑commerce environment.

Graph Neural Network · Multimodal · advertising AI

0 likes · 22 min read

360 Zhihui Cloud Developer

Sep 19, 2024 · Operations

How TAI Platform Optimizes Large‑Model Scheduling and Fault Recovery on Kubernetes

This article explains how the TAI platform leverages Kubernetes and Volcano to tackle fault, efficiency, and usability challenges in large‑model training and inference, detailing custom resources, automated fault detection, and advanced scheduling strategies that boost resource utilization and performance.

Scheduling · Volcano · fault-recovery

0 likes · 9 min read

Baobao Algorithm Notes

Sep 18, 2024 · Artificial Intelligence

Why Training on 1,000 GPUs Is Harder Than You Think—and How to Tame It

Training deep learning models on a thousand GPUs faces steep communication overhead, higher failure probability, and scaling inefficiencies, but by profiling each step, overlapping compute and communication, using gradient bucketing and accumulation, and employing elastic training techniques, practitioners can approach near‑linear performance while mitigating common pitfalls.

GPU scaling · Performance Optimization · PyTorch

0 likes · 13 min read

AntTech

Sep 16, 2024 · Artificial Intelligence

Opportunities and Challenges in the Era of Large Models: Technology Integration and Industry Leap

In his keynote at the 2024 Inclusion·Bund Conference, HKUST Board Chair Shen Xiangyang discusses how large‑model AI reshapes human‑computer interaction, introduces the concept of Intelligent Augmentation, emphasizes responsible AI governance, and outlines the practical steps needed to deploy AI agents in industry.

AI · AI Governance · Human-Computer Interaction

0 likes · 4 min read

AntTech

Sep 14, 2024 · Artificial Intelligence

WDTA Releases First International Standard for Large‑Model Supply‑Chain Security

At the 2024 Inclusion·Bund Conference, the World Digital Technology Academy (WDTA) unveiled the first international standard for large‑model supply‑chain security, a collaborative effort by CSA Greater China, Ant Group, Microsoft, Google, Meta, PrivateAI and others, marking a significant step in global AI governance and trust.

AI Governance · International Standards · large models

0 likes · 7 min read

Efficient Ops

Sep 11, 2024 · Artificial Intelligence

How AI Large Models Can Automate DevOps Pipeline Failure Analysis

This article explores how AI large‑model technology can be integrated into DevOps pipelines to automatically detect, classify, and resolve interruption events, dramatically reducing manual troubleshooting time and improving overall software development and operations efficiency.

AI · devops · intelligent analysis

0 likes · 11 min read

AntTech

Sep 11, 2024 · Artificial Intelligence

2024 Inclusion·Bund Conference Forum: Exploring the Creative Boundaries and Application Imagination of Large Models

The 2024 Inclusion·Bund Conference hosted a forum on "Large Model Creativity Boundaries and Application Imagination," featuring leading AI experts who discussed agents, multimodal technology, knowledge graphs, announced a new industry alliance, unveiled three major model products, and presented a trustworthy AI framework report for finance, healthcare, and government sectors.

AI · Knowledge Graph · Trustworthy AI

0 likes · 6 min read

AntTech

Sep 6, 2024 · Artificial Intelligence

Large Model Industry Trustworthy Application Framework Research Report

Ant Group and the China Academy of Information and Communications Technology released a research report outlining a trustworthy application framework for large models in rigorous sectors such as finance and healthcare, detailing technical safeguards, industry case studies, and guidance for scalable, secure AI deployment.

AI Deployment · AI Governance · Healthcare AI

0 likes · 3 min read

Baobao Algorithm Notes

Aug 29, 2024 · Industry Insights

Why Pretraining Boosts New Engineers More Than SFT: A Practical Guide

The answer argues that fresh graduates should join pre‑training teams because the required engineering tasks—large‑scale data crawling, Hadoop/Spark pipelines, PyTorch and CUDA setup, Megatron code debugging, and scaling‑law experiments—rapidly sharpen coding skills, while SFT work focuses mainly on data labeling and offers slower technical growth.

AI Engineering · Career Advice · SFT

0 likes · 7 min read

Baidu Geek Talk

Aug 28, 2024 · Artificial Intelligence

How PaddlePaddle 3.0 Simplifies Large‑Model Distributed Training with Automatic Parallelism

This article explains the challenges of scaling large AI models, introduces PaddlePaddle 3.0's four‑dimensional hybrid parallelism and its unified automatic parallel framework, details core concepts such as ProcessMesh and Placements, provides step‑by‑step code examples, and outlines performance‑optimizing strategies like operator fusion and pipeline scheduling.

Hybrid Parallel · PaddlePaddle · Performance Optimization

0 likes · 17 min read

DataFunSummit

Aug 25, 2024 · Artificial Intelligence

Applying Large AI Models to Financial Data Governance and Innovative Use Cases

This article presents a comprehensive technical overview of how large AI models are reshaping financial data production, governance, multimodal document understanding, lakehouse storage, private‑domain model deployment, data‑centric engineering methods, and multi‑agent intelligent advisory within the finance sector.

AI · Multimodal · RAG

0 likes · 21 min read

Data Thinking Notes

Aug 20, 2024 · Artificial Intelligence

How Large AI Models Transform Data Governance: Strategies and Challenges

This article explores how the rise of massive AI models reshapes data governance, detailing model fundamentals, architectural types, emerging challenges, a five‑domain governance framework, and practical AI‑driven applications for data standards, metadata, quality, and security, while also looking ahead to future trends.

AI · Data Governance · Data Quality

0 likes · 14 min read

JD Retail Technology

Aug 16, 2024 · Artificial Intelligence

Interview with JD Retail AI Director Zhai Zhouwei on the Evolution and Future of E‑commerce Search Powered by Large Models

In this interview, JD Retail’s AI director Zhai Zhouwei outlines the four historical stages of e‑commerce search, explains how large‑model AI is reshaping user interaction, retrieval and content generation, discusses practical challenges and solutions, and shares his vision and advice for enterprises adopting these technologies.

AI · JD.com · NLP

0 likes · 9 min read

DataFunTalk

Aug 11, 2024 · Artificial Intelligence

AI‑Driven Security Operations (AISECOPS): Architecture, Practices, and Evaluation

This article presents a comprehensive overview of AI‑enabled security operations, detailing the industry pain points, the AISECOPS workflow, model selection between OpenAI embeddings and ST5, classification methods, performance and cost evaluations, and future directions for integrating agents and secure AI pipelines.

AI · Cost Evaluation · Ops Automation

0 likes · 22 min read

DaTaobao Tech

Aug 7, 2024 · Artificial Intelligence

Overview of Large Model Development, AIGC Practices, and Prompt Engineering

The article surveys the rapid emergence of large AI models and AIGC, explains core concepts like AI, AGI, and LLMs, details prompt‑engineering techniques such as chain‑of‑thought, outlines a seven‑layer AIGC stack, discusses technical and ethical challenges, and highlights future multimodal and industry‑specific applications.

AI · AIGC · LLM

0 likes · 25 min read

Open Source Linux

Aug 6, 2024 · Artificial Intelligence

What Is AI? A Beginner’s Guide to Definitions, Types, and Real‑World Impact

This article explains what artificial intelligence (AI) is, how it differs from traditional programming, outlines its main categories, introduces machine learning, deep learning, neural network models such as CNN, RNN, and Transformer, describes large models and GPT, and discusses AI’s wide‑ranging applications and societal implications.

AI · AI Applications · GPT

0 likes · 16 min read

DataFunTalk

Aug 2, 2024 · Artificial Intelligence

From Big Data to Large Models: Alibaba Cloud AI Platform Architecture and Practices for Search Recommendation

This presentation details Alibaba Cloud's AI platform, covering the end‑to‑end pipeline from big‑data processing and feature engineering to large‑model training, inference optimization, recommendation system architecture, and RAG applications, highlighting practical engineering solutions and performance gains.

AI platform · Big Data · Feature Store

0 likes · 18 min read

Kuaishou Tech

Jul 31, 2024 · Artificial Intelligence

Kuaishou Showcases AI‑Driven Multimedia Innovations at China Multimedia 2024

At the China Multimedia 2024 conference in Yinchuan, Kuaishou presented its latest AI‑driven large‑model technologies—including text‑to‑image, text‑to‑video, and audio models—alongside advances in intelligent video coding, a new research‑fund initiative, and recent industry awards.

AI · Kuaishou · Video Coding

0 likes · 5 min read

Model Perspective

Jul 30, 2024 · Artificial Intelligence

Your Complete AI Learning Roadmap: From Basics to Large Model Mastery

This guide presents a comprehensive AI learning roadmap, dividing study into five progressive stages—from foundational math and programming to core deep‑learning and reinforcement‑learning techniques, large‑model training, industry applications, and future trends—plus curated book lists, tool recommendations, and practical RAG tutorials.

AI learning roadmap · AI resources · RAG

0 likes · 9 min read

JD Tech Talk

Jul 23, 2024 · Artificial Intelligence

Intelligent Parcel Identification Using Large Language Models in JD Express Logistics

This article examines how JD Express applies large‑language‑model‑based natural language processing to accurately recognize and classify shipped items, addressing low matching rates, improving packaging recommendations, reducing damage and claims, and outlining architecture, model selection criteria, caching strategies, and future operational benefits.

AI · JD Express · NLP

0 likes · 21 min read

DataFunSummit

Jul 21, 2024 · Information Security

Ping An's Data Security Compliance Management Practices and Large‑Model Applications

This article presents Ping An's comprehensive approach to data security compliance, detailing its evolving data management framework, the integration of large‑model AI for classification, risk monitoring, and assessment, and practical insights from a Q&A session on governance and operational challenges.

AI · Data Security · Risk Management

0 likes · 14 min read

Architects' Tech Alliance

Jul 15, 2024 · Artificial Intelligence

Why Model-as-a-Service (MaaS) Is Shaping the Future of AI Deployment

This article examines the Model-as-a-Service (MaaS) paradigm, tracing its origins, defining its expanded capabilities for large‑model ecosystems, outlining the full‑stack services it offers, and analyzing current industry adoption, deployment models, and the technical and regulatory challenges that must be addressed for scalable AI rollout.

AI Deployment · AI Infrastructure · Industry Analysis

0 likes · 11 min read
