Tagged articles
269 articles
Page 2 of 3
AntTech
AntTech
May 12, 2025 · Industry Insights

How AI Large Models Are Revolutionizing Multimodal Content Safety

An award‑winning joint project by Shanghai Jiao Tong University and Ant Group unveils a multimodal foundation model and advanced detection techniques that dramatically improve AI‑driven content risk governance across massive online services.

AIAnt GroupContent Safety
0 likes · 3 min read
How AI Large Models Are Revolutionizing Multimodal Content Safety
Baobao Algorithm Notes
Baobao Algorithm Notes
May 12, 2025 · Artificial Intelligence

Why Dropout Is Dropped in Large‑Scale Model Training: Effects, Efficiency, Stability

Training massive AI models now commonly omits dropout because its original scaling trick fails to match training and inference distributions, leading to poorer performance, higher computational cost, and instability, while alternative regularization like normalization remains useful, as illustrated by practical observations and historical tricks.

AI stabilityDropoutRegularization
0 likes · 6 min read
Why Dropout Is Dropped in Large‑Scale Model Training: Effects, Efficiency, Stability
JD Retail Technology
JD Retail Technology
May 7, 2025 · Artificial Intelligence

Solving Technical Challenges with Large AI Models at JD Retail: Reward Modeling, Query Expansion, and Model Pruning

JD Retail’s engineering team tackles hard AI problems by replacing a monolithic reward model with specialized small models for ad‑image generation, deploying an LLM‑driven query‑expansion pipeline that lifts conversion rates, and pruning text‑to‑image transformers using FFT and RDP to boost throughput 40% without loss, while building comprehensive evaluation tools and a semantic smart‑assistant.

AIModel PruningReward Modeling
0 likes · 14 min read
Solving Technical Challenges with Large AI Models at JD Retail: Reward Modeling, Query Expansion, and Model Pruning
DevOps
DevOps
Apr 27, 2025 · Artificial Intelligence

Large Model Technologies: RAG, AI Agents, Multimodal Applications, and Future Trends

This article examines how Retrieval‑Augmented Generation (RAG), AI agents, and multimodal large‑model techniques are reshaping AI‑industry integration, discusses their technical challenges and practical implementations, and outlines future development directions across algorithms, products, and domain‑specific applications.

AI agentsRAGRetrieval-Augmented Generation
0 likes · 14 min read
Large Model Technologies: RAG, AI Agents, Multimodal Applications, and Future Trends
Meituan Technology Team
Meituan Technology Team
Apr 24, 2025 · Artificial Intelligence

Meituan AI Recruitment: Join Our Advanced Technology Teams

Meituan's AI recruitment page showcases diverse opportunities across AI infrastructure, intelligent interaction, visual intelligence, and intelligent products, featuring roles from algorithm engineers to product managers working on cutting-edge technologies including large models, intelligent agents, and multimodal systems.

AI RecruitmentComputer VisionIntelligent agents
0 likes · 5 min read
Meituan AI Recruitment: Join Our Advanced Technology Teams
Tencent Cloud Developer
Tencent Cloud Developer
Apr 24, 2025 · Industry Insights

How RAG, AI Agents, and Multimodal Models Are Reshaping Industry – Trends, Challenges, and Real‑World Cases

The article analyzes the rapid evolution of large‑model technologies—Retrieval‑Augmented Generation, autonomous agents, and multimodal AI—detailing their technical foundations, practical challenges, industry applications such as unified multimodal tasks, open‑world detection, and video moderation, and forecasting future development directions.

AI agentsMultimodal AIRAG
0 likes · 15 min read
How RAG, AI Agents, and Multimodal Models Are Reshaping Industry – Trends, Challenges, and Real‑World Cases
Beijing SF i-TECH City Technology Team
Beijing SF i-TECH City Technology Team
Apr 8, 2025 · Artificial Intelligence

Automatic Algorithm Design for Operations Optimization Using Large Language Models and Evolutionary Techniques

This document outlines how large language models can be combined with evolutionary algorithms such as genetic algorithms to automatically generate, evaluate, and iteratively improve operations‑optimization code for logistics, resource allocation, and staffing scenarios, reducing development cycles, enhancing adaptability, and achieving higher solution quality.

AI Optimizationautomated-code-generationgenetic algorithm
0 likes · 21 min read
Automatic Algorithm Design for Operations Optimization Using Large Language Models and Evolutionary Techniques
Architects' Tech Alliance
Architects' Tech Alliance
Apr 3, 2025 · Artificial Intelligence

Why NVLink and NVSwitch Are Essential for Training Massive AI Models

Training today's massive AI foundation models demands extensive GPU resources and sophisticated multi‑GPU communication, making technologies like NVLink and NVSwitch crucial for efficient distributed training, while data‑parallel and model‑parallel strategies together optimize performance across large‑scale hardware clusters.

AIDistributed TrainingGPU
0 likes · 8 min read
Why NVLink and NVSwitch Are Essential for Training Massive AI Models
Baidu Tech Salon
Baidu Tech Salon
Apr 2, 2025 · Artificial Intelligence

PaddlePaddle Framework 3.0 Released: Five Core Innovations for Large Models and Scientific Computing

PaddlePaddle 3.0, launched on April 1 2025, introduces five core innovations—including dynamic‑static unified automatic parallelism, a training‑inference integrated PIR, high‑order automatic differentiation for scientific computing, a one‑stage CINN compiler, and heterogeneous multi‑chip adaptation—that dramatically reduce distributed‑training code, boost performance up to four‑fold, and extend the framework to aerospace, automotive, meteorology and life‑science applications while remaining fully compatible with the 2.0 API.

Deep LearningPaddlePaddleautomatic parallelism
0 likes · 21 min read
PaddlePaddle Framework 3.0 Released: Five Core Innovations for Large Models and Scientific Computing
Volcano Engine Developer Services
Volcano Engine Developer Services
Apr 1, 2025 · Artificial Intelligence

Taming High Cardinality in AI Model & Autonomous Driving Monitoring with Prometheus

This article explores how high cardinality in Prometheus metrics impacts AI large‑model and autonomous‑driving observability, explains the underlying concepts, outlines the performance and cost challenges, and presents practical design, collection, and query‑side solutions—including metric modeling, pre‑aggregation, and remote‑read pushdown—to keep monitoring efficient and scalable.

AI MonitoringCardinalityObservability
0 likes · 12 min read
Taming High Cardinality in AI Model & Autonomous Driving Monitoring with Prometheus
AntTech
AntTech
Apr 1, 2025 · Artificial Intelligence

AReaL‑boba: Open‑Source Reinforcement Learning Training Framework v0.2 with SOTA Performance

The Ant Research Institute and Tsinghua University's Wu Yi team released AReaL‑boba 0.2, an open‑source reinforcement‑learning training framework that dramatically speeds up large‑scale model training, achieves state‑of‑the‑art mathematical reasoning results, and provides all code, data, and scripts for reproducible research.

AITraining Frameworklarge models
0 likes · 5 min read
AReaL‑boba: Open‑Source Reinforcement Learning Training Framework v0.2 with SOTA Performance
Qborfy AI
Qborfy AI
Mar 25, 2025 · Artificial Intelligence

How to Start Learning AI: A Structured Roadmap for Beginners

This guide explains why programmers should embrace AI, outlines a four‑stage learning roadmap covering model fundamentals, practical development skills, advanced project work, and continuous community engagement, and lists mainstream large models, frameworks, and API platforms to get started.

AI learningAPIPython
0 likes · 7 min read
How to Start Learning AI: A Structured Roadmap for Beginners
Baidu Geek Talk
Baidu Geek Talk
Mar 17, 2025 · Industry Insights

From Manual Restarts to Automated Fault Tolerance: The Evolution of AI Training Stability

This article traces the decade‑long evolution of AI training stability—from early small‑model manual operations to large‑scale, multi‑thousand‑GPU clusters—detailing metrics like invalid training time, fault‑tolerance architectures, eBPF‑based hidden‑fault detection, BCCL enhancements, multi‑level restart strategies, and trigger‑based checkpointing that together shrink downtime from minutes to seconds.

AI trainingDistributed SystemsInfrastructure
0 likes · 22 min read
From Manual Restarts to Automated Fault Tolerance: The Evolution of AI Training Stability
Nightwalker Tech
Nightwalker Tech
Mar 15, 2025 · Artificial Intelligence

Guide to Accessing International AI Large Models via Aggregation Tools, APIs, and Python Code

This article introduces major international and domestic AI large models, recommends desktop aggregation tools and APIs such as POE, Monica, and OpenRouter, and provides complete Python code examples for synchronous and streaming text and multimodal conversations, along with additional API and compute‑rental resources.

AIAPIOpenRouter
0 likes · 11 min read
Guide to Accessing International AI Large Models via Aggregation Tools, APIs, and Python Code
DevOps
DevOps
Mar 13, 2025 · Artificial Intelligence

Large Model Commercialization Reshapes Cloud AI Competition: Capital Spending, Strategic Paths, and Multi‑Model Ecosystems

The article analyzes how the commercialization of large AI models is redefining cloud providers' competitive dynamics, highlighting Amazon Bedrock's DeepSeek‑R1 launch, IDC forecasts on model usage, major vendors' capital expenditures, and the shift toward flexible, cost‑effective multi‑model ecosystems for enterprise AI.

AIEnterprise AIMulti-Model Ecosystem
0 likes · 14 min read
Large Model Commercialization Reshapes Cloud AI Competition: Capital Spending, Strategic Paths, and Multi‑Model Ecosystems
DaTaobao Tech
DaTaobao Tech
Mar 12, 2025 · Artificial Intelligence

Multimodal Automatic Layout Generation for E-commerce

The project develops a multimodal automatic layout generation system for e‑commerce by fine‑tuning the qwen‑vl‑7b vision‑language model with LoRA on poster and Taobao image‑layout data, employing diffusion‑based image generation and coordinate‑prediction methods to produce structured layouts that power poster, marketing image, and video‑cover creation with over 90% adoption, while exploring multi‑image, style‑aware, and iterative refinement extensions.

LLMMultimodal AIdiffusion
0 likes · 12 min read
Multimodal Automatic Layout Generation for E-commerce
AI Frontier Lectures
AI Frontier Lectures
Mar 10, 2025 · Industry Insights

Why DeepSeek’s Rise Is Shaking China’s AGI Landscape

The article analyzes how DeepSeek’s unexpected success has triggered a strategic rethink across Chinese AI firms, prompting shifts from product‑centric growth to foundational model research, reshaping talent structures at Tencent and ByteDance, and questioning where the true barriers to AGI lie.

AGIChina AIDeepSeek
0 likes · 13 min read
Why DeepSeek’s Rise Is Shaking China’s AGI Landscape
Cognitive Technology Team
Cognitive Technology Team
Mar 9, 2025 · Artificial Intelligence

AGI Learning Framework and Practical AI Application Guide

This article outlines a systematic AGI learning framework across five capability levels, recommends key papers and books, and provides practical steps for engineers to combine study with hands‑on large‑model projects, identify suitable use‑cases, and stay competitive in the evolving AI landscape.

AGIAI applicationsEngineering
0 likes · 7 min read
AGI Learning Framework and Practical AI Application Guide
AntData
AntData
Mar 7, 2025 · Artificial Intelligence

Design and Implementation of a Cloud‑Native AI Storage Acceleration System (PCache) for Large‑Scale Model Training

This article examines the challenges of AI storage for massive models, describes Ant Group's multi‑cloud, high‑availability PCache architecture, and details its GPU‑mixed deployment, metadata services, data‑link optimizations, and performance results that enable petabyte‑scale training with low cost and high stability.

AI storagePCachePerformance Optimization
0 likes · 19 min read
Design and Implementation of a Cloud‑Native AI Storage Acceleration System (PCache) for Large‑Scale Model Training
DataFunSummit
DataFunSummit
Mar 3, 2025 · Artificial Intelligence

DeepSeek Open Source Week: Seven Core Technologies Reshaping Large‑Model Training

The DeepSeek open‑source week introduced seven breakthrough technologies—FlashMLA, DeepGEMM, DeepEP, DualPipe, EPLB, 3FS, and Smallpond—that together overhaul data flow, algorithmic complexity, hardware utilization, MoE communication, and resource balancing, dramatically improving large‑model training efficiency and lowering entry barriers for the AI industry.

AI hardwareDeepSeekdata pipelines
0 likes · 17 min read
DeepSeek Open Source Week: Seven Core Technologies Reshaping Large‑Model Training
JD Tech Talk
JD Tech Talk
Mar 3, 2025 · Artificial Intelligence

AI Engine Technology Based on Domestic Chips for JD Retail

This article describes JD Retail's AI engine built on domestic NPU chips, covering challenges, heterogeneous GPU‑NPU scheduling, high‑performance training and inference engines, extensive model support, real‑world deployment cases, and future plans for large‑scale chip clusters and ecosystem development.

AIDistributed TrainingGPU
0 likes · 20 min read
AI Engine Technology Based on Domestic Chips for JD Retail
DaTaobao Tech
DaTaobao Tech
Mar 3, 2025 · Artificial Intelligence

How Taobao’s “Faxiang” AI Model Revolutionizes E‑Commerce Video Generation

Taobao’s AIGC video generation platform, built on a large‑scale “Faxiang” model that evolved from UNet to DiT, leverages over 2 billion curated e‑commerce videos, expert alignment, Lora fine‑tuning, and multi‑control capabilities to deliver diverse, high‑quality product videos that dramatically boost conversion metrics across the marketplace.

AI video generationAIGCe‑commerce
0 likes · 11 min read
How Taobao’s “Faxiang” AI Model Revolutionizes E‑Commerce Video Generation
Data Thinking Notes
Data Thinking Notes
Mar 2, 2025 · Artificial Intelligence

How DeepSeek’s Open‑Source Week Accelerates AI with Cutting‑Edge GPU and Storage Innovations

During DeepSeek’s Open‑Source Week (Feb 24‑28), five production‑tested projects were released, spanning GPU‑optimized MLA kernels, MoE communication libraries, high‑performance FP8 GEMM, dual‑pipeline parallelism, and a AI‑focused distributed file system, each delivering significant performance and efficiency gains for large‑scale AI workloads.

AIDistributed TrainingGPU Optimization
0 likes · 13 min read
How DeepSeek’s Open‑Source Week Accelerates AI with Cutting‑Edge GPU and Storage Innovations
ZhongAn Tech Team
ZhongAn Tech Team
Feb 22, 2025 · Artificial Intelligence

How SkyReels, DeepSeek NSA, Grok‑3, and KG²RAG Are Shaping the Next AI Wave

This issue reviews China's first open‑source short‑film model SkyReels‑V1, DeepSeek's Native Sparse Attention breakthrough, xAI's massive Grok‑3 deployment on 200k H100 GPUs, and a knowledge‑graph‑guided RAG framework, highlighting their performance gains, architectural innovations, and industry impact.

AIKnowledge GraphRAG
0 likes · 15 min read
How SkyReels, DeepSeek NSA, Grok‑3, and KG²RAG Are Shaping the Next AI Wave
DataFunTalk
DataFunTalk
Feb 19, 2025 · Artificial Intelligence

Large Models: Concepts, Principles, Classifications and Applications

This report provides a comprehensive overview of large-scale AI models, explaining their definition, massive parameter and data requirements, underlying transformer architecture, classification into language, vision and multimodal models, notable examples such as DeepSeek, and a survey of popular AIGC tools and practical use cases.

AIGC toolsDeep LearningMultimodal AI
0 likes · 9 min read
Large Models: Concepts, Principles, Classifications and Applications
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Feb 17, 2025 · Artificial Intelligence

WorldSense: A New Benchmark for Evaluating Multimodal Large Models in Real‑World Scenarios

WorldSense, a new benchmark of 1,662 real‑world video‑audio clips and 3,172 QA pairs across 26 cognitive tasks, reveals that current multimodal large models achieve only 25%–48% accuracy, highlighting the crucial role of combined visual‑audio input and the difficulty of audio‑ and emotion‑related reasoning.

Multimodal AIbenchmark datasetlarge models
0 likes · 12 min read
WorldSense: A New Benchmark for Evaluating Multimodal Large Models in Real‑World Scenarios
Lao Guo's Learning Space
Lao Guo's Learning Space
Feb 14, 2025 · Artificial Intelligence

Key AI Concepts Explained: Definition, Large‑Model Role, and Future Implications

The article defines Artificial Intelligence, explains how large models enable computers to mimic human intelligence for tasks and learning, and presents a personal view that machines may eventually surpass humans and evolve into a silicon‑based intelligent life with autonomous will.

AI fundamentalsartificial intelligencefuture of AI
0 likes · 2 min read
Key AI Concepts Explained: Definition, Large‑Model Role, and Future Implications
DevOps
DevOps
Feb 13, 2025 · Artificial Intelligence

60 Thoughts of DeepSeek Founder Liang Wenfeng on AGI, Large Models, and Innovation

The article presents DeepSeek founder Liang Wenfeng’s 60 reflections on artificial general intelligence, large‑model research, open‑source culture, talent strategy, and the broader AI ecosystem, while also highlighting his vision for democratizing AI and upcoming AI‑coding events in Beijing.

AGIDeepSeekInnovation
0 likes · 21 min read
60 Thoughts of DeepSeek Founder Liang Wenfeng on AGI, Large Models, and Innovation
Architects' Tech Alliance
Architects' Tech Alliance
Feb 10, 2025 · Industry Insights

What Makes DeepSeek’s New V3 Model Rival GPT‑4o? A Deep Dive into Large‑Scale AI

This article explains what defines a large AI model, compares parameter scales of GPT‑3, GPT‑4 and M6, and analyzes DeepSeek’s recent releases—V3, R1, and Janus‑Pro—highlighting their benchmark performance, reinforcement‑learning techniques, and cost efficiency versus leading proprietary models.

AI BenchmarkDeepSeekModel Scaling
0 likes · 5 min read
What Makes DeepSeek’s New V3 Model Rival GPT‑4o? A Deep Dive into Large‑Scale AI
Alibaba Cloud Developer
Alibaba Cloud Developer
Feb 10, 2025 · Artificial Intelligence

Understanding the AI Wave: A Deep Dive into Large Models and Their Impact

This article offers a comprehensive overview of large models, covering their historical evolution, technical foundations, the current "hundred‑model" competition, practical use cases across industries, and future challenges such as safety, controllability, and efficient deployment.

large modelsretrieval‑augmented generationscaling law
0 likes · 33 min read
Understanding the AI Wave: A Deep Dive into Large Models and Their Impact
AIWalker
AIWalker
Feb 6, 2025 · Artificial Intelligence

FluxSR: The First 12B‑Parameter Single‑Step Diffusion Model for Real‑World Super‑Resolution

FluxSR introduces a novel single‑step diffusion approach for real‑world image super‑resolution built on the 12‑billion‑parameter FLUX.1‑dev model, employing Flow‑Trajectory Distillation, TV‑LPIPS and attention‑diversity losses to achieve high fidelity, reduced artifacts, and lower memory and compute costs.

Flow DistillationImage Restorationdiffusion
0 likes · 16 min read
FluxSR: The First 12B‑Parameter Single‑Step Diffusion Model for Real‑World Super‑Resolution
JD Tech Talk
JD Tech Talk
Jan 26, 2025 · Operations

Evolution of Operations and the Application of Large Models in Modern IT Ops

This article reviews the transformation of IT operations from manual processes to automation, AIOps, and ChatOps, and examines how large language models enhance intelligent assistance, automated diagnosis, and log analysis to improve efficiency, reliability, and rapid incident resolution.

AutomationChatOpsaiops
0 likes · 7 min read
Evolution of Operations and the Application of Large Models in Modern IT Ops
AI Code to Success
AI Code to Success
Jan 23, 2025 · Industry Insights

Core Tech vs Application Optimization: Where’s the Real Battleground in the AI Large‑Model Race?

The article analyzes the 2025 AI large‑model landscape, contrasting slowing foundational breakthroughs with fierce application competition, highlighting MiniMax’s low‑cost linear‑attention models, multimodal advances, and the strategic shift from price wars to sustainable, technology‑driven growth.

AIIndustry analysislarge models
0 likes · 7 min read
Core Tech vs Application Optimization: Where’s the Real Battleground in the AI Large‑Model Race?
Huawei Cloud Developer Alliance
Huawei Cloud Developer Alliance
Jan 22, 2025 · Artificial Intelligence

How Huawei’s AI Large‑Model Teacher Training Empowered Educators in Henan

From January 16‑18, Huawei Cloud hosted a three‑day AI large‑model teacher training at Henan Information Statistics Vocational College, gathering over 40 educators from 18 schools; the program covered model fundamentals, prompt engineering, and industry‑education integration, boosting teachers’ AI expertise and fostering future AI talent development.

AITeacher Trainingartificial intelligence
0 likes · 5 min read
How Huawei’s AI Large‑Model Teacher Training Empowered Educators in Henan
Baidu Geek Talk
Baidu Geek Talk
Jan 20, 2025 · Industry Insights

How Baidu’s Qianfan AppBuilder Is Redefining AI‑Native App Development

The interview explores how Baidu Cloud's Qianfan AppBuilder platform evolves from traditional coding to AI‑native low‑code development, detailing the impact of large‑model agents, Retrieval‑Augmented Generation, security, multimodal support, and future roadmap on enterprise productivity and digital transformation.

AI agentsAI native appsEnterprise AI
0 likes · 18 min read
How Baidu’s Qianfan AppBuilder Is Redefining AI‑Native App Development
Baidu Tech Salon
Baidu Tech Salon
Jan 10, 2025 · Industry Insights

How Baidu’s PaddlePaddle Fuels AI Ecosystem Growth in Wuhan – Key Takeaways from the Wenxin China Tour

The Wenxin China Tour’s Wuhan stop showcased the launch of Baidu’s PaddlePaddle AI Empowerment Center, presented industry‑wide AI ecosystem data, highlighted regional collaborations with universities and enterprises, and featured workshops and award ceremonies that illustrate the rapid adoption of large‑model technologies in Wuhan’s emerging AI market.

AIAI educationIndustry Ecosystem
0 likes · 10 min read
How Baidu’s PaddlePaddle Fuels AI Ecosystem Growth in Wuhan – Key Takeaways from the Wenxin China Tour
Baidu Geek Talk
Baidu Geek Talk
Jan 8, 2025 · Artificial Intelligence

Evolution of Video Search Ranking Architecture Towards an End‑to‑End Large‑Model Framework

The article outlines how video search ranking has shifted from a tightly‑coupled multi‑stage cascade to an extensible, end‑to‑end, model‑centric framework called Rankflow, leveraging large‑model inference, decoupled recall, fine‑grained parallelism, and elastic compute allocation to boost performance, flexibility, and maintainability while paving the way for future retrieval‑augmented generation integration.

AIelastic resourceslarge models
0 likes · 11 min read
Evolution of Video Search Ranking Architecture Towards an End‑to‑End Large‑Model Framework
Data Thinking Notes
Data Thinking Notes
Jan 2, 2025 · Artificial Intelligence

How AI Large Models Are Revolutionizing China’s Banking and State Enterprises

This article examines the rapid rise of AI large‑model technology across China’s financial sector and state‑owned enterprises, highlighting over 200 models deployed by 2023, detailed banking use‑cases, a growing portfolio of central‑enterprise projects worth millions, and the future shift from internal efficiency gains to outward customer‑facing innovation.

AIBankingDigital Transformation
0 likes · 14 min read
How AI Large Models Are Revolutionizing China’s Banking and State Enterprises
AI Product Manager Community
AI Product Manager Community
Dec 31, 2024 · Industry Insights

How Microsoft’s 2024 AI Strategy Redefines Enterprise Productivity

The article analyzes Microsoft’s 2024 AI initiatives—including a revamped business architecture, expanded Copilot capabilities, new Phi‑3.5 models, and ecosystem partnerships—to show how the company aims to boost enterprise efficiency and shape the future of AI‑driven productivity.

AI strategyCopilotEnterprise Productivity
0 likes · 8 min read
How Microsoft’s 2024 AI Strategy Redefines Enterprise Productivity
ZhongAn Tech Team
ZhongAn Tech Team
Dec 22, 2024 · Industry Insights

What’s Driving the AI Boom? New Models, Data Limits, and the Rise of Forgetting

This issue reviews the latest AI breakthroughs—including OpenAI’s O3 and o1 models, pricing cuts, new features in ChatGPT, product launches like Pika 2.0 and Gemini 2.0, a heated debate on pre‑training data bottlenecks sparked by Ilya Sutskever, a novel black‑box forgetting method, and DeepMind’s Genie 2 3D world generator—highlighting how industry dynamics and research directions are reshaping the field.

3D generationAIModel Forgetting
0 likes · 12 min read
What’s Driving the AI Boom? New Models, Data Limits, and the Rise of Forgetting
DataFunSummit
DataFunSummit
Nov 26, 2024 · Information Security

AI‑Driven Security Operations (AISECOPS): Architecture, Practices, and Evaluation

This article explains how large‑model AI can be integrated into security operations (AISECOPS) to simplify application integration, improve fault detection, and automate protection across complex north‑south and east‑west network layers, while addressing challenges such as data quality, cost control, model selection, and safety frameworks.

AISECOPSCost OptimizationEmbedding
0 likes · 22 min read
AI‑Driven Security Operations (AISECOPS): Architecture, Practices, and Evaluation
Tencent Tech
Tencent Tech
Nov 19, 2024 · Artificial Intelligence

How Tencent’s Angel Platform Secured the 2024 World Internet Conference Leading Technology Award

Tencent’s Angel machine learning platform, recognized for breakthroughs in trillion‑scale model training, inference, and deployment, won the 2024 World Internet Conference Leading Technology Award, highlighting its self‑developed hardware‑software stack, high‑performance networking, and extensive real‑world AI applications.

AI PlatformAngelDistributed Training
0 likes · 6 min read
How Tencent’s Angel Platform Secured the 2024 World Internet Conference Leading Technology Award
DataFunTalk
DataFunTalk
Nov 17, 2024 · Artificial Intelligence

Federated Learning and Data Security in the Era of Large Models: Research Overview and the FLAIR Platform

This presentation reviews recent research on data security and utilization in the large‑model era, covering privacy‑preserving federated learning, knowledge‑transfer techniques, prototype‑based modeling, multi‑model fusion methods such as FuseGen, and introduces the federated knowledge computing platform FLAIR for both horizontal and vertical federated scenarios.

FLAIRFederated LearningKnowledge Transfer
0 likes · 19 min read
Federated Learning and Data Security in the Era of Large Models: Research Overview and the FLAIR Platform
360 Tech Engineering
360 Tech Engineering
Nov 15, 2024 · Artificial Intelligence

Advances in Multimodal Large Models and Document Understanding Presented at the 2024 Global Machine Learning Conference (Beijing)

At the 2024 Global Machine Learning Conference in Beijing, 360 AI Research Institute showcased cutting‑edge multimodal large‑model research, fine‑grained open‑world object detection, and document understanding technologies, highlighting open‑source releases, real‑world deployments, and competitive achievements in AI competitions.

AI researchKnowledge GraphMultimodal AI
0 likes · 7 min read
Advances in Multimodal Large Models and Document Understanding Presented at the 2024 Global Machine Learning Conference (Beijing)
Alimama Tech
Alimama Tech
Nov 13, 2024 · Artificial Intelligence

DeepString: Alibaba's Anti‑Fraud Platform Using Large Models for Real‑Time Traffic Detection

Alibaba's anti-fraud platform DeepString uses large unsupervised models to detect abnormal traffic in real time across multiple advertising products, combining a foundation model for event mining, anomaly measurement, and an alignment model for online filtering, reducing reliance on manual labeling and domain expertise.

algorithm frameworkanti-fraudlarge models
0 likes · 19 min read
DeepString: Alibaba's Anti‑Fraud Platform Using Large Models for Real‑Time Traffic Detection
Baidu Tech Salon
Baidu Tech Salon
Nov 13, 2024 · Industry Insights

Baidu’s iRAG and “Miaoda”: Solving AI Hallucinations and Powering the No‑Code Revolution

At Baidu World 2024, CEO Robin Li unveiled the iRAG retrieval‑augmented image generation model that dramatically reduces hallucinations and introduced the no‑code platform “Miaoda,” showcasing intelligent agents as the next mainstream AI application while highlighting explosive growth in daily model usage.

AIIntelligent agentsNo-code
0 likes · 11 min read
Baidu’s iRAG and “Miaoda”: Solving AI Hallucinations and Powering the No‑Code Revolution
Architects' Tech Alliance
Architects' Tech Alliance
Nov 10, 2024 · Industry Insights

AI Compute Infrastructure: Trends, Scaling Laws, and the Rise of Massive Clusters

The article analyzes the development of AI compute infrastructure, detailing the three‑level architecture from chip to cluster, the scaling law linking model parameters to compute demand, the rapid growth of massive “ten‑thousand‑card” clusters worldwide, and the emerging demand for inference workloads driving new deployment and scheduling strategies.

AI computeInference DemandInfrastructure
0 likes · 15 min read
AI Compute Infrastructure: Trends, Scaling Laws, and the Rise of Massive Clusters
JD Tech Talk
JD Tech Talk
Nov 8, 2024 · Artificial Intelligence

Exploring UI Design‑to‑Code Automation: Practices from Meituan, Xianyu, Microsoft and Large‑Model Flutter Generation

This article surveys recent advances in automatically converting UI design drafts into code, reviewing solutions from Meituan, Xianyu, Microsoft, a range of design‑to‑code tools, Flutter‑specific generators, JD's Ling platform, and practical experiments with large language models for Flutter code generation.

Design AutomationFlutterTooling
0 likes · 7 min read
Exploring UI Design‑to‑Code Automation: Practices from Meituan, Xianyu, Microsoft and Large‑Model Flutter Generation
Baidu Tech Salon
Baidu Tech Salon
Nov 6, 2024 · Industry Insights

How Large AI Models Are Powering the Next Industrial Revolution

At the 7th China International Import Expo forum, Baidu's AI chief explained how large models, with their superior performance, strong generalization, and standardized development processes, are driving a new wave of industrial transformation across sectors such as transportation, finance, and scientific research.

AI applicationsBaiduartificial intelligence
0 likes · 5 min read
How Large AI Models Are Powering the Next Industrial Revolution
DataFunSummit
DataFunSummit
Nov 2, 2024 · Artificial Intelligence

AI Large Models in Finance: Applications, Case Studies, and Future Challenges

This article explores how AI large models are transforming the financial sector through intelligent advisory, automated strategy generation, risk prediction, asset allocation, and other applications, presenting detailed implementations, real-world case studies, and discussing future opportunities and challenges such as data privacy, model transparency, and regulatory compliance.

AIfinanceinvestment
0 likes · 16 min read
AI Large Models in Finance: Applications, Case Studies, and Future Challenges
DataFunSummit
DataFunSummit
Oct 29, 2024 · Artificial Intelligence

Technical Maturity Curve of User Profiling and Tag Systems in the Large‑Model Era

This article explains the concept of a technology maturity curve, why it should be evaluated, and how user profiling and tag systems evolve under the influence of large‑model AI, detailing seven key assessment dimensions and a comprehensive architecture that guides enterprises in strategic decision‑making.

AITechnology Maturitylarge models
0 likes · 21 min read
Technical Maturity Curve of User Profiling and Tag Systems in the Large‑Model Era
AsiaInfo Technology: New Tech Exploration
AsiaInfo Technology: New Tech Exploration
Oct 23, 2024 · Artificial Intelligence

How to Optimize Distributed Training for Massive AI Models: Strategies & Performance Insights

This article examines the challenges of scaling large AI models across multiple GPUs, explores data, pipeline, and tensor parallelism, analyzes collective communication patterns and data‑channel technologies such as PCIe, NVLink and RDMA, and offers concrete optimization recommendations to boost training efficiency.

Distributed TrainingGPU communicationcollective communication
0 likes · 21 min read
How to Optimize Distributed Training for Massive AI Models: Strategies & Performance Insights
JD Retail Technology
JD Retail Technology
Oct 15, 2024 · Artificial Intelligence

Large‑Model‑Driven Evolution of E‑commerce Search and Recommendation at JD Retail

The article examines how large language models are reshaping JD Retail's e‑commerce search and recommendation pipelines, detailing industry evolution, technical challenges such as knowledge hallucination, intent understanding, personalization, cost, and safety, and presenting JD's end‑to‑end AIGC architecture, data preprocessing, alignment, evaluation, and next‑generation AI search solutions.

AIKnowledge Graphe‑commerce
0 likes · 36 min read
Large‑Model‑Driven Evolution of E‑commerce Search and Recommendation at JD Retail
DataFunSummit
DataFunSummit
Oct 10, 2024 · Artificial Intelligence

AIGC‑Assisted Marketing Material Generation at Shujia Technology

This article describes Shujia Technology's use of artificial intelligence to generate marketing images and videos, outlining the background, challenges of high-volume content production, detailed solutions for image and video assets—including layout models, diffusion models, and digital human synthesis—and future research directions.

AIGCDigital HumanMarketing
0 likes · 12 min read
AIGC‑Assisted Marketing Material Generation at Shujia Technology
Baidu Geek Talk
Baidu Geek Talk
Oct 9, 2024 · Artificial Intelligence

How Baidu’s Baige 4.0 Architecture Redefines AI Compute Efficiency

This article analyzes Baidu's Baige 4.0 AI infrastructure, detailing its four‑layer architecture, XMAN 5.0 hardware, HPN network, BCCL communication library, and AIAK inference upgrades, and explains how these innovations address large‑model training and inference challenges while boosting performance, utilization, and cost efficiency.

AI InfrastructureCluster ManagementGPU Acceleration
0 likes · 16 min read
How Baidu’s Baige 4.0 Architecture Redefines AI Compute Efficiency
Java Tech Enthusiast
Java Tech Enthusiast
Sep 30, 2024 · Artificial Intelligence

The AI Smile Curve: Profit Distribution and Future Outlook

The AI industry’s profit landscape mirrors a smile curve, with upstream GPU manufacturers and downstream application developers capturing most returns while costly large‑model R&D yields low margins, prompting predictions of GPU valuation corrections, a push for consumer‑facing killer apps, and massive application turnover through creative destruction.

AIGPUIndustry analysis
0 likes · 11 min read
The AI Smile Curve: Profit Distribution and Future Outlook
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Sep 26, 2024 · Artificial Intelligence

How Alibaba Cloud’s PAI Tackles Large‑Model Training and Inference Challenges in 2024

At the 2024 Yunqi Conference, Alibaba Cloud’s AI Infra experts detailed the latest challenges of large‑model deployment—such as hardware costs, resource management, and software‑hardware coordination—and introduced PAI’s new capabilities, including stability tools, automated distributed training, reinforcement‑learning frameworks, inference optimizations, and integrated big‑data AI solutions.

AI InfraBig Data IntegrationDistributed Training
0 likes · 14 min read
How Alibaba Cloud’s PAI Tackles Large‑Model Training and Inference Challenges in 2024
Data Thinking Notes
Data Thinking Notes
Sep 24, 2024 · Artificial Intelligence

Leveraging Large Models to Transform Data Governance: Quality, Cost, Efficiency

This article explains how large language models enhance data governance by improving data quality, reducing implementation costs, and increasing operational efficiency through knowledge bases and interactive prompt libraries, and it also outlines practical empowerment pathways for organizations seeking to leverage AI-driven analytics.

AICost reductionData Governance
0 likes · 3 min read
Leveraging Large Models to Transform Data Governance: Quality, Cost, Efficiency
JD Tech Talk
JD Tech Talk
Sep 23, 2024 · Artificial Intelligence

JD Advertising R&D: AI‑Driven Solutions for Traffic Valuation, Multimodal Understanding, Auction Mechanisms, Generative Recommendation, and Large‑Model Engineering

The JD Advertising R&D team applies cutting‑edge AI techniques—including query intent models, multimodal representation pipelines, reinforcement‑learning‑based auction mechanisms, generative recommendation with quantized product tokens, and large‑model infrastructure—to boost traffic valuation, ad relevance, revenue, and creative generation across the platform.

AIAdvertisinggraph neural networks
0 likes · 19 min read
JD Advertising R&D: AI‑Driven Solutions for Traffic Valuation, Multimodal Understanding, Auction Mechanisms, Generative Recommendation, and Large‑Model Engineering
JD Cloud Developers
JD Cloud Developers
Sep 23, 2024 · Artificial Intelligence

How JD’s Advertising Lab Leverages Large‑Scale AI to Transform E‑Commerce Ads

JD's advertising research team combines deep learning, multimodal modeling, reinforcement‑learning auctions, and generative recommendation to boost ad relevance, improve long‑tail product exposure, and overcome large‑model inference challenges in a high‑traffic e‑commerce environment.

Graph Neural Networkadvertising AIe‑commerce
0 likes · 22 min read
How JD’s Advertising Lab Leverages Large‑Scale AI to Transform E‑Commerce Ads
Baobao Algorithm Notes
Baobao Algorithm Notes
Sep 18, 2024 · Artificial Intelligence

Why Training on 1,000 GPUs Is Harder Than You Think—and How to Tame It

Training deep learning models on a thousand GPUs faces steep communication overhead, higher failure probability, and scaling inefficiencies, but by profiling each step, overlapping compute and communication, using gradient bucketing and accumulation, and employing elastic training techniques, practitioners can approach near‑linear performance while mitigating common pitfalls.

Distributed TrainingGPU scalingPerformance Optimization
0 likes · 13 min read
Why Training on 1,000 GPUs Is Harder Than You Think—and How to Tame It
AntTech
AntTech
Sep 16, 2024 · Artificial Intelligence

Opportunities and Challenges in the Era of Large Models: Technology Integration and Industry Leap

In his keynote at the 2024 Inclusion·Bund Conference, HKUST Board Chair Shen Xiangyang discusses how large‑model AI reshapes human‑computer interaction, introduces the concept of Intelligent Augmentation, emphasizes responsible AI governance, and outlines the practical steps needed to deploy AI agents in industry.

AIAI GovernanceHuman-Computer Interaction
0 likes · 4 min read
Opportunities and Challenges in the Era of Large Models: Technology Integration and Industry Leap
AntTech
AntTech
Sep 14, 2024 · Artificial Intelligence

WDTA Releases First International Standard for Large‑Model Supply‑Chain Security

At the 2024 Inclusion·Bund Conference, the World Digital Technology Academy (WDTA) unveiled the first international standard for large‑model supply‑chain security, a collaborative effort by CSA Greater China, Ant Group, Microsoft, Google, Meta, PrivateAI and others, marking a significant step in global AI governance and trust.

AI GovernanceInternational Standardslarge models
0 likes · 7 min read
WDTA Releases First International Standard for Large‑Model Supply‑Chain Security
Efficient Ops
Efficient Ops
Sep 11, 2024 · Artificial Intelligence

How AI Large Models Can Automate DevOps Pipeline Failure Analysis

This article explores how AI large‑model technology can be integrated into DevOps pipelines to automatically detect, classify, and resolve interruption events, dramatically reducing manual troubleshooting time and improving overall software development and operations efficiency.

AIDevOpsPipeline
0 likes · 11 min read
How AI Large Models Can Automate DevOps Pipeline Failure Analysis
AntTech
AntTech
Sep 11, 2024 · Artificial Intelligence

2024 Inclusion·Bund Conference Forum: Exploring the Creative Boundaries and Application Imagination of Large Models

The 2024 Inclusion·Bund Conference hosted a forum on "Large Model Creativity Boundaries and Application Imagination," featuring leading AI experts who discussed agents, multimodal technology, knowledge graphs, announced a new industry alliance, unveiled three major model products, and presented a trustworthy AI framework report for finance, healthcare, and government sectors.

AIFinancial AIKnowledge Graph
0 likes · 6 min read
2024 Inclusion·Bund Conference Forum: Exploring the Creative Boundaries and Application Imagination of Large Models
AntTech
AntTech
Sep 6, 2024 · Artificial Intelligence

Large Model Industry Trustworthy Application Framework Research Report

Ant Group and the China Academy of Information and Communications Technology released a research report outlining a trustworthy application framework for large models in rigorous sectors such as finance and healthcare, detailing technical safeguards, industry case studies, and guidance for scalable, secure AI deployment.

AI GovernanceAI deploymentHealthcare AI
0 likes · 3 min read
Large Model Industry Trustworthy Application Framework Research Report
Baobao Algorithm Notes
Baobao Algorithm Notes
Aug 29, 2024 · Industry Insights

Why Pretraining Boosts New Engineers More Than SFT: A Practical Guide

The answer argues that fresh graduates should join pre‑training teams because the required engineering tasks—large‑scale data crawling, Hadoop/Spark pipelines, torch and CUDA setup, megatron code debugging, and scaling‑law experiments—rapidly sharpen coding skills, while SFT work focuses mainly on data labeling and offers slower technical growth.

AI EngineeringSFTSkill development
0 likes · 7 min read
Why Pretraining Boosts New Engineers More Than SFT: A Practical Guide
Baidu Geek Talk
Baidu Geek Talk
Aug 28, 2024 · Artificial Intelligence

How PaddlePaddle 3.0 Simplifies Large‑Model Distributed Training with Automatic Parallelism

This article explains the challenges of scaling large AI models, introduces PaddlePaddle 3.0's four‑dimensional hybrid parallelism and its unified automatic parallel framework, details core concepts such as ProcessMesh and Placements, provides step‑by‑step code examples, and outlines performance‑optimizing strategies like operator fusion and pipeline scheduling.

Distributed TrainingHybrid ParallelPaddlePaddle
0 likes · 17 min read
How PaddlePaddle 3.0 Simplifies Large‑Model Distributed Training with Automatic Parallelism
DataFunSummit
DataFunSummit
Aug 25, 2024 · Artificial Intelligence

Applying Large AI Models to Financial Data Governance and Innovative Use Cases

This article presents a comprehensive technical overview of how large AI models are reshaping financial data production, governance, multimodal document understanding, lakehouse storage, private‑domain model deployment, data‑centric engineering methods, and multi‑agent intelligent advisory within the finance sector.

AIMulti-AgentRAG
0 likes · 21 min read
Applying Large AI Models to Financial Data Governance and Innovative Use Cases
Data Thinking Notes
Data Thinking Notes
Aug 20, 2024 · Artificial Intelligence

How Large AI Models Transform Data Governance: Strategies and Challenges

This article explores how the rise of massive AI models reshapes data governance, detailing model fundamentals, architectural types, emerging challenges, a five‑domain governance framework, and practical AI‑driven applications for data standards, metadata, quality, and security, while also looking ahead to future trends.

AIData GovernanceData Quality
0 likes · 14 min read
How Large AI Models Transform Data Governance: Strategies and Challenges
JD Retail Technology
JD Retail Technology
Aug 16, 2024 · Artificial Intelligence

Interview with JD Retail AI Director Zhai Zhouwei on the Evolution and Future of E‑commerce Search Powered by Large Models

In this interview, JD Retail’s AI director Zhai Zhouwei outlines the four historical stages of e‑commerce search, explains how large‑model AI is reshaping user interaction, retrieval and content generation, discusses practical challenges and solutions, and shares his vision and advice for enterprises adopting these technologies.

AIJD.comNLP
0 likes · 9 min read
Interview with JD Retail AI Director Zhai Zhouwei on the Evolution and Future of E‑commerce Search Powered by Large Models
DataFunTalk
DataFunTalk
Aug 11, 2024 · Artificial Intelligence

AI‑Driven Security Operations (AISECOPS): Architecture, Practices, and Evaluation

This article presents a comprehensive overview of AI‑enabled security operations, detailing the industry pain points, the AISECOPS workflow, model selection between OpenAI embeddings and ST5, classification methods, performance and cost evaluations, and future directions for integrating agents and secure AI pipelines.

AICost EvaluationOps Automation
0 likes · 22 min read
AI‑Driven Security Operations (AISECOPS): Architecture, Practices, and Evaluation
DaTaobao Tech
DaTaobao Tech
Aug 7, 2024 · Artificial Intelligence

Overview of Large Model Development, AIGC Practices, and Prompt Engineering

The article surveys the rapid emergence of large AI models and AIGC, explains core concepts like AI, AGI, and LLMs, details prompt‑engineering techniques such as chain‑of‑thought, outlines a seven‑layer AIGC stack, discusses technical and ethical challenges, and highlights future multimodal and industry‑specific applications.

AIAIGCLLM
0 likes · 25 min read
Overview of Large Model Development, AIGC Practices, and Prompt Engineering
Open Source Linux
Open Source Linux
Aug 6, 2024 · Artificial Intelligence

What Is AI? A Beginner’s Guide to Definitions, Types, and Real‑World Impact

This article explains what artificial intelligence (AI) is, how it differs from traditional programming, outlines its main categories, introduces machine learning, deep learning, neural network models such as CNN, RNN, and Transformer, describes large models and GPT, and discusses AI’s wide‑range applications and societal implications.

AIAI applicationsDeep Learning
0 likes · 16 min read
What Is AI? A Beginner’s Guide to Definitions, Types, and Real‑World Impact
DataFunTalk
DataFunTalk
Aug 2, 2024 · Artificial Intelligence

From Big Data to Large Models: Alibaba Cloud AI Platform Architecture and Practices for Search Recommendation

This presentation details Alibaba Cloud's AI platform, covering the end‑to‑end pipeline from big‑data processing and feature engineering to large‑model training, inference optimization, recommendation system architecture, and RAG applications, highlighting practical engineering solutions and performance gains.

AI PlatformBig DataFeature Store
0 likes · 18 min read
From Big Data to Large Models: Alibaba Cloud AI Platform Architecture and Practices for Search Recommendation
Kuaishou Tech
Kuaishou Tech
Jul 31, 2024 · Artificial Intelligence

Kuaishou Showcases AI‑Driven Multimedia Innovations at China Multimedia 2024

At the China Multimedia 2024 conference in Yinchuan, Kuaishou presented its latest AI‑driven large‑model technologies—including text‑to‑image, text‑to‑video, and audio models—alongside advances in intelligent video coding, a new research‑fund initiative, and recent industry awards.

AIKuaishouMultimedia
0 likes · 5 min read
Kuaishou Showcases AI‑Driven Multimedia Innovations at China Multimedia 2024
Model Perspective
Model Perspective
Jul 30, 2024 · Artificial Intelligence

Your Complete AI Learning Roadmap: From Basics to Large Model Mastery

This guide presents a comprehensive AI learning roadmap, dividing study into five progressive stages—from foundational math and programming to core deep‑learning and reinforcement‑learning techniques, large‑model training, industry applications, and future trends—plus curated book lists, tool recommendations, and practical RAG tutorials.

AI learning roadmapAI resourcesRAG
0 likes · 9 min read
Your Complete AI Learning Roadmap: From Basics to Large Model Mastery
JD Tech Talk
JD Tech Talk
Jul 23, 2024 · Artificial Intelligence

Intelligent Parcel Identification Using Large Language Models in JD Express Logistics

This article examines how JD Express applies large‑language‑model‑based natural language processing to accurately recognize and classify shipped items, addressing low matching rates, improving packaging recommendations, reducing damage and claims, and outlining architecture, model selection criteria, caching strategies, and future operational benefits.

AIJD ExpressNLP
0 likes · 21 min read
Intelligent Parcel Identification Using Large Language Models in JD Express Logistics
Architects' Tech Alliance
Architects' Tech Alliance
Jul 15, 2024 · Artificial Intelligence

Why Model-as-a-Service (MaaS) Is Shaping the Future of AI Deployment

This article examines the Model-as-a-Service (MaaS) paradigm, tracing its origins, defining its expanded capabilities for large‑model ecosystems, outlining the full‑stack services it offers, and analyzing current industry adoption, deployment models, and the technical and regulatory challenges that must be addressed for scalable AI rollout.

AI InfrastructureAI deploymentCloud AI
0 likes · 11 min read
Why Model-as-a-Service (MaaS) Is Shaping the Future of AI Deployment
Architects' Tech Alliance
Architects' Tech Alliance
Jul 10, 2024 · Industry Insights

Why AI Large Models Are Driving the Next Industrial Revolution

The article analyzes the rapid evolution of AI large models—from their role in advancing AGI through massive pre‑training and fine‑tuning, to current market dynamics led by GPT and domestic Chinese players, and finally to future multimodal applications, content‑factory capabilities, and emerging AIGC revenue models projected to reach trillion‑yuan scales by 2030.

AIAIGCGPT
0 likes · 7 min read
Why AI Large Models Are Driving the Next Industrial Revolution
Ops Development & AI Practice
Ops Development & AI Practice
Jul 8, 2024 · Artificial Intelligence

Essential Denoising Techniques for Training Large AI Models

This article outlines key denoising methods—including data cleaning, augmentation, regularization, adversarial training, and self‑supervised learning—that improve the performance, generalization, and robustness of large neural network and transformer models.

DenoisingRegularizationadversarial training
0 likes · 5 min read
Essential Denoising Techniques for Training Large AI Models
Baobao Algorithm Notes
Baobao Algorithm Notes
Jul 8, 2024 · Industry Insights

Why Large‑Model Deployment Stalls: Robots, Scaling Laws, and Multimodal Frontiers

The article analyzes current challenges in deploying large AI models, covering robot automation, scaling‑law limits, vertical‑domain use cases, multimodal breakthroughs, algorithmic evolution, and the hardware‑software trade‑offs of training and inference infrastructures, while questioning ROI and practical feasibility.

Roboticsalgorithm evolutioninference infrastructure
0 likes · 21 min read
Why Large‑Model Deployment Stalls: Robots, Scaling Laws, and Multimodal Frontiers
AntTech
AntTech
Jul 7, 2024 · Information Security

2024 WAIC Forum on Privacy Computing: Enabling Trusted Data Sharing for Large Models

The 2024 WAIC Privacy Computing Forum gathered leading experts from academia and industry to discuss how encryption, anonymization, and secure multi‑party computation can protect data privacy while enabling large‑model training and inference, highlighting technical challenges, standards, and emerging solutions across AI, big data, and information security domains.

AIMPCPrivacy Computing
0 likes · 15 min read
2024 WAIC Forum on Privacy Computing: Enabling Trusted Data Sharing for Large Models
DataFunSummit
DataFunSummit
Jul 6, 2024 · Artificial Intelligence

Highlights of DataFunCon 2024 Beijing: Big Data, AI, and Large‑Model Trends

The two‑day DataFunCon 2024 Beijing conference gathered hundreds of big‑data and AI experts to discuss the evolution from data lakes to lake‑warehouses, large‑model development, practical applications, and future strategies for enterprises, while showcasing partner exhibitions and a vibrant community spirit.

Big DataChinaData Lake
0 likes · 9 min read
Highlights of DataFunCon 2024 Beijing: Big Data, AI, and Large‑Model Trends
DataFunSummit
DataFunSummit
Jul 5, 2024 · Big Data

Highlights of DataFunCon 2024 Beijing: Big Data, Large Models, and AI Integration

The DataFunCon 2024 Beijing conference opened with keynote speeches on the evolution of Alibaba Cloud's big data platform, explored distributed data warehousing, large model research, and practical AI applications, and concluded with a round‑table discussing future trends and enterprise strategies for big data and AI integration.

Big Dataartificial intelligenceconference
0 likes · 8 min read
Highlights of DataFunCon 2024 Beijing: Big Data, Large Models, and AI Integration
DataFunTalk
DataFunTalk
Jun 29, 2024 · Artificial Intelligence

Document Intelligence in the Financial Sector: Technologies, Challenges, and Future Directions

This presentation reviews the technical scope of document intelligence, its specific applications and challenges in finance, recent advances in document analysis, recognition, and understanding, and outlines future research directions for large‑model and multimodal solutions in processing complex financial documents.

Deep LearningDocument AIlarge models
0 likes · 28 min read
Document Intelligence in the Financial Sector: Technologies, Challenges, and Future Directions
JD Tech
JD Tech
Jun 23, 2024 · Artificial Intelligence

Applying Large Models to Recommendation Systems: Strategies, Challenges, and E‑commerce Case Study

This article examines how large pre‑trained models such as GPT‑4 and BERT are integrated into modern recommendation systems, detailing their advantages, implementation strategies, real‑world e‑commerce case studies, and the technical and privacy challenges that must be addressed for effective deployment.

Online Learningartificial intelligencelarge models
0 likes · 14 min read
Applying Large Models to Recommendation Systems: Strategies, Challenges, and E‑commerce Case Study
DataFunTalk
DataFunTalk
Jun 20, 2024 · Artificial Intelligence

User Profiling Algorithms: From Ontology‑Based Methods to Deep Learning and Large Model Integration

This article provides a comprehensive overview of user profiling algorithms, covering the evolution from ontology‑based traditional methods to modern deep‑learning approaches, including structured label prediction, representation learning, active learning, and large‑model integration, while discussing challenges, practical applications, and future research directions.

Deep LearningOntologyactive learning
0 likes · 26 min read
User Profiling Algorithms: From Ontology‑Based Methods to Deep Learning and Large Model Integration
DataFunTalk
DataFunTalk
Jun 15, 2024 · Artificial Intelligence

DataFunSummit2024 Recommendation System Architecture Summit Overview

The DataFunSummit2024 Recommendation System Architecture Summit invites participants to explore cutting‑edge advances in large‑model recommendation, training and inference optimization, feature engineering, multi‑task modeling, and graph‑based techniques through a series of expert talks and panel discussions from leading industry and academic researchers.

AIRecommendation Systemsconference
0 likes · 33 min read
DataFunSummit2024 Recommendation System Architecture Summit Overview