Baidu Intelligent Cloud Tech Hub
Author

Baidu Intelligent Cloud Tech Hub

We share the cloud tech topics you care about. Feel free to leave a message and tell us what you'd like to learn.

130
Articles
0
Likes
99
Views
0
Comments
Recent Articles

Latest from Baidu Intelligent Cloud Tech Hub

100 recent articles max
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
Jul 25, 2024 · Artificial Intelligence

How Transformers Work: From Tensor Basics to GPU Performance Analysis

This article provides a comprehensive, engineer‑focused breakdown of transformer architecture—including tensor fundamentals, matrix multiplication, GPU theoretical compute, attention and FFN mechanics, quantitative parameter and FLOP analysis, performance metrics like MFU, parallelism strategies, variant optimizations, and practical exercise questions—offering clear insight into large‑model efficiency and scaling.

AttentionFFNGPU Performance
0 likes · 33 min read
How Transformers Work: From Tensor Basics to GPU Performance Analysis
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
Jul 10, 2024 · Artificial Intelligence

How AI Transforms Physical Network Fault Localization: From Manual to LLM‑Powered Precision

This article explains how Baidu Cloud evolved its physical network fault‑location workflow—from manual analysis and integrated multi‑signal algorithms to AI‑driven reasoning with large language models—highlighting structured prompting, multi‑agent collaboration, and measurable improvements in accuracy and automation.

AIAutomationFault Localization
0 likes · 15 min read
How AI Transforms Physical Network Fault Localization: From Manual to LLM‑Powered Precision
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
Jul 3, 2024 · Operations

How to Eliminate Network Hash Collisions in Large‑Model Training

This article examines the impact of GPU communication bottlenecks on large‑model training, analyzes hash‑collision issues in high‑performance networks, and presents three practical solutions—including increasing RDMA streams, affinity‑aware scheduling, and dynamic load balancing—to boost effective network bandwidth up to 95%.

Hash CollisionRDMAdynamic load balancing
0 likes · 11 min read
How to Eliminate Network Hash Collisions in Large‑Model Training