Tagged articles
6 articles
Page 1 of 1
21CTO
21CTO
Dec 7, 2025 · Fundamentals

Why Linus Torvalds Defends Windows’ Blue Screen – Hardware Faults Over Software Bugs

As Windows 10 reaches end‑of‑life, the article examines the surge in Linux adoption, the evolution of Windows blue‑screen displays, Linus Torvalds’ surprising defense of the issue, and a detailed video where he and Linus Sebastian build a high‑end open‑source workstation while debating hardware reliability and ECC memory.

BlueScreenECC MemoryHardware Reliability
0 likes · 6 min read
Why Linus Torvalds Defends Windows’ Blue Screen – Hardware Faults Over Software Bugs
Liangxu Linux
Liangxu Linux
Aug 6, 2022 · Operations

When Core Switches Suddenly Die: The Hidden SSD Time‑Bomb in Network Gear

A network engineer recounts a terrifying outage caused by a firmware‑related SSD bug that locks core switches after 28,224 hours of use, explains the emergency troubleshooting steps taken, and highlights the need for better vendor recall mechanisms to protect critical infrastructure.

Hardware ReliabilityOperationsSSD bug
0 likes · 8 min read
When Core Switches Suddenly Die: The Hidden SSD Time‑Bomb in Network Gear
Alibaba Cloud Developer
Alibaba Cloud Developer
Jul 2, 2021 · Fundamentals

Why Data Loss Happens: Hidden Bit Flips and How to Prevent Them

This article explains the concepts of data loss and corruption, defines "data not lost" and "data not wrong", examines common bit‑flip sources in disks, memory, and networks, explores silent CPU errors, and presents design, detection, and recovery strategies for reliable storage systems.

Hardware ReliabilityStorage Systemsbit flip
0 likes · 17 min read
Why Data Loss Happens: Hidden Bit Flips and How to Prevent Them
Amap Tech
Amap Tech
Apr 28, 2021 · Operations

Hardware Quality and Reliability of Map Collection Vehicles: Design, Production, and Testing Practices

The article outlines how map‑collection vehicles achieve high hardware quality and reliability through systematic design, production management, and rigorous testing—including derated components, redundancy, thermal and mechanical safeguards, sensor protection, and data‑driven failure prediction—to meet MTBF targets and extend service life.

Hardware Reliabilitymap vehiclequality assurance
0 likes · 29 min read
Hardware Quality and Reliability of Map Collection Vehicles: Design, Production, and Testing Practices
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Oct 16, 2018 · Operations

Improving Server Reliability by Reducing Memory Faults: Alibaba's Memory Fault Isolation Enhancements

The article explains how Alibaba's infrastructure team tackles unexpected server outages caused by memory hardware failures by enhancing memory fault isolation, using AI‑driven prediction, hardware‑level segregation, and improved diagnostics to boost overall system stability and reduce downtime.

AI predictionHardware Reliabilitycloud infrastructure
0 likes · 11 min read
Improving Server Reliability by Reducing Memory Faults: Alibaba's Memory Fault Isolation Enhancements