Author

JavaEdge

First‑line development experience at multiple leading tech firms; now a software architect at a Shanghai state‑owned enterprise and founder of Programming Yanxuan. Nearly 300k followers online; expertise in distributed system design, AIGC application development, and quantitative finance investing.

372

Articles

Likes

885

Views

Comments

Latest from JavaEdge

100 recent articles max

JavaEdge

Dec 16, 2024 · Backend Development

Why Nginx’s Event‑Driven Architecture Beats Traditional Thread‑Per‑Request Servers

Unlike traditional one‑request‑per‑process servers, Nginx uses a fixed number of worker processes with a non‑blocking, event‑driven model that reduces context switches, leverages epoll/kqueue, and handles thousands of connections efficiently, making it the preferred high‑performance web server.

event-drivennon-blocking I/Operformance

0 likes · 8 min read

Why Nginx’s Event‑Driven Architecture Beats Traditional Thread‑Per‑Request Servers

JavaEdge

Dec 15, 2024 · Cloud Computing

Is Serverless a Scam? Uncovering Hidden Costs, Complexity, and Reliability Risks

The article argues that serverless platforms often hide high costs, operational complexity, and reliability issues, contrasting them with traditional VPS and Cloudflare solutions while highlighting DDoS protection, pricing traps, and the challenges of managing micro‑service architectures.

Cloud ComputingDDoS protectionServerless

0 likes · 11 min read

Is Serverless a Scam? Uncovering Hidden Costs, Complexity, and Reliability Risks

JavaEdge

Dec 8, 2024 · Backend Development

Netflix’s Service‑Level Priority Load Shedding: Protecting User‑Initiated Requests

This article explains how Netflix extended its priority load‑shedding strategy from the API gateway to individual services, detailing the classification of user‑initiated versus pre‑fetch requests, the implementation of partitioned concurrency limiters, CPU‑ and I/O‑based shedding, test results, and real‑world impact on availability.

NetflixPrioritybackend architecture

0 likes · 18 min read

Netflix’s Service‑Level Priority Load Shedding: Protecting User‑Initiated Requests

JavaEdge

Dec 1, 2024 · Artificial Intelligence

Exploring the Limits and Benchmarks of Qwen’s QwQ‑32B‑Preview AI Model

QwQ‑32B‑Preview, an experimental AI model from the Qwen team, showcases strong reasoning in math and programming while facing challenges like language switching, inference loops, safety concerns, and variable capabilities across domains, with benchmark scores ranging from 50% to over 90% on tests such as GPQA, AIME, MATH‑500, and LiveCodeBench.

AI BenchmarkLLMQwen

0 likes · 7 min read

Exploring the Limits and Benchmarks of Qwen’s QwQ‑32B‑Preview AI Model

JavaEdge

Nov 24, 2024 · Fundamentals

How to Measure and Tame Technical Debt with Practical Metrics

The article explains what technical debt is, why it matters, and presents a set of concrete metrics—such as WTFs per minute, code smells, test and documentation coverage, effort on deprecated components, defect‑fix work, and vulnerability counts—to help teams identify, monitor, and reduce technical debt effectively.

code qualitydocumentationsoftware maintenance

0 likes · 13 min read

How to Measure and Tame Technical Debt with Practical Metrics

JavaEdge

Nov 21, 2024 · Backend Development

Inside Booking.com’s Real‑Time Ranking Engine: Architecture, Challenges & Solutions

Booking.com’s ranking platform uses sophisticated machine‑learning models and a multi‑cluster backend architecture to deliver personalized hotel search results, detailing data pipelines, feature engineering, service components, performance challenges, and optimization techniques such as static fallback, multi‑stage ranking, and model inference acceleration.

Ranking

0 likes · 13 min read

JavaEdge

Nov 20, 2024 · Artificial Intelligence

7 Proven Strategies to Simplify Large Language Model Deployment

The article explains why deploying large language models is challenging and presents seven practical techniques—including defining deployment boundaries, model quantization, inference optimization, infrastructure consolidation, model replacement planning, GPU utilization, and using smaller models—to make LLM deployment more efficient and cost‑effective.

GPU OptimizationLLM deploymentQuantization

0 likes · 24 min read

7 Proven Strategies to Simplify Large Language Model Deployment

JavaEdge

Nov 17, 2024 · Backend Development

How Netflix’s Data Gateway Simplifies Distributed Database Access

This article explains how Netflix built the Data Gateway platform to abstract and protect complex distributed databases, detailing its motivation, architecture, component overview, declarative runtime and deployment configurations, and real‑world case studies such as key‑value services, secure RDS, and seamless data migration.

Data GatewayDeclarative DeploymentKey-Value Service

0 likes · 20 min read

How Netflix’s Data Gateway Simplifies Distributed Database Access

JavaEdge

Nov 16, 2024 · Backend Development

How Netflix Built a Low‑Latency Distributed Counter Service at Scale

This article explains Netflix's distributed counter abstraction built on their time‑series service, detailing use cases, API design, counter types, implementation methods, control‑plane configuration, performance results, and future work to achieve near‑real‑time, low‑latency counting at massive scale.

Low latencyNetflixbackend architecture

0 likes · 25 min read

How Netflix Built a Low‑Latency Distributed Counter Service at Scale

JavaEdge

Nov 11, 2024 · Fundamentals

Master Audio/Video Development: Fast‑Track Your FFmpeg Skills and Community Engagement

This guide outlines why audio‑video technology is booming, how mastering FFmpeg provides a rapid entry point, and offers a structured learning path—including core concepts, streaming tools, API usage, and community contribution—to help developers quickly become proficient in media processing.

Open Sourceaudioffmpeg

0 likes · 8 min read

Master Audio/Video Development: Fast‑Track Your FFmpeg Skills and Community Engagement