JavaEdge
Author

JavaEdge

First‑line development experience at multiple leading tech firms; now a software architect at a Shanghai state‑owned enterprise and founder of Programming Yanxuan. Nearly 300k followers online; expertise in distributed system design, AIGC application development, and quantitative finance investing.

371
Articles
0
Likes
109
Views
0
Comments
Recent Articles

Latest from JavaEdge

100 recent articles max
JavaEdge
JavaEdge
Dec 15, 2024 · Cloud Computing

Is Serverless a Scam? Uncovering Hidden Costs, Complexity, and Reliability Risks

The article argues that serverless platforms often hide high costs, operational complexity, and reliability issues, contrasting them with traditional VPS and Cloudflare solutions while highlighting DDoS protection, pricing traps, and the challenges of managing micro‑service architectures.

DDoS protectioncloud computingoperational complexity
0 likes · 11 min read
Is Serverless a Scam? Uncovering Hidden Costs, Complexity, and Reliability Risks
JavaEdge
JavaEdge
Dec 8, 2024 · Backend Development

Netflix’s Service‑Level Priority Load Shedding: Protecting User‑Initiated Requests

This article explains how Netflix extended its priority load‑shedding strategy from the API gateway to individual services, detailing the classification of user‑initiated versus pre‑fetch requests, the implementation of partitioned concurrency limiters, CPU‑ and I/O‑based shedding, test results, and real‑world impact on availability.

Netflixbackend architectureconcurrency limits
0 likes · 18 min read
Netflix’s Service‑Level Priority Load Shedding: Protecting User‑Initiated Requests
JavaEdge
JavaEdge
Dec 1, 2024 · Artificial Intelligence

Exploring the Limits and Benchmarks of Qwen’s QwQ‑32B‑Preview AI Model

QwQ‑32B‑Preview, an experimental AI model from the Qwen team, showcases strong reasoning in math and programming while facing challenges like language switching, inference loops, safety concerns, and variable capabilities across domains, with benchmark scores ranging from 50% to over 90% on tests such as GPQA, AIME, MATH‑500, and LiveCodeBench.

AI benchmarkLLMQwen
0 likes · 7 min read
Exploring the Limits and Benchmarks of Qwen’s QwQ‑32B‑Preview AI Model
JavaEdge
JavaEdge
Nov 24, 2024 · Fundamentals

How to Measure and Tame Technical Debt with Practical Metrics

The article explains what technical debt is, why it matters, and presents a set of concrete metrics—such as WTFs per minute, code smells, test and documentation coverage, effort on deprecated components, defect‑fix work, and vulnerability counts—to help teams identify, monitor, and reduce technical debt effectively.

code qualitydocumentationsoftware maintenance
0 likes · 13 min read
How to Measure and Tame Technical Debt with Practical Metrics
JavaEdge
JavaEdge
Nov 21, 2024 · Backend Development

Inside Booking.com’s Real‑Time Ranking Engine: Architecture, Challenges & Solutions

Booking.com’s ranking platform uses sophisticated machine‑learning models and a multi‑cluster backend architecture to deliver personalized hotel search results, detailing data pipelines, feature engineering, service components, performance challenges, and optimization techniques such as static fallback, multi‑stage ranking, and model inference acceleration.

Ranking
0 likes · 13 min read
Inside Booking.com’s Real‑Time Ranking Engine: Architecture, Challenges & Solutions
JavaEdge
JavaEdge
Nov 20, 2024 · Artificial Intelligence

7 Proven Strategies to Simplify Large Language Model Deployment

The article explains why deploying large language models is challenging and presents seven practical techniques—including defining deployment boundaries, model quantization, inference optimization, infrastructure consolidation, model replacement planning, GPU utilization, and using smaller models—to make LLM deployment more efficient and cost‑effective.

GPU OptimizationLLM deploymentQuantization
0 likes · 24 min read
7 Proven Strategies to Simplify Large Language Model Deployment
JavaEdge
JavaEdge
Nov 17, 2024 · Backend Development

How Netflix’s Data Gateway Simplifies Distributed Database Access

This article explains how Netflix built the Data Gateway platform to abstract and protect complex distributed databases, detailing its motivation, architecture, component overview, declarative runtime and deployment configurations, and real‑world case studies such as key‑value services, secure RDS, and seamless data migration.

Data GatewayDeclarative DeploymentKey-Value Service
0 likes · 20 min read
How Netflix’s Data Gateway Simplifies Distributed Database Access
JavaEdge
JavaEdge
Nov 16, 2024 · Backend Development

How Netflix Built a Low‑Latency Distributed Counter Service at Scale

This article explains Netflix's distributed counter abstraction built on their time‑series service, detailing use cases, API design, counter types, implementation methods, control‑plane configuration, performance results, and future work to achieve near‑real‑time, low‑latency counting at massive scale.

Eventual ConsistencyNetflixbackend architecture
0 likes · 25 min read
How Netflix Built a Low‑Latency Distributed Counter Service at Scale
JavaEdge
JavaEdge
Nov 9, 2024 · Artificial Intelligence

Build an AI‑Powered Airline Ticket Agent with Spring AI Alibaba

This tutorial walks through creating an intelligent airline‑ticket customer‑service agent using Spring AI Alibaba, covering requirements, architecture, RAG integration, function calling, chat memory, core capabilities, code implementation with ChatClient, and a sample running result.

AI AgentAlibabaChat Memory
0 likes · 9 min read
Build an AI‑Powered Airline Ticket Agent with Spring AI Alibaba