Tag

performance optimization

Bilibili Tech
Jan 17, 2025 · Backend Development

NeighborHash: An Enhanced Batch Query Architecture for Real‑time Recommendation Systems

NeighborHash is a distributed batch‑query architecture for real‑time recommendation systems that combines a cache‑line‑optimized hash table—featuring Lodger Relocation, bidirectional cache‑aware probing, and inline‑chaining—with an NVMe‑backed key‑value service, versioned updates, and asynchronous memory‑access chaining to achieve sub‑microsecond, high‑throughput top‑N retrieval.

AMAC, Batch Query, Distributed Storage
20 min read
Bilibili Tech
Jan 10, 2025 · Frontend Development

Design and Evolution of Bilibili's Activity Platform Low‑Code System

Bilibili’s Activity Platform low‑code system was completely redesigned over two years, introducing a modular, decoupled architecture with an iframe‑based canvas, unified renderer, and Eva‑CLI tools that cut activity build time from three days to 0.48 days, boosted coverage to 77% of activities, and reduced first‑contentful‑paint by 35%.

Bilibili, Frontend Architecture, activity platform
31 min read
DaTaobao Tech
Oct 18, 2024 · Artificial Intelligence

Taobao AI Virtual Try-On: Offline Data Processing and Performance Optimization

Taobao’s AI virtual‑try‑on system pre‑computes fitting results offline, writes them into the Item Center via scalable ScheduleX tasks, optimizes pagination, locking and flow‑control, and thereby processes millions of apparel items in under thirty minutes with 99.9% success and reliable checkpoint‑resume monitoring.

AI, Big Data, Offline Processing
16 min read
Tencent Cloud Developer
Apr 25, 2024 · Fundamentals

Cache, Prefetching, False Sharing, Pipeline and Data Dependency: Performance Optimization in Rust

The article uses Rust benchmarks to show how cache layout, prefetching, associativity, false sharing, pipeline stalls, and loop data dependencies impact performance, and demonstrates practical optimizations such as row‑major traversal, proper alignment, avoiding dependent loops, and leveraging sequential access to achieve near‑optimal speed.

CPU cache, Rust, data dependency
17 min read
Tencent Cloud Developer
Apr 23, 2024 · Backend Development

Performance Optimization of the qs Library: A 5× Speedup Case Study

A Tencent engineer fixed a severe memory leak in the qs library’s encode function by processing 30 MB strings in 1024‑character chunks, reducing heap usage from 2.5 GB to 0.48 GB (about 5× less) and cutting runtime from 7.9 s to 2.1 s (about 3.8× faster), and contributed the change as an open‑source pull request.
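
The chunking idea in this summary can be sketched as follows. This is a minimal illustration in Python, not the qs library's JavaScript code or the actual pull request: the function name `encode_in_chunks` and the constant `CHUNK_SIZE` are hypothetical, and only the 1024-character chunk size comes from the summary. It shows why encoding slice by slice gives the same result as encoding the whole string while keeping each intermediate piece small.

```python
from urllib.parse import quote

CHUNK_SIZE = 1024  # characters per slice; the figure quoted in the summary


def encode_in_chunks(text: str, chunk_size: int = CHUNK_SIZE) -> str:
    """Percent-encode text one fixed-size slice at a time (hypothetical helper)."""
    parts = []
    for start in range(0, len(text), chunk_size):
        # Python slices by code point, so a slice never splits a character.
        chunk = text[start:start + chunk_size]
        parts.append(quote(chunk, safe=""))
    # Percent-encoding works character by character, so joining the encoded
    # slices yields the same string as encoding the input in one call.
    return "".join(parts)
```

In JavaScript a slice boundary can fall inside a surrogate pair, so a real port would presumably need to handle that case; this sketch does not model it.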

JavaScript, Memory Leak, Node.js
10 min read
iQIYI Technical Product Team
Jan 19, 2024 · Backend Development

Design and Optimization of Distributed and Local Shared Variables for Strategy Engine Services

By introducing distributed and local shared variables that propagate user profiles via trace context and cache parallel requests, the iQIYI strategy engine cuts redundant DMP calls, reduces traffic up to 25%, lowers P99 latency by nearly 50%, and achieves a 90% cost saving compared to step‑function micro‑services.

Caching, Distributed Systems, Microservices
19 min read
DaTaobao Tech
Jan 17, 2024 · Backend Development

Scaling and Performance Optimization of Taobao Shopping Cart

Taobao’s shopping cart was scaled and optimized by raising the item limit to 380, deploying the cloud‑native in‑memory read‑only replica tairSQL for read‑write separation, pre‑computing promotions, compressing payloads, caching data, redesigning the protocol, introducing response‑streaming APIs, and parallelizing per‑item processing with Java’s ForkJoinPool, dramatically cutting latency during traffic spikes.

Parallel Processing, cloud-native, database
15 min read
Tencent Cloud Developer
Jul 13, 2023 · Frontend Development

Sogou Baike Mobile Front‑End System Upgrade: Architecture, Process, and Performance Optimization

The Sogou Baike mobile front‑end upgrade consolidated three fragmented projects into a single repository, introduced React‑SSR with routing and Redux, streamlined CI/CD, trimmed SSR payloads, lazy‑loaded assets and cut first‑screen requests, resulting in a 24% faster load, 35% bandwidth reduction, 11% CTR rise and a modern, cross‑stack foundation.

React, SSR, Web Development
16 min read
DaTaobao Tech
Jul 10, 2023 · Backend Development

Engineering Growth: Lessons from Alibaba's ACCS Messaging Service

By applying a six‑step growth formula (motivation, product thinking, problem definition and solving, derivative thinking, data validation, and summarizing), the author transformed Alibaba’s ACCS long‑connection messaging service, cutting latency by 40%, eliminating cold‑start delays, and dramatically reducing user complaints while aligning technical work with business goals and social responsibility.

ACCS, Messaging, Mobile Backend
11 min read
DaTaobao Tech
Apr 19, 2023 · Frontend Development

Reflections on a Decade of Front-End Development at Taobao

Over nine years at Taobao, a front‑end engineer recounts his journey from junior developer to team lead, detailing rapid feature delivery, data‑driven performance gains, cross‑platform innovations, and leadership lessons that shaped the platform’s e‑commerce experience and his own growth.

Front-end, career, data-driven
20 min read
DeWu Technology
Mar 6, 2023 · Backend Development

Warehouse Inventory System Model Upgrade and Performance Optimization

To handle exploding product inventory data, the company overhauled its warehouse inventory model by eliminating risky document‑hand‑offs, storing only changed rows instead of daily snapshots, and syncing transformed data to a data‑warehouse for reporting, which cut monthly accounting time by 30 hours (≈30 %), improved accuracy, enabled new analytics, and introduced TiDB migration and team upskilling.

Data Modeling, Data Warehouse, Inventory
7 min read
DaTaobao Tech
Jan 30, 2023 · Backend Development

Taobao Shopping Cart 5-Year Technical Upgrade and Accumulation

Over five years, Taobao’s shopping cart transformed through successive architectural overhauls, performance tuning, and new features, confronting scaling challenges with innovative solutions and ultimately delivering a more robust, efficient system while documenting valuable lessons learned for future large‑scale e‑commerce development.

System Architecture, e‑commerce, performance optimization
1 min read
Tencent Cloud Developer
Dec 3, 2022 · Backend Development

Refactoring a Decade-Old Query Optimizer: Architecture, DIFF Fixes, Performance Gains, and Stability Improvements

Tencent engineers completely rewrote a ten‑year‑old query optimizer, shrinking the codebase by 80% and replacing its monolithic thread‑pool with a tRPC‑Fiber DAG scheduler, which cut latency by 28%, reduced startup time to five minutes, saved 40 GB memory, boosted throughput 12%, fixed numerous stability bugs, raised test coverage above 60%, and accelerated new‑feature lead time to under a day.

Backend Development, c++, diff testing
15 min read
Xiaohongshu Tech REDtech
Nov 16, 2022 · Operations

Design and Implementation of a Continuous Performance Optimization and Tracking Platform for Xiaohongshu Services

To curb rising resource costs as Xiaohongshu scales, engineers built a Continuous Performance Optimization & Tracking Platform that continuously profiles services, stores diff‑analyzed data in ClickHouse, automatically detects tiny regressions, links them to code changes, and has already flagged and saved roughly 20,000 CPU cores across search, recommendation and advertising workloads.

Big Data, cloud-native, continuous monitoring
16 min read
Shopee Tech Team
Aug 12, 2022 · Backend Development

Shopee Video Technology: Backend Services, High‑Definition Low‑Bitrate Optimization, and Performance Enhancements

Shopee’s video platform combines live‑stream and on‑demand transcoding, link‑mic, multi‑party mixing, and backend editing services with a proprietary high‑definition low‑bitrate pipeline that leverages GPU and CPU encoders, AI‑enhanced pre‑processing, hierarchical B‑frames, and SIMD‑optimized sharpening to deliver high‑quality video on low‑end devices while cutting compute costs, and the company is actively recruiting engineers for further development.

AI enhancement, Backend Services, low bitrate
19 min read
Tencent Cloud Developer
Mar 7, 2022 · Backend Development

Comprehensive Guide to Caching: Concepts, Types, Strategies, and Best Practices

This comprehensive guide explains caching fundamentals, motivations, and hardware/software perspectives, details client, network, and server cache types, addresses common problems like breakdown, avalanche, and penetration with solutions, and covers strategies, consistency, expiration policies, distributed architectures, design best practices, and implementation examples.

Backend Development, Cache Strategies, Caching
52 min read
Shopee Tech Team
Jan 20, 2022 · Game Development

Shopee Games: Engine Selection, Optimization, and Engineering Practices for H5 Game Development

Shopee Games chose the Egret engine for its lightweight 2D performance on low‑end Southeast Asian devices, built a custom toolchain with texture‑atlas and draw‑call optimizations, trimmed unused modules, migrated the project to a modern npm‑based workflow with Webpack, linting, testing and CI/CD, and created scaffolding utilities that now let developers treat H5 game development like any frontend project, speeding iteration and improving code quality.

CI/CD, Egret engine, Webpack
19 min read
DeWu Technology
Dec 23, 2021 · Mobile Development

iOS Lag Monitoring and Performance Optimization Using RunLoop and Flame Graphs

The article describes an iOS lag‑monitoring system that detects main‑thread RunLoop blocks, samples stacks every 50 ms into a circular buffer, uses an annealing Fibonacci‑based interval to minimize overhead, visualizes hot paths with flame graphs, and streams data through a Flink‑APM pipeline with only ~2 % CPU and a few megabytes of memory impact.
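
The summary does not spell out how the Fibonacci-based annealing works, so the following is only one plausible reading, sketched in Python rather than the article's iOS code: when the same lag stack keeps recurring, the monitor waits a Fibonacci-growing number of samples before reporting it again, and resets as soon as a different stack appears. The class name `AnnealingReporter` and its method are hypothetical.

```python
class AnnealingReporter:
    """Decide whether a captured lag stack should be reported (illustrative only)."""

    def __init__(self) -> None:
        self._last_stack = None
        self._prev, self._curr = 1, 1  # adjacent Fibonacci numbers
        self._skips_left = 0

    def should_report(self, stack: str) -> bool:
        if stack != self._last_stack:
            # A new stack: report it and restart the Fibonacci sequence.
            self._last_stack = stack
            self._prev, self._curr = 1, 1
            self._skips_left = self._curr
            return True
        if self._skips_left > 0:
            # Same stack as before and still inside the waiting window.
            self._skips_left -= 1
            return False
        # The window has elapsed: report again and lengthen the next window.
        self._prev, self._curr = self._curr, self._prev + self._curr
        self._skips_left = self._curr
        return True
```

Under this reading a persistent stall is reported less and less often (after 1, then 2, 3, 5 skipped samples), which is how the overhead would stay bounded while new stalls are still reported immediately.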

Flame Graph, Lag Monitoring, RunLoop
12 min read
DeWu Technology
Nov 23, 2021 · Mobile Development

iOS Lag Monitoring and Performance Optimization Using Runloop Detection and Flame Graphs

The article describes an iOS lag‑monitoring system that uses RunLoop state checks to detect main‑thread stalls, captures and caches stack traces, employs a Fibonacci‑based annealing algorithm to limit overhead, symbolises addresses on a server, and visualises results with flame graphs while maintaining low CPU and memory impact.

Flame Graph, Lag Monitoring, RunLoop
13 min read
DeWu Technology
Nov 16, 2021 · Databases

Full-Chain Load Testing: Redis Large-Key and Memory Leak Issues and Solutions

Full‑chain load testing uncovered a Redis large‑key bottleneck that saturated bandwidth, a memory leak caused by repeatedly registering shutdown hooks, and persistently high JVM heap usage, leading to solutions of key sharding with local caching, registering hooks only once, and adjusting heap size, physical memory, and alert thresholds.

Memory Leak, Redis, backend
7 min read