Tagged articles
578 articles
DaTaobao Tech
Jul 19, 2023 · Operations

Data‑Driven Optimization of Taobao Logistics Experience: Problem Definition, Metric Design, and Strategy Implementation

The article details Taobao’s data‑driven approach to redesigning logistics information display and self‑service tickets: defining problems, preparing subjective and objective data, creating metrics, analyzing pain points, and implementing timed soothing messages and proactive tickets. A/B tests showed reduced help volume and improved user satisfaction.

Logistics · Metrics · User experience
0 likes · 12 min read
Huolala Tech
Jul 13, 2023 · Operations

How HuoLaLa Built a 0‑to‑1 Stability Metric System in 2 Years

This article explains how HuoLaLa’s stability team tackled the challenge of proving their work’s value by designing and implementing a comprehensive stability metric system from scratch, detailing the motivations, principles, step‑by‑step construction, data platform, cultural adoption, measurable results, and future plans.

Data-driven · Metrics · Operations
0 likes · 18 min read
Top Architect
Jul 11, 2023 · Operations

Introducing MyPerf4J: A High‑Performance Java Monitoring and Statistics Tool

MyPerf4J is a Java‑agent based, low‑overhead performance monitoring library that provides real‑time method, memory, GC and class metrics for high‑concurrency, low‑latency applications, offering quick start, configurable properties, and detailed statistical reports for both development and production environments.

Java · JavaAgent · Metrics
0 likes · 7 min read
dbaplus Community
Jul 10, 2023 · Operations

Why Most Logging and Metrics Strategies Fail – and How to Fix Them

The author reflects on the shortcomings of current logging, metrics, and tracing practices, explains why they become costly and unscalable, and offers concrete recommendations—including log level discipline, structured logging, metric aggregation, and the use of tools like Prometheus, Cortex, and Thanos—to build a more efficient observability stack.

Metrics · Observability · Prometheus
0 likes · 18 min read
DataFunSummit
Jul 5, 2023 · Artificial Intelligence

Fairness in Recommendation Systems: Consumer and Provider Perspectives

This article examines the fairness of recommendation systems from both consumer and provider viewpoints, discussing sources of bias, definitions of equality and equity, measurement metrics such as CGF and MMF, causal embedding techniques, experimental results on MovieLens and Yelp, and future research directions.

Fairness · Metrics · Recommendation Systems
0 likes · 9 min read
Efficient Ops
Jun 29, 2023 · Operations

How ICBC‑RS Achieved Leading‑Edge DevOps Continuous Delivery Level 3 in China

ICBC‑RS Fund Management’s Transaction Management Platform passed the China Academy of Information and Communications Technology’s DevOps Continuous Delivery Level 3 assessment, showcasing significant improvements in build times, release cycles, and organizational efficiency while highlighting the cultural and tooling benefits of standardized DevOps practices.

Case Study · Continuous Delivery · DevOps
0 likes · 14 min read
Efficient Ops
Jun 29, 2023 · Operations

How China’s Leading Futures Exchange Achieved Top‑Tier DevOps Continuous Delivery – A Deep Dive

This article details how Shanghai Financial Futures' technology subsidiary passed the Level 3 Continuous Delivery assessment of the national DevOps maturity model, showcasing the platform’s cloud‑native architecture, metric‑driven improvements, challenges faced, and future plans for scaling DevOps practices across the exchange.

Case Study · Continuous Delivery · DevOps
0 likes · 14 min read
Architects Research Society
Jun 13, 2023 · R&D Management

Enterprise Architecture Governance: Frameworks, Principles, Roles, Processes, and Tools

This article provides a comprehensive overview of enterprise architecture (EA) governance, covering its definition, context, framework components, guiding principles, organizational structure, roles and responsibilities, processes, classification, metrics, and tool selection to help organizations align IT with business strategy and achieve cost savings and compliance.

Framework · Metrics · Roles
0 likes · 22 min read
DataFunTalk
Jun 12, 2023 · Big Data

Tencent Oula Data Asset Suite: End‑to‑End Data Production and Governance Framework

The article presents Tencent Oula's comprehensive data‑asset platform that integrates data collection, integration, warehouse and metric modeling, unified services, and a governance engine to create trustworthy, low‑entropy data assets while addressing common data‑governance challenges and outlining future AI‑for‑BI possibilities.

AI for BI · Metrics · data modeling
0 likes · 20 min read
Data Thinking Notes
Jun 11, 2023 · Product Management

How to Score Data Tags for Better Governance and Resource Optimization

This article explains why tag scoring is essential for data governance, outlines a five‑dimensional scoring model—including usage, attention, quality, continuous optimization, and security—and demonstrates how the scores can drive dashboards, alerts, and resource‑saving decisions.

Data Governance · Metrics · Resource Optimization
0 likes · 9 min read
dbaplus Community
Apr 26, 2023 · Big Data

How Bilibili Scaled Big Data Governance: From Reactive to Proactive

This article details Bilibili's journey from rapid data growth to a structured big‑data governance framework, describing the challenges of fragmented ownership, the "Wanglou" project launch, asset metadata modeling, metric design, user engagement strategies, automation tools, and the shift from reactive to proactive, multi‑dimensional resource management.

Metrics · Resource Optimization · strategy
0 likes · 21 min read
Tencent Cloud Developer
Apr 17, 2023 · Backend Development

Five Methodologies for Backend Development: Quantification, Comparison, Documentation & Process, Standardization & Unification, Automation

An experienced Tencent backend engineer outlines five practical methodologies—quantification of technical and business metrics, systematic comparison of performance data, thorough documentation and defined processes, consistent standardization across code and tools, and extensive automation via CI/CD and testing—to boost efficiency, quality, and reliability in backend development.

Metrics · processes
0 likes · 15 min read
dbaplus Community
Apr 5, 2023 · Cloud Native

How Baidu’s Search Platform Achieves Billion‑Scale Observability in a Cloud‑Native Era

This article explains why observability is critical in cloud‑native architectures and describes how Baidu’s search middle‑platform handles hundred‑billion‑level traffic by implementing low‑cost real‑time metrics, distributed tracing, log querying and topology analysis, while tackling challenges of massive microservice scale, scenario‑level monitoring, and efficient resource usage.

Metrics · Observability · cloud-native
0 likes · 12 min read
Baidu Intelligent Testing
Apr 3, 2023 · Operations

R&D Efficiency Analysis: From Metric Definition to a Digital Decision‑Making System

This article explains how to measure and improve R&D efficiency by defining core factors, building data‑driven analysis models, presenting practical case studies on human productivity, code‑review processes and workflow bottlenecks, and describing the technical architecture of a digital platform that turns metrics into actionable decisions.

Metrics · R&D efficiency · Software Engineering
0 likes · 24 min read
SQB Blog
Mar 27, 2023 · Frontend Development

How to Build a Full‑Featured Front‑End Monitoring System

This article explains how to design and implement a comprehensive front‑end monitoring solution that captures errors, performance metrics, and client data, covering data collection, tracing, transmission, storage, and analysis to help developers quickly locate and resolve issues.

Metrics · Web Performance · client data
0 likes · 11 min read
Architect
Mar 21, 2023 · Operations

Log Management, Observability, and APM Practices in Distributed Systems

This article explains what logs are, when to record them, their value in large‑scale architectures, and how to build effective logging, metrics, and tracing platforms using tools such as ELK, Prometheus, and SkyWalking, while also presenting good and bad logging practices and sample batch‑log retrieval code.

APM · Distributed Systems · ELK
0 likes · 20 min read
Zhuanzhuan Tech
Mar 8, 2023 · Product Management

A Comprehensive Guide to A/B Testing: Principles, Design, Metrics, and Decision Making

This article explains the fundamentals of A/B testing, why it is essential for data‑driven product decisions, how to design and run experiments—including hypothesis formulation, metric selection, sample size calculation, traffic segmentation, and duration planning—and how to analyze results using T‑tests, P‑values, and structured decision processes.

A/B testing · Metrics · decision making
0 likes · 15 min read
Xingsheng Youxuan Technology Community
Mar 2, 2023 · Product Management

Why Traditional Growth Tactics Fail: Lessons from Three Years of Community E‑Commerce Subsidies

The article reflects on the 2020‑2022 subsidy wars in community e‑commerce, exposing classic growth misconceptions, the limits of traffic‑centric strategies, the pitfalls of single‑metric thinking, and the need for holistic, value‑driven product management to achieve sustainable user growth.

Growth · Marketing · Metrics
0 likes · 16 min read
dbaplus Community
Feb 21, 2023 · Operations

How Standardized Application Monitoring Boosts Operational Efficiency

This article reviews G Bank's multi‑year journey to standardize application monitoring, detailing the methodology, models, metrics, automation mechanisms, and quantitative evaluation that together improve visibility, early fault detection, and overall operations management for both traditional and distributed systems.

Metrics · Operations · aiops
0 likes · 18 min read
DataFunSummit
Feb 20, 2023 · Product Management

Evaluating the Value of Data Products: Scenarios, Frameworks, and Improvement Methods

This article explains why data product value assessment is essential, outlines common usage scenarios and a DBA evaluation framework, describes quantitative methods such as usage, business, and data‑driven metrics, and offers practical ways to enhance data product value through metric optimization, high‑value direction selection, and resource allocation.

Big Data · Data Product · Metrics
0 likes · 13 min read
Data Thinking Notes
Feb 14, 2023 · Big Data

How Cloud Music Turned 60k Tables into Valuable Data Assets

This article details Cloud Music's year‑long data assetization journey, covering the background, practical achievements, governance methods, and future roadmap for turning massive data warehouses into high‑value, well‑governed assets that drive cost reduction and business insight.

Big Data · Data Governance · Data Platform
0 likes · 10 min read
37 Interactive Technology Team
Feb 10, 2023 · Backend Development

Analysis of Golang SQL Connection Pool Mechanism and Usage

The article examines Go’s database/sql connection pool implementation, showing how reusing connections cuts latency, explains idle/in‑use/closed state transitions, details configuration parameters such as MaxIdleConns and MaxOpenConns, demonstrates metric collection via Gorm DBStats for monitoring, and provides a stress‑test illustrating the impact of proper tuning.

Connection Pool · GORM · Golang
0 likes · 10 min read
DataFunSummit
Feb 8, 2023 · Product Management

Content‑Driven Data Product Management: Challenges, Governance Frameworks, and Implementation Strategies

This article shares practical insights from a data product expert on the problems faced by content‑oriented data products, outlines a comprehensive governance methodology—including DAMA, Huawei, and Alibaba frameworks—and demonstrates how to operationalize these ideas through concrete examples such as event‑tracking and metric governance.

Big Data · Data Governance · Data Product Management
0 likes · 16 min read
Alibaba Cloud Native
Feb 8, 2023 · Cloud Native

Alibaba Cloud Prometheus vs Open‑Source Prometheus: Deep Performance Benchmark

This article benchmarks Alibaba Cloud Prometheus against the open‑source Prometheus across multiple cluster sizes, churn rates, and query patterns, revealing that while the open‑source version remains stable under light load, its CPU and memory usage grow non‑linearly with high cardinality, whereas Alibaba's managed service delivers higher compatibility, better query performance, and more predictable scaling.

Cloud Native · Metrics · Observability
0 likes · 30 min read
ByteFE
Feb 6, 2023 · Frontend Development

Front‑End Performance Optimization: Key Metrics and Practical Techniques to Reduce First Paint and Improve Interactivity

This article explains essential front‑end performance metrics such as FP, FCP, LCP, TTI, FID, TBT and CLS, and provides a comprehensive set of network, code, CSS, image, and build‑time optimization techniques—including gzip, HTTP/2, lazy loading, SSR, debouncing, and tree‑shaking—to dramatically shorten white‑screen time and improve user experience on H5 pages.

Metrics · Web · optimization
0 likes · 18 min read
DataFunTalk
Jan 29, 2023 · Artificial Intelligence

Data Science Practices in E‑commerce Search: Experimentation, Causal Inference, and Metric Design

This article presents the JD Retail search data‑science team's practical approaches to e‑commerce search, covering the scene’s unique data characteristics, order attribution methods, AB experiment design, causal‑inference frameworks, variance‑reduction techniques, quasi‑experimental evaluations, and metric design for traffic distribution, all illustrated with real‑world examples and visualizations.

Data Science · Metrics · causal inference
0 likes · 18 min read
DevOps
Jan 28, 2023 · R&D Management

How to Add Metrics Without Burdening R&D Teams

The article examines common pitfalls when introducing performance metrics in software development teams, offering five key considerations—metric selection, implementation cost, proper usage, clear value, and team consensus—to ensure metrics drive improvement without adding unnecessary burden.

Metrics · R&D management · performance measurement
0 likes · 6 min read
dbaplus Community
Jan 26, 2023 · Operations

Unified Metrics, Tracing, and Logging: A Financial Firm’s Path to Microservice Observability

Facing the challenges of distributed microservice architectures, a financial services company implemented a unified observability platform that combines metrics, tracing, and logging via OpenTelemetry and custom agents, enabling real‑time visualization, anomaly detection, and performance analysis across seven core business middle‑platforms.

Distributed Tracing · Metrics · Microservices
0 likes · 17 min read
dbaplus Community
Jan 16, 2023 · Operations

Beyond Success‑Ratio: How User‑Uptime Reveals Real Product Availability

The article reviews traditional availability metrics such as Success‑Ratio, Error‑Budget, MTTR/MTTF, SLA/SLO, and highlights their limitations, then introduces Google’s User‑Uptime and Windowed User‑Uptime metrics, explains their definitions, challenges, experimental results, and why they provide a more user‑centric view of service reliability.

Availability · Metrics · SRE
0 likes · 27 min read
NetEase Yanxuan Technology Product Team
Jan 16, 2023 · Backend Development

Design and Implementation of a Business‑Facing Message Center Management Platform

The platform centralizes message‑center management for e‑commerce by adding end‑to‑end tracing, real‑time metrics, and unified logging, enabling business users to query message links, view dashboards, automate retries and approvals, dramatically reducing manual monitoring, improving completion rates above 90%, and paving the way for cost‑optimized, data‑driven operations.

DevOps · Metrics · Observability
0 likes · 15 min read
DataFunSummit
Jan 9, 2023 · Big Data

JD Data‑Driven Business Development: Building a Business Metric Data System and Marketplace Governance

The article outlines JD's data‑driven business development strategy, describing the current challenges of its business data marketplace, the governance framework—including layered architecture, standardization, ClickHouse dictionary refresh, and optimization measures—and the resulting performance improvements and future outlook.

Big Data · ClickHouse · Data Governance
0 likes · 13 min read
DaTaobao Tech
Jan 6, 2023 · Artificial Intelligence

Two‑Stage Ranking Optimization in E‑commerce Search: From Coarse to Fine Ranking

The paper presents a two‑stage e‑commerce search framework where the coarse‑ranking stage is redesigned with multi‑objective optimization, expanded negative sampling, and listwise distillation—guided by a new global transaction hitrate metric—enabling it to surpass fine‑ranking on large candidate sets and boost overall GMV by about one percent.

Metrics · coarse ranking · e‑commerce
0 likes · 25 min read
DevOps
Jan 5, 2023 · R&D Management

Local Optimization vs Global Quality: Metrics, Bugs, and Team Capability

This article examines how fragmented metrics and local optimizations can harm overall software quality, discussing the bug‑vs‑feature debate, various measurement approaches, and the importance of viewing quality as a collective team capability rather than an individual or departmental responsibility.

Metrics · R&D management · Software quality
0 likes · 10 min read
JD Tech Talk
Jan 3, 2023 · R&D Management

Agile Transformation Practices and Lessons Learned in JD Cloud Platform R&D Department

This article summarizes four years of agile transformation in the JD Cloud Platform R&D department, detailing cultural building, continuous agile practices, practical guidelines, effectiveness measurement, and insight analysis across five key areas to improve organizational, technical, and project agility.

Continuous Improvement · Metrics · project-management
0 likes · 40 min read
HelloTech
Dec 23, 2022 · Cloud Native

Design Principles and Implementation Details of Kubernetes Horizontal Pod Autoscaler and Custom Water Pod Autoscaler

The article explains Kubernetes’ built‑in Horizontal Pod Autoscaler, then details the custom Water Pod Autoscaler (WPA) that extends HPA with dual‑signal (load and SOA registration) detection, dual‑threshold scaling, noise filtering, configurable cooldown, frequency limits, tolerance buffers, and integrated alerting for reliable elastic scaling.

Cloud Native · HPA · Kubernetes
0 likes · 13 min read
Inke Technology
Dec 19, 2022 · Backend Development

How to Build a Highly Available, Stable, and Observable SMS Service

This article explains how to design a high‑availability SMS system by identifying stability bottlenecks, defining reliability goals, implementing failover strategies for Redis, MySQL and external services, establishing a comprehensive observability framework, and measuring key quality metrics to ensure 99.99% uptime.

Backend · Metrics · Observability
0 likes · 11 min read
DevOpsClub
Dec 19, 2022 · R&D Management

How ByteDance’s DevMind Platform Transforms R&D Efficiency Measurement

The article details ByteDance’s DevMind platform, describing its origins, the challenges of measuring software development efficiency, the collaborative and value‑driving “flywheel” concepts, the architectural design across data lifecycle and query engine layers, and the principles and future roadmap for scaling R&D performance.

Data Platform · DevMind · Metrics
0 likes · 29 min read
DataFunTalk
Dec 18, 2022 · Big Data

Application of Data Tags and Metrics in the Financial Industry

The article explains the concepts, classifications, construction methods, and practical usage of data tags and metrics in the financial sector, illustrating how to build indicator and label systems and how to apply them effectively for refined customer operations and business management.

Financial Industry · Indicator System · Metrics
0 likes · 13 min read
Open Source Linux
Dec 15, 2022 · Cloud Native

Kubernetes 1.26 ‘Electrifying’: Key New Features, Deprecations, and Upgrades

Kubernetes 1.26, themed “Electrifying,” introduces 37 enhancements—including registry changes, storage upgrades, signed release artifacts, Windows high‑privilege containers, metric and scheduling improvements—while promoting 11 features to stable, deprecating 12 APIs, and emphasizing sustainability and carbon‑footprint awareness.

Cloud Native · Kubernetes · Metrics
0 likes · 10 min read
Cloud Native Technology Community
Dec 14, 2022 · Cloud Native

Kubernetes v1.26 Release: New Features, Enhancements, and Deprecations

Kubernetes 1.26 is officially released, introducing 37 enhancements—including 11 stable and 10 beta features—while deprecating 12 APIs, updating the container image registry, removing CRI v1alpha2, advancing storage CSI migrations, enhancing metrics, and adding support for Windows privileged containers and dynamic resource allocation.

CSI · Container Runtime Interface · Kubernetes
0 likes · 15 min read
Open Source Linux
Dec 8, 2022 · Operations

Master Prometheus: From Metrics Collection to Alerting and Visualization

Prometheus is an open‑source monitoring solution that covers metric exposition, scraping, storage, querying, visualization, and alerting, and this guide walks through its architecture, configuration, custom exporters, PromQL queries, Grafana integration, and alert management, providing a comprehensive introduction for developers and ops engineers.

Alerting · Exporter · Grafana
0 likes · 22 min read
Bilibili Tech
Dec 2, 2022 · Big Data

Data Quality Management: Expectations, Measurement, Assurance, and Operation

The article outlines a complete data‑quality‑management framework that first captures business expectations, then translates them into basic and personalized measurement rules, defines four assurance approaches for handling violations, and scales operation with indicators, tooling, and metrics to continuously improve data quality across the lifecycle.

Data Governance · Data Quality · Metrics
0 likes · 19 min read
Efficient Ops
Nov 29, 2022 · Operations

How to Retrieve and Process Prometheus Metrics via Its API

This article explains how to use the Prometheus HTTP API to query instant and range metrics, interpret the JSON responses, and fetch data programmatically with Python, providing code examples and details on request parameters, error handling, and practical usage.

API · DevOps · Metrics
0 likes · 8 min read
FunTester
Nov 27, 2022 · Fundamentals

Why Performance Testing Matters: Key Metrics, Types, and Best Practices

This guide explains what performance testing is, why it’s essential, the key metrics such as throughput, response time, and bandwidth, outlines a step‑by‑step testing process, compares load, stress, endurance and capacity testing types, and reviews popular tools like JMeter, LoadRunner and NeoLoad.

Load Testing · Metrics · Performance Testing
0 likes · 10 min read
macrozheng
Nov 19, 2022 · Operations

Unlocking Prometheus: Visual Guide to Architecture, Metrics, and Alerts

This article visually explains Prometheus’s architecture, core features, metric collection methods, exporters, PromQL query language, and alerting workflow, helping readers understand how to monitor cloud‑native systems effectively while noting its strengths and limitations.

Alerting · Exporters · Metrics
0 likes · 8 min read
Alibaba Cloud Native
Nov 17, 2022 · Cloud Native

How RocketMQ Harnesses Prometheus for Full‑Stack Observability

This article explains how RocketMQ integrates with Prometheus and Grafana to provide comprehensive metrics, tracing, and logging, detailing the exporter architecture, deployment choices, span topology, dashboard examples, and ARMS‑based alerting for cloud‑native message‑queue observability.

ARMS · Cloud Native · Metrics
0 likes · 14 min read
macrozheng
Nov 5, 2022 · Operations

Unlock Full Observability in Spring Boot 3 with Micrometer Observation API

This article explains how Spring Boot 3.0.0‑RC1 integrates Micrometer Observation API to provide unified metrics, logging, and distributed tracing, showing the observation lifecycle, configuration steps, sample server and client code, Docker‑compose setup, and notes on native image support for comprehensive application observability.

Java · Metrics · Micrometer
0 likes · 26 min read
Architects Research Society
Nov 3, 2022 · Fundamentals

Programming Productivity: Definitions, Models, and Influencing Factors

Programming productivity, also known as software or development productivity, examines how output relates to input, covering definitions, measurement models such as COCOMO II and Jones’s factors, function points, value‑based engineering, and human aspects, while discussing efficiency, effectiveness, profitability, and various factors influencing individual and team efficiency.

COCOMO · Metrics · Software Engineering
0 likes · 12 min read
Baidu MEUX
Oct 26, 2022 · Product Management

How Baidu’s Light Design System Uses PATS Metrics to Boost Efficiency

This article explains why measuring a design system is essential, outlines how Baidu's Light Design System built a PATS metric framework through research and four practical steps, and shares real-world results that improved usability, reliability, and overall workflow efficiency.

Baidu · Metrics · PATS
0 likes · 10 min read
Architecture Digest
Oct 21, 2022 · Operations

Benchmarking and Sizing Your Elasticsearch Cluster for Logs and Metrics

This article explains how to assess hardware resources, calculate required Elasticsearch cluster size based on data volume, and perform indexing and search benchmark tests to ensure stable performance and optimal throughput for log and metric workloads in production environments.

Benchmarking · Cluster Sizing · Elasticsearch
0 likes · 10 min read
Efficient Ops
Oct 13, 2022 · Operations

Essential Guide to Effective Monitoring in Operations: Goals, Methods, and Tools

This article outlines the essential components of operational monitoring, covering monitoring objectives, methods, core processes, key tools, metrics for hardware, system, application, network, and business layers, as well as alerting, handling, and best practices for building a comprehensive, reliable monitoring solution.

Alerting · Metrics · system reliability
0 likes · 7 min read
dbaplus Community
Sep 26, 2022 · Backend Development

How Ctrip Replaced HBase with VictoriaMetrics & ClickHouse for Scalable Metrics Monitoring

Ctrip’s internal Dashboard monitoring platform, originally built on HBase, was redesigned by migrating its core writer and storage components to a hybrid VictoriaMetrics‑ClickHouse solution, delivering faster queries, higher write stability, and full Prometheus compatibility while keeping the user experience unchanged.

ClickHouse · Dashboard · HBase
0 likes · 13 min read
Practical DevOps Architecture
Sep 26, 2022 · Operations

Introduction to Prometheus Monitoring, Alertmanager, and Grafana with Course Outline

This article introduces the Prometheus monitoring platform, explains Alertmanager's grouping, inhibition and silencing features, describes Grafana's visualization and alerting capabilities, and provides a detailed course syllabus covering installation, configuration, and advanced monitoring techniques across various environments.

Alertmanager · Grafana · Metrics
0 likes · 4 min read
SQB Blog
Sep 15, 2022 · Frontend Development

Mastering Front-End Performance: How to Use PerformanceObserver & Metrics

This article explains how to monitor and analyze front‑end performance using the deprecated performance.timing API and the modern PerformanceObserver, detailing key web‑vital metrics such as TTFB, FCP, LCP, FID, and CLS, with code examples and practical interpretation guidelines.

Metrics · frontend · performanceobserver
0 likes · 13 min read
DevOpsClub
Sep 13, 2022 · R&D Management

How DevMind Transforms R&D Efficiency with Scalable Metrics

This article outlines the DevMind system—a comprehensive, data‑driven framework that turns R&D efficiency measurement into an online, low‑threshold, scalable practice, covering four best‑practice pillars, technical architectures, product modules, and real‑world impact within large organizations.

DevOps · Metrics · R&D efficiency
0 likes · 28 min read
Efficient Ops
Sep 7, 2022 · Operations

How DeepFlow Automates Full‑Stack Observability for Cloud‑Native Environments

This article presents DeepFlow, an open‑source, highly automated observability platform that uses eBPF to provide zero‑code AutoMetrics and AutoTracing, integrates with Prometheus, OpenTelemetry and SkyWalking, and enables SRE, DevOps and NewOps teams to build full‑stack metrics and blind‑spot‑free tracing for cloud‑native applications.

DevOps · Metrics · Observability
0 likes · 20 min read
dbaplus Community
Sep 5, 2022 · Operations

How EyesTSDB Evolved into a Cloud‑Native, Second‑Level Monitoring Platform

This article details the evolution of NetEase's self‑built time‑series database EyesTSDB into a cloud‑native, second‑level monitoring solution, covering its architecture, core features, integration with VictoriaMetrics, custom plugin workflow, CMDB linkage, real‑world use cases, and future challenges.

CMDB integration · Metrics · Observability
0 likes · 21 min read
ITPUB
Aug 29, 2022 · Backend Development

How Ctrip Replaced HBase with VictoriaMetrics & ClickHouse for Scalable Metrics Monitoring

This article details Ctrip's internal Dashboard monitoring platform, explains why its HBase‑based TSDB became a bottleneck, and describes the step‑by‑step migration to a hybrid VictoriaMetrics‑ClickHouse solution with upgraded writers, unified query APIs, performance gains, and future roadmap.

ClickHouse · HBase · Metrics
0 likes · 13 min read
Model Perspective
Aug 25, 2022 · Artificial Intelligence

Mastering Regression: Key Assumptions, Metrics, and Model Evaluation

This article explains the fundamental assumptions of linear regression, compares linear and nonlinear models, discusses multicollinearity, outliers, regularization, heteroscedasticity, VIF, stepwise regression, and reviews essential evaluation metrics such as MAE, MSE, RMSE, R² and Adjusted R².

Metrics · Model Evaluation · linear regression
0 likes · 12 min read
DevOps
Aug 17, 2022 · Operations

Measuring Success in Continuous Delivery: Four Key Metrics and Practical Tips

This article explains why measuring is essential for continuous delivery, introduces four valuable metrics—deployable package count, cycle time, mean time between failures, and mean time to recovery—and offers practical tips to improve delivery speed and reliability.

Continuous Delivery · DevOps · MTBF
0 likes · 7 min read
Programmer DD
Aug 16, 2022 · Databases

Master MySQL Monitoring with Built‑in SHOW Commands: A Complete Guide

This article explains how to collect comprehensive MySQL performance metrics—including connections, buffer cache, locks, statement counts, throughput, server variables, and slow‑query analysis—using only MySQL's native SHOW commands, providing a fast, low‑overhead monitoring solution.

Database Monitoring · Metrics · SQL
0 likes · 11 min read
Bilibili Tech
Aug 12, 2022 · Operations

SLO Implementation and Alerting Strategies – Bilibili SRE Practices

The article outlines Bilibili’s refined SLO framework—categorizing services into four business tiers, selecting availability, latency, and freshness SLIs, setting concrete SLO targets, and employing multi‑window error‑budget and consumption‑rate alerting strategies to improve stability and provide comprehensive quality dashboards.

Alerting · Metrics · SLO
0 likes · 18 min read
Snowball Engineer Team
Aug 5, 2022 · Big Data

Snowball Data Warehouse Modeling and OneData System Implementation

This article outlines Snowball's data warehouse background, compares major modeling approaches such as ER, dimensional, DataVault and Anchor models, describes the current challenges of their dimensional model, and details the OneData methodology—including OneModel, OneID, and OneService—along with its practical implementation, results, and future plans.

Big Data · Data Governance · Data Warehouse
0 likes · 23 min read
DevOps
Jul 22, 2022 · Fundamentals

Why Measure Software Architecture and Which Metrics to Use

The article explains the importance of measuring software architecture, outlines the granularity of metrics from code to infrastructure, and provides concrete measurement indicators for code implementation, component design, architecture design, and runtime infrastructure to guide effective architecture governance.

Metrics · Software Architecture · quality measurement
0 likes · 13 min read