Tagged articles
112 articles
Page 1 of 2
Linux Tech Enthusiast
Linux Tech Enthusiast
May 14, 2026 · Operations

9 Visual Guides to Linux Performance Tuning Tools

The article presents nine diagrams that illustrate Linux performance tooling categories—including observability, static analysis, benchmarking, tuning, sar, perf-tools, tracing, and BPF tools—providing a quick visual reference for system engineers.

BPFBenchmarkingLinux
0 likes · 2 min read
9 Visual Guides to Linux Performance Tuning Tools
AI Engineer Programming
AI Engineer Programming
May 4, 2026 · Artificial Intelligence

RAG in the Long-Context Era: Challenges, Benchmarks, and Context Engineering

The article analyzes how expanding LLM context windows to millions of tokens reshape Retrieval‑Augmented Generation, detailing chunking trade‑offs, embedding retrieval limits, attention U‑shaped distribution, benchmark results, and the emerging practice of Context Engineering for optimal end‑to‑end pipelines.

BenchmarkingEmbedding RetrievalLLM
0 likes · 10 min read
RAG in the Long-Context Era: Challenges, Benchmarks, and Context Engineering
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
May 1, 2026 · Artificial Intelligence

Agentic Harness Engineering Enables Agents to Self‑Evolve and Outperform Codex in 10 Rounds

The Agentic Harness Engineering (AHE) framework lets coding agents automatically read massive execution traces, identify failure patterns, and iteratively modify harness components—prompt, tools, middleware, and memory—achieving a pass@1 increase from 69.7% to 77.0% and surpassing human‑tuned Codex‑CLI after ten automated evolution rounds.

Agentic Harness EngineeringBenchmarkingObservability
0 likes · 9 min read
Agentic Harness Engineering Enables Agents to Self‑Evolve and Outperform Codex in 10 Rounds
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
Apr 25, 2026 · Artificial Intelligence

GPT-5.5 Arrives: Faster, Stronger, Costlier—Nvidia Engineer Says Losing Access Feels Like Amputation

GPT-5.5, co‑designed with Nvidia hardware, breaks the traditional scaling‑law trade‑off by delivering higher intelligence while keeping token latency similar, achieves over 20% faster token generation, outperforms competitors across coding, knowledge‑work, and math benchmarks, and even proves new Ramsey‑number results verified by Lean.

BenchmarkingCodexGPT-5.5
0 likes · 11 min read
GPT-5.5 Arrives: Faster, Stronger, Costlier—Nvidia Engineer Says Losing Access Feels Like Amputation
Data Party THU
Data Party THU
Apr 24, 2026 · Artificial Intelligence

OpenAI Unveils GPT‑Rosalind: A New AI Model for Accelerating Life‑Science Research

OpenAI introduced GPT‑Rosalind, a purpose‑built reasoning model for biology, drug discovery and translational medicine that streamlines evidence synthesis, hypothesis generation and experiment planning, and demonstrates leading performance on benchmarks such as BixBench and LABBench2 while offering free plugins that connect to over fifty scientific tools and data sources.

BenchmarkingBixBenchGPT‑Rosalind
0 likes · 8 min read
OpenAI Unveils GPT‑Rosalind: A New AI Model for Accelerating Life‑Science Research
AI Engineer Programming
AI Engineer Programming
Apr 23, 2026 · Artificial Intelligence

From Zero to One: A Roadmap for Building Trustworthy AI Agent Evaluations

The article outlines why rigorous, automated evaluation is essential for AI agents, defines core concepts such as tasks, trials, graders, and frameworks, compares code‑based, model‑based and human graders, and presents an eight‑step roadmap—from early testing to open‑source maintenance—to create reliable, scalable agent assessments.

AI agentsAutomated TestingBenchmarking
0 likes · 22 min read
From Zero to One: A Roadmap for Building Trustworthy AI Agent Evaluations
Qborfy AI
Qborfy AI
Apr 20, 2026 · Artificial Intelligence

How Harness Engineering Lifted LangChain Agents into the Top 5 on Terminal Bench 2.0

LangChain’s Harness Engineering framework tuned system prompts, tool selection, and middleware to turn a rank‑30 programming agent into a top‑5 performer on Terminal Bench 2.0, using trace‑driven analysis, inference‑sandwich scheduling, and context engineering without changing the underlying model.

AI agentsBenchmarkingHarness Engineering
0 likes · 12 min read
How Harness Engineering Lifted LangChain Agents into the Top 5 on Terminal Bench 2.0
SuanNi
SuanNi
Apr 18, 2026 · Artificial Intelligence

How GPT‑Rosalind Is Accelerating Drug Discovery with AI

OpenAI's GPT‑Rosalind model, designed for chemistry and genomics, demonstrates superior performance on scientific benchmarks, outperforms human experts, offers a rich plugin ecosystem, and implements strict access controls to help accelerate early-stage drug research while ensuring responsible AI use in life sciences.

AI GovernanceBenchmarkingLife Sciences
0 likes · 10 min read
How GPT‑Rosalind Is Accelerating Drug Discovery with AI
Architect's Must-Have
Architect's Must-Have
Apr 18, 2026 · Artificial Intelligence

Claude Opus 4.7 Unpacked: Engineering Boost, Vision Leap, and Safety Test

Claude Opus 4.7, Anthropic’s latest publicly released model, extends engineering intelligence with autonomous verification loops, upgrades visual resolution three‑fold, introduces layered safety deployment and new API controls, while benchmarked against GPT‑5.4 and Gemini 3.1, delivering record SWE‑bench scores and detailed real‑world security evaluations.

AI SafetyAPI featuresBenchmarking
0 likes · 36 min read
Claude Opus 4.7 Unpacked: Engineering Boost, Vision Leap, and Safety Test
AI Engineer Programming
AI Engineer Programming
Apr 16, 2026 · Artificial Intelligence

Choosing the Right LLM: A Complete Guide to Selecting from Over 2 Million Models

With more than two million LLMs available, this guide explains how to evaluate functional capabilities, latency, throughput, cost, tool‑calling reliability, context‑window size and compliance, and presents a step‑by‑step framework for picking the most suitable model for each business scenario.

BenchmarkingContext WindowCost Optimization
0 likes · 25 min read
Choosing the Right LLM: A Complete Guide to Selecting from Over 2 Million Models
SuanNi
SuanNi
Apr 13, 2026 · Artificial Intelligence

How AI Researchers Built a 400% Better Multimodal Memory System with AutoResearchClaw

A fully automated AI research pipeline called AutoResearchClaw enabled a team from top universities to redesign a multimodal memory architecture, OMNIMEM, achieving over 400% performance gains on LoCoMo and Mem‑Gallery benchmarks by iteratively fixing code bugs, restructuring the system, and optimizing retrieval strategies.

AI research automationAutoResearchClawBenchmarking
0 likes · 12 min read
How AI Researchers Built a 400% Better Multimodal Memory System with AutoResearchClaw
SuanNi
SuanNi
Apr 12, 2026 · Artificial Intelligence

How TDM‑R1 Achieves 4‑Step Image Generation that Beats 80‑Step Models

Researchers from HKUST, CUHK and XiaoHongShu introduced TDM‑R1, a reinforcement‑learning‑based method that enables 4‑step diffusion image generation to surpass 80‑step models in speed, fidelity, and complex instruction adherence, as demonstrated on the GenEval benchmark and multiple quality metrics.

AI image synthesisBenchmarkingdiffusion models
0 likes · 9 min read
How TDM‑R1 Achieves 4‑Step Image Generation that Beats 80‑Step Models
Code Mala Tang
Code Mala Tang
Mar 28, 2026 · Artificial Intelligence

How MiniMax M2.7 Achieves SOTA Agent Performance Through Self‑Evolving Loops

MiniMax M2.7 is a self‑evolving LLM that combines a persistent Agent Harness, multi‑level memory, and autonomous improvement cycles to reach SOTA benchmark scores, cost efficiency, and real‑world software‑engineering capabilities, illustrating the emerging skill‑economy of agent ecosystems.

Agent ArchitectureBenchmarkingSelf-Improving Models
0 likes · 13 min read
How MiniMax M2.7 Achieves SOTA Agent Performance Through Self‑Evolving Loops
PaperAgent
PaperAgent
Mar 26, 2026 · Artificial Intelligence

TurboQuant: How Google’s New Vector Quantization Cuts KV Memory 6× and Boosts Speed

TurboQuant, presented at ICLR 2026, introduces a theoretically grounded vector quantization technique that reduces large‑language‑model key‑value cache memory by at least six times, achieves up to eight‑fold speedups, and maintains zero accuracy loss by combining PolarQuant’s polar‑coordinate compression with a 1‑bit QJL error‑correction step, as demonstrated on benchmarks such as LongBench and GloVe.

AI inferenceBenchmarkingTurboQuant
0 likes · 10 min read
TurboQuant: How Google’s New Vector Quantization Cuts KV Memory 6× and Boosts Speed
DataFunTalk
DataFunTalk
Mar 16, 2026 · Artificial Intelligence

Unlocking Anthropic’s Skill‑Creator: New Evaluation, Benchmarking, and Parallel Testing Features

The article explains Anthropic’s latest Skill‑Creator update, which adds an evaluation system, benchmark testing, parallel agent execution, and description optimization, and demonstrates how these capabilities dramatically improve skill reliability, trigger accuracy, and overall performance through concrete examples and quantitative results.

AI agentsAnthropicBenchmarking
0 likes · 13 min read
Unlocking Anthropic’s Skill‑Creator: New Evaluation, Benchmarking, and Parallel Testing Features
Data Party THU
Data Party THU
Mar 12, 2026 · Artificial Intelligence

Can a 30B LLM Truly Conduct Autonomous Scientific Research? Inside UniScientist

UniScientist, a 30‑billion‑parameter open‑source model from UniPat AI, demonstrates a closed‑loop scientific research workflow—generating hypotheses, gathering evidence, performing reproducible derivations, and iteratively refining conclusions—while achieving benchmark scores comparable to much larger proprietary systems across multiple scientific evaluation suites.

Benchmarkinglarge language modelscientific research
0 likes · 10 min read
Can a 30B LLM Truly Conduct Autonomous Scientific Research? Inside UniScientist
AI Engineering
AI Engineering
Mar 6, 2026 · Artificial Intelligence

Anthropic Adds a Full Evaluation Framework to Skill Creator

Anthropic's latest Skill Creator update introduces a code‑free evaluation framework that lets non‑engineer skill authors run tests, benchmark regressions, and optimize trigger descriptions, while supporting parallel multi‑agent execution and A/B comparisons to keep skills reliable as models evolve.

AI EvaluationAnthropicBenchmarking
0 likes · 8 min read
Anthropic Adds a Full Evaluation Framework to Skill Creator
Woodpecker Software Testing
Woodpecker Software Testing
Mar 1, 2026 · Artificial Intelligence

Optimizing RAG System Performance: A Practical Testing Guide

The article presents a systematic framework for testing and optimizing Retrieval‑Augmented Generation (RAG) systems, detailing performance‑sensitive bottlenecks, a three‑dimensional test matrix, real‑world case studies, and test‑driven engineering practices to ensure stable, fast, and accurate AI services.

AIBenchmarkingObservability
0 likes · 9 min read
Optimizing RAG System Performance: A Practical Testing Guide
Java Tech Enthusiast
Java Tech Enthusiast
Feb 4, 2026 · Artificial Intelligence

Claude Sonnet 5 (Fennec) – The Next‑Gen Coding LLM Set to Outperform All Rivals

Claude Sonnet 5, codenamed Fennec, is about to launch on Google’s infrastructure with a 1‑million‑token context window, pricing half of Opus 4.5, and benchmark scores surpassing 80.9% on SWE‑Bench, while introducing an autonomous “Dev Team” swarm that can generate, test, and deliver full software modules without human intervention.

Benchmarkingmodel releasemulti-agent systems
0 likes · 9 min read
Claude Sonnet 5 (Fennec) – The Next‑Gen Coding LLM Set to Outperform All Rivals
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
Jan 20, 2026 · Artificial Intelligence

How LoongFlow Enables Expert‑Level AI Agents to Outperform Human Mathematicians

LoongFlow is an open‑source AI agent framework that combines a Plan‑Execute‑Summarize (PES) paradigm with a Hybrid Evolutionary Memory system, allowing agents to perform directed, iterative problem solving and achieve state‑of‑the‑art results on mathematical challenges, Kaggle‑style benchmarks, and real‑world tasks with dramatically higher efficiency.

BenchmarkingEvolutionary AlgorithmsLoongFlow
0 likes · 15 min read
How LoongFlow Enables Expert‑Level AI Agents to Outperform Human Mathematicians
NetEase LeiHuo Testing Center
NetEase LeiHuo Testing Center
Dec 12, 2025 · User Experience Design

Unlock Better Game UX: A Lazy‑Person’s Guide to QA, Personas, and Benchmarking

This article presents a step‑by‑step, low‑effort framework for improving game product quality by combining QA‑focused requirement analysis, detailed user personas, role‑mapping, systematic benchmark research, and realistic scenario validation, enabling designers to uncover hidden pain points, prioritize core user needs, and generate actionable design insights without excessive overhead.

BenchmarkingQAScenario Testing
0 likes · 32 min read
Unlock Better Game UX: A Lazy‑Person’s Guide to QA, Personas, and Benchmarking
Bighead's Algorithm Notes
Bighead's Algorithm Notes
Dec 5, 2025 · Artificial Intelligence

Quantitative Finance Paper Summaries (Nov 29–Dec 5 2025)

This article presents concise summaries of five recent AI‑driven finance papers, covering a stress‑testing framework for LLM trading agents, an orchestration framework for financial agents, an event‑reflection memory model for stock forecasting, a hybrid LLM‑Bayesian network architecture for options wheel strategies, and their experimental results.

BenchmarkingFinancial AILLM
0 likes · 12 min read
Quantitative Finance Paper Summaries (Nov 29–Dec 5 2025)
JD Tech Talk
JD Tech Talk
Nov 3, 2025 · Artificial Intelligence

How JoyCode Agent Achieves 74.6% Pass@1 on SWE‑bench Verified with Patch‑Test Co‑generation

JoyCode Agent reaches a 74.6% pass rate on the authoritative SWE‑bench Verified benchmark, ranking in the global top‑3, and is now open‑source, showcasing a high‑efficiency, test‑driven, iterative approach to automated code repair that dramatically reduces token consumption while improving success rates.

Automated Code RepairBenchmarkingSWE-bench
0 likes · 44 min read
How JoyCode Agent Achieves 74.6% Pass@1 on SWE‑bench Verified with Patch‑Test Co‑generation
Data Party THU
Data Party THU
Oct 24, 2025 · Artificial Intelligence

How 78 Samples Outperform 10,000: The LIMI Breakthrough in Agent AI

The paper introduces the LIMI framework, which achieves state‑of‑the‑art agent performance on AgencyBench using only 78 carefully crafted samples—outperforming baseline models trained on thousands of examples—by focusing on high‑quality, strategic data construction and demonstrating superior generalization across code, research, and tool‑use tasks.

AgencyBenchAgent AIBenchmarking
0 likes · 11 min read
How 78 Samples Outperform 10,000: The LIMI Breakthrough in Agent AI
Meituan Technology Team
Meituan Technology Team
Oct 15, 2025 · Artificial Intelligence

What’s New in Large Model Research? Top Meituan AI Papers Up to Oct 2025

This curated list showcases Meituan’s latest large‑model breakthroughs and academic papers up to October 2025, spanning LLM system optimizations, multimodal generation, evaluation benchmarks, quantization techniques, and reinforcement‑learning‑driven improvements, offering researchers valuable insights and resources across the AI landscape.

AI researchBenchmarkingMultimodal AI
0 likes · 10 min read
What’s New in Large Model Research? Top Meituan AI Papers Up to Oct 2025
AI2ML AI to Machine Learning
AI2ML AI to Machine Learning
Sep 24, 2025 · Artificial Intelligence

Key Points for Evaluating AI Agents

The article explains how Coze's Compass introduces a flexible evaluation system for AI agents, outlines a four‑dimensional submodule assessment (planning, tool use, self‑reflection, memory), and details specific testing criteria and challenges for web, scientific, dialogue, and programming agents.

AI agentsBenchmarkingCoze
0 likes · 6 min read
Key Points for Evaluating AI Agents
Alibaba Cloud Developer
Alibaba Cloud Developer
Jun 23, 2025 · Artificial Intelligence

How to Systematically Conduct Large Model Evaluation in Real-World Scenarios

This guide walks readers through a complete, business‑oriented workflow for evaluating large language models—from requirement analysis and test‑set design to metric definition, execution, result aggregation, and report generation—while addressing common challenges such as data imbalance, annotation quality, and automation.

AI EvaluationBenchmarkingReporting
0 likes · 24 min read
How to Systematically Conduct Large Model Evaluation in Real-World Scenarios
Architects' Tech Alliance
Architects' Tech Alliance
May 20, 2025 · Industry Insights

What Do GPU Core Specs Really Mean? A Deep Dive into Modern GPU Performance

This article provides a comprehensive analysis of GPU core parameters—including compute units, memory systems, floating‑point performance, power consumption, and manufacturing process—while comparing leading international and domestic GPU products to help readers choose the right accelerator for AI, HPC, or graphics workloads.

AIBenchmarkingGPU
0 likes · 19 min read
What Do GPU Core Specs Really Mean? A Deep Dive into Modern GPU Performance
FunTester
FunTester
Apr 14, 2025 · Backend Development

Common Mistakes in Go Unit Testing and How to Avoid Them

This article examines nine frequent errors developers make when writing Go unit tests—such as improper test classification, neglecting the race detector, ignoring parallel and shuffle flags, avoiding table‑driven tests, using sleep, mishandling time APIs, overlooking httptest/iotest, misusing benchmarks, and skipping fuzz testing—providing analysis and concrete code‑based solutions to improve test reliability and efficiency.

BenchmarkingGoconcurrency
0 likes · 11 min read
Common Mistakes in Go Unit Testing and How to Avoid Them
Python Programming Learning Circle
Python Programming Learning Circle
Apr 9, 2025 · Fundamentals

Python Code Optimization Techniques for Faster Execution

This article presents a comprehensive collection of Python performance‑boosting techniques, covering fundamental optimization principles, avoiding global variables and attribute access, eliminating unnecessary abstraction and data copying, loop optimizations, just‑in‑time compilation with numba, and selecting appropriate built‑in data structures to achieve significant speed improvements.

BenchmarkingPythonbest-practices
0 likes · 15 min read
Python Code Optimization Techniques for Faster Execution
Linux Code Review Hub
Linux Code Review Hub
Apr 5, 2025 · Operations

Inside Linux Perf: How the Kernel’s Powerful Tracing Tool Works

The article introduces Linux’s built‑in performance analysis tool perf, explains its event‑driven sampling, tracing and profiling capabilities, shows how to install it on various distributions, demonstrates common commands with real code examples, and discusses practical scenarios for locating and optimizing kernel and application performance issues.

BenchmarkingSystem optimizationflamegraph
0 likes · 36 min read
Inside Linux Perf: How the Kernel’s Powerful Tracing Tool Works
Su San Talks Tech
Su San Talks Tech
Mar 20, 2025 · Backend Development

How to Crush the One Billion Row Challenge: Java Performance Secrets Revealed

This article walks through the One Billion Row Challenge, explaining the problem, baseline Java solution, and a series of deep performance optimizations—from parallel streams and custom hash tables to unsafe memory access and SIMD techniques—that shrink execution time from minutes to under two seconds.

BenchmarkingJavaLarge Data Processing
0 likes · 21 min read
How to Crush the One Billion Row Challenge: Java Performance Secrets Revealed
Liangxu Linux
Liangxu Linux
Jan 14, 2025 · Fundamentals

How to Measure Execution Time in C with time(), clock() and gettimeofday()

This guide shows how to benchmark C code by measuring elapsed time using the standard time() function for second‑level precision, clock() for higher CPU‑time accuracy, and gettimeofday() for microsecond‑level resolution, including complete example programs and key considerations.

BenchmarkingCclock
0 likes · 4 min read
How to Measure Execution Time in C with time(), clock() and gettimeofday()
Baobao Algorithm Notes
Baobao Algorithm Notes
Jan 11, 2025 · Artificial Intelligence

Why Phi‑4’s 14B Model Outperforms GPT‑4 on STEM and Reasoning Tasks

Microsoft Research’s Phi‑4 model, a 14‑billion‑parameter LLM, leverages extensive synthetic data, advanced tokenization, and a two‑stage training pipeline to achieve superior performance on STEM question answering, long‑context reasoning, and safety benchmarks, rivaling larger models like GPT‑4.

AI SafetyBenchmarkingPhi-4
0 likes · 15 min read
Why Phi‑4’s 14B Model Outperforms GPT‑4 on STEM and Reasoning Tasks
FunTester
FunTester
Oct 14, 2024 · Backend Development

Mastering Go Benchmarking: A Practical Guide to Performance Testing

This article introduces Go's benchmarking framework, explains its purpose and best practices, provides step‑by‑step code examples for measuring string concatenation performance, shows how to run benchmarks from the command line, and teaches how to interpret the detailed test reports.

BackendBenchmarkingCode Optimization
0 likes · 11 min read
Mastering Go Benchmarking: A Practical Guide to Performance Testing
Test Development Learning Exchange
Test Development Learning Exchange
Oct 11, 2024 · Fundamentals

Fundamentals of Performance Testing: Concepts, Metrics, Tools, and Best Practices

This article provides a comprehensive overview of performance testing fundamentals, covering core concepts, key metrics, common testing tools, test design, load generation, result analysis, bottleneck identification, optimization techniques, cloud and micro‑service testing, monitoring, reporting, challenges, and cost‑benefit considerations.

BenchmarkingLoad TestingPerformance Testing
0 likes · 12 min read
Fundamentals of Performance Testing: Concepts, Metrics, Tools, and Best Practices
NewBeeNLP
NewBeeNLP
Oct 11, 2024 · Artificial Intelligence

Inside Llama 3: Training, Architecture, and Performance Secrets

An extensive review of Meta’s Llama 3 model breaks down its pre‑training data pipeline, scaling laws, architectural tweaks like GQA and RoPE, post‑training methods such as SFT, DPO, and reward modeling, and evaluates benchmark results, offering practical insights for researchers and engineers building large language models.

BenchmarkingLlama 3large language models
0 likes · 32 min read
Inside Llama 3: Training, Architecture, and Performance Secrets
21CTO
21CTO
Aug 26, 2024 · Fundamentals

Which Programming Languages Use the Least Power? Findings from a 2017 Study

A 2017 study by six Portuguese researchers compared the energy consumption, execution time, and memory usage of 27 programming languages across ten benchmark problems, revealing that faster languages aren't always more energy‑efficient and that compiled languages generally outperform interpreted ones in both speed and power usage.

Benchmarkingcompiled languagesenergy efficiency
0 likes · 9 min read
Which Programming Languages Use the Least Power? Findings from a 2017 Study
NewBeeNLP
NewBeeNLP
Jul 31, 2024 · Artificial Intelligence

How Continual Pre‑Training Boosts Llama‑3’s Chinese and Scientific Reasoning

This report presents a continual pre‑training approach that significantly enhances Llama‑3 (8B)’s Chinese language proficiency and scientific reasoning by using a carefully mixed corpus of existing and synthetic data, detailing the bilingual adaptation and synthetic‑enhancement stages, data‑mixing and curriculum strategies, and demonstrating strong results across multilingual and scientific benchmarks without sacrificing original capabilities.

BenchmarkingLLMLlama-3
0 likes · 9 min read
How Continual Pre‑Training Boosts Llama‑3’s Chinese and Scientific Reasoning
Python Programming Learning Circle
Python Programming Learning Circle
Jul 17, 2024 · Fundamentals

Simple Techniques to Speed Up Python For Loops by Up to 970×

This article demonstrates a collection of straightforward Python performance tricks—such as list comprehensions, external length calculation, set usage, loop skipping, code inlining, generators, map(), memoization, vectorization, filterfalse, and string joining—that together can accelerate for‑loops from modest 1.3× gains to dramatic 970× speed‑ups, with detailed benchmark results and code examples.

BenchmarkingLoopscode
0 likes · 15 min read
Simple Techniques to Speed Up Python For Loops by Up to 970×
FunTester
FunTester
Apr 15, 2024 · Fundamentals

Using JMH to Benchmark GUID Generation Strategies in Java

This article introduces JMH, explains its key features, and presents microbenchmark results comparing thread‑exclusive, thread‑shared, Snowflake, UUID, and Snowflake‑algorithm GUID generation methods under various thread counts, accompanied by the full Java test code.

BenchmarkingGUIDJMH
0 likes · 8 min read
Using JMH to Benchmark GUID Generation Strategies in Java
Architects' Tech Alliance
Architects' Tech Alliance
Feb 6, 2024 · Industry Insights

How to Evaluate Data Center Compute Power: From Supercomputer Benchmarks to PUE

This article explains the concept of data‑center compute power, reviews mature evaluation methods such as TOP500/FLOPS for supercomputers and SPEC CPU, SPECpower, and MLPerf for conventional servers, introduces the PUE efficiency metric, and summarizes the four core components that together define a data‑center's computing capability.

BenchmarkingData centerHPC
0 likes · 9 min read
How to Evaluate Data Center Compute Power: From Supercomputer Benchmarks to PUE
ITPUB
ITPUB
Jul 31, 2023 · Databases

How to Choose the Right Database: Key Steps for Successful Selection

This guide walks you through the essential stages of database selection—from assessing project requirements and comparing candidate systems to performance testing, long‑term impact analysis, and making the final decision—ensuring you pick a solution that fits both current and future needs.

BenchmarkingNoSQLSQL
0 likes · 10 min read
How to Choose the Right Database: Key Steps for Successful Selection
StarRocks
StarRocks
Apr 23, 2023 · Databases

Why Query Performance Optimization Matters and How to Master It

This guide explains the importance of query performance optimization for database products and engineers, outlines latency and throughput goals, shows how to locate bottlenecks with observability tools and Linux profilers, and provides practical high‑level and low‑level optimization techniques along with testing best practices.

BenchmarkingCPU profilingStarRocks
0 likes · 16 min read
Why Query Performance Optimization Matters and How to Master It
Ziru Technology
Ziru Technology
Mar 31, 2023 · Backend Development

Master Java Performance Testing with JMH: From Basics to Advanced Benchmarks

This article explains what benchmarking is, introduces the JMH framework for Java, shows how to add JMH dependencies, walks through a simple "Hello JMH" example with full source and output, demonstrates using JMH with Spring Boot, details the most important options and annotations, and highlights common pitfalls to avoid when writing reliable micro‑benchmarks.

BenchmarkingJMHJava
0 likes · 17 min read
Master Java Performance Testing with JMH: From Basics to Advanced Benchmarks
Architect's Tech Stack
Architect's Tech Stack
Feb 16, 2023 · Backend Development

A Comprehensive Guide to Java Microbenchmarking with JMH

This article introduces Java Microbenchmark Harness (JMH), explains why warm‑up is necessary, details common annotations, shows how to set up a Maven project, provides a complete benchmark example comparing LinkedList iteration methods, and demonstrates how to run and interpret the results.

BenchmarkingJMHJava
0 likes · 13 min read
A Comprehensive Guide to Java Microbenchmarking with JMH
Top Architect
Top Architect
Nov 19, 2022 · Operations

Guidelines for Sizing and Benchmarking Elasticsearch Clusters

This article provides a comprehensive guide on allocating hardware resources, calculating cluster size based on data volume, and conducting index and search benchmark tests for Elasticsearch, offering practical formulas, test configurations, and performance conclusions to help engineers design stable, high‑throughput clusters.

BenchmarkingCluster SizingPerformance Testing
0 likes · 12 min read
Guidelines for Sizing and Benchmarking Elasticsearch Clusters
Architecture Digest
Architecture Digest
Oct 21, 2022 · Operations

Benchmarking and Sizing Your Elasticsearch Cluster for Logs and Metrics

This article explains how to assess hardware resources, calculate required Elasticsearch cluster size based on data volume, and perform indexing and search benchmark tests to ensure stable performance and optimal throughput for log and metric workloads in production environments.

BenchmarkingCluster SizingElasticsearch
0 likes · 10 min read
Benchmarking and Sizing Your Elasticsearch Cluster for Logs and Metrics
Top Architect
Top Architect
Oct 9, 2022 · Backend Development

JMH – Java Microbenchmark Harness: Introduction, Demo, and Annotation Guide

This article introduces JMH, the official Java microbenchmarking tool, explains why warm‑up is needed, shows how to build a Maven project, provides a complete LinkedList iteration benchmark example, demonstrates common JMH annotations, and outlines how to run and interpret benchmark results.

BenchmarkingJMHJava
0 likes · 16 min read
JMH – Java Microbenchmark Harness: Introduction, Demo, and Annotation Guide
FunTester
FunTester
Jun 24, 2022 · Operations

Performance Testing Resource Collection

A comprehensive catalog of performance testing articles ranging from Linux monitoring tools and test frameworks to concurrency utilities, distributed load testing strategies, QPS modeling and language-specific benchmark studies, providing developers with practical insights and techniques for optimizing system performance.

BenchmarkingLoad TestingPerformance Testing
0 likes · 6 min read
Performance Testing Resource Collection
DataFunTalk
DataFunTalk
Jun 17, 2022 · Artificial Intelligence

Issues with Recommender System Benchmarks and Insights from the BARS Paper

This article examines the shortcomings of current recommender system benchmarks, explains why standardized datasets and metrics are essential, and highlights key findings from the recent BARS paper that propose a more open and reproducible benchmarking framework for recommendation research.

AIBARSBenchmarking
0 likes · 6 min read
Issues with Recommender System Benchmarks and Insights from the BARS Paper
MaGe Linux Operations
MaGe Linux Operations
Jun 3, 2022 · Backend Development

How Rewriting Hasura Storage in Go Boosted Performance Fivefold

The Hasura Storage team rewrote their Node.js service in Go, ran k6 benchmarks, and achieved up to five times more request handling, half the memory usage, and significantly lower response times across multiple download scenarios, demonstrating the scalability benefits of Go for backend services.

BackendBenchmarkingGolang
0 likes · 6 min read
How Rewriting Hasura Storage in Go Boosted Performance Fivefold
FunTester
FunTester
Feb 27, 2022 · Operations

Performance Testing Articles Collection (Chinese Resources)

This collection compiles dozens of Chinese articles on performance testing, covering tools, frameworks, case studies, and techniques such as netdata monitoring, load generators, concurrency utilities, distributed testing, QPS modeling, and comparisons of JMeter, K6, Gatling, and FunTester.

BenchmarkingLoad TestingOperations
0 likes · 8 min read
Performance Testing Articles Collection (Chinese Resources)
Top Architect
Top Architect
Feb 20, 2022 · Backend Development

Using JMH for Java Microbenchmarking: Demo, Annotations, and Best Practices

This article introduces Java Microbenchmark Harness (JMH), explains why warm‑up is needed, shows how to build a benchmark project with Maven, provides a complete LinkedList iteration benchmark example with all relevant JMH annotations, demonstrates execution commands, and interprets the resulting performance reports.

BenchmarkingJMHJava
0 likes · 13 min read
Using JMH for Java Microbenchmarking: Demo, Annotations, and Best Practices
Java Interview Crash Guide
Java Interview Crash Guide
Feb 18, 2022 · Backend Development

Master Java Microbenchmarking with JMH: From Setup to Results

This article explains how to use JMH for precise Java micro‑benchmarks, covering JVM warm‑up, project setup with Maven, writing benchmark methods, configuring annotations, running tests, interpreting results, and provides practical code examples and tips for reliable performance measurement.

BenchmarkingJMHJVM
0 likes · 13 min read
Master Java Microbenchmarking with JMH: From Setup to Results
Alimama Tech
Alimama Tech
Feb 16, 2022 · Big Data

Target Group Discovery: Framework, Models, and Case Study

The article presents a target‑group discovery framework that combines goal definition, rule‑or model‑based segmentation, tiered metrics, benchmarking and quadrant analysis to identify and characterize advantageous, problematic, or weak consumer, product, or merchant sub‑groups, illustrated by a FMCG e‑commerce case study diagnosing high‑share, low‑growth categories.

BenchmarkingBig Datadata segmentation
0 likes · 13 min read
Target Group Discovery: Framework, Models, and Case Study
Python Programming Learning Circle
Python Programming Learning Circle
Dec 1, 2021 · Fundamentals

The Fastest Way to Loop in Python: Using Built‑in Functions and Formulas Instead of While/For Loops

This article benchmarks Python while and for loops, shows that for loops are faster due to fewer operations, demonstrates how built‑in functions like sum and direct arithmetic formulas can achieve orders‑of‑magnitude speedups, and concludes that the quickest way to "loop" in Python is to avoid loops altogether.

Algorithmic EfficiencyBenchmarkingPython
0 likes · 8 min read
The Fastest Way to Loop in Python: Using Built‑in Functions and Formulas Instead of While/For Loops
Python Programming Learning Circle
Python Programming Learning Circle
Aug 11, 2021 · Databases

Generating One Billion SQLite Rows in Under a Minute: Python, PyPy, and Rust Performance Comparison

A programmer needed to create a billion‑row SQLite test database within a minute, found a naïve Python script unbearably slow, applied batch inserts and SQLite PRAGMA tweaks, then compared CPython, PyPy, and Rust implementations, ultimately achieving sub‑minute runtimes with Rust and highlighting best‑practice optimizations.

BenchmarkingPyPyPython
0 likes · 6 min read
Generating One Billion SQLite Rows in Under a Minute: Python, PyPy, and Rust Performance Comparison
Python Programming Learning Circle
Python Programming Learning Circle
Aug 9, 2021 · Backend Development

How to Choose the Fastest JSON Library for Python: A Practical Benchmarking Guide

This article explains a systematic process for evaluating and selecting the most suitable high‑performance JSON library for Python, covering the need assessment, benchmark definition, filtering by additional requirements, and detailed benchmark results that highlight orjson as the fastest option for small‑message encoding while discussing trade‑offs such as safety, customizability, and ecosystem support.

BenchmarkingJSONPython
0 likes · 5 min read
How to Choose the Fastest JSON Library for Python: A Practical Benchmarking Guide
FunTester
FunTester
Jul 27, 2021 · Operations

How I Boosted FunTester QPS by 14% and Halved Memory Usage

After a weekend of code refactoring, asynchronous processing, and removing unnecessary statistics, the author increased FunTester's QPS from 104,375 to 118,904 (≈13.9% gain), reduced memory consumption by over 57%, and documented detailed performance impacts of various optimizations with code samples and benchmark tables.

BenchmarkingFunTesterJava
0 likes · 13 min read
How I Boosted FunTester QPS by 14% and Halved Memory Usage
Python Programming Learning Circle
Python Programming Learning Circle
May 7, 2021 · Fundamentals

Why Python Is Perceived as Slow and How to Make It Faster

The article explains that Python’s reputation for slowness stems more from algorithmic choices and costly import patterns than the language itself, and it offers practical measurements, tooling insights, and optimization suggestions to improve Python’s performance in real‑world projects.

BenchmarkingToolingimports
0 likes · 8 min read
Why Python Is Perceived as Slow and How to Make It Faster
FunTester
FunTester
Apr 6, 2021 · Operations

How Thread Count, Request Volume, and Latency Shape Performance Test Accuracy

This article presents a systematic analysis of how thread numbers, request counts, response times, and response‑time variance affect performance‑testing error rates, using a Java demo that simulates single‑API calls and reports detailed JSON metrics for each configuration.

BenchmarkingJavaLoad Testing
0 likes · 16 min read
How Thread Count, Request Volume, and Latency Shape Performance Test Accuracy
Senior Brother's Insights
Senior Brother's Insights
Jan 6, 2021 · Backend Development

Master Java Microbenchmarking with JMH: A Hands‑On Guide

This article introduces JMH, the Java Microbenchmark Harness, explains why traditional main‑method benchmarks are unreliable, and provides step‑by‑step instructions, code examples, and best‑practice annotations for accurately measuring method‑level performance in Java applications.

Backend DevelopmentBenchmarkingJIT
0 likes · 20 min read
Master Java Microbenchmarking with JMH: A Hands‑On Guide
MaGe Linux Operations
MaGe Linux Operations
Sep 1, 2020 · Fundamentals

Boost Python Speed: 10 Proven Tricks to Accelerate Your Code

This article presents practical Python performance‑boosting techniques—including avoiding global variables, minimizing attribute access, eliminating unnecessary abstractions, reducing data copies, optimizing loops, leveraging built‑in functions, and using tools like numba—each illustrated with before‑and‑after code snippets and measured speed improvements.

BenchmarkingData Structurescode
0 likes · 19 min read
Boost Python Speed: 10 Proven Tricks to Accelerate Your Code
Ops Development Stories
Ops Development Stories
Aug 25, 2020 · Operations

ESrally Guide: Install, Configure, and Benchmark Elasticsearch Performance

ESrally is the official Elasticsearch benchmarking tool; this guide walks through its installation prerequisites, step‑by‑step setup of Python, JDK, and Git, configuration of tracks, cars, pipelines, and challenges, and demonstrates real‑world performance comparisons across Elasticsearch versions and hardware platforms.

BenchmarkingESrallyElasticsearch
0 likes · 16 min read
ESrally Guide: Install, Configure, and Benchmark Elasticsearch Performance
DevOps Coach
DevOps Coach
Aug 13, 2020 · Databases

How to Benchmark Elasticsearch Clusters with Rally: A Step‑by‑Step Guide

This article explains why large‑scale Elasticsearch deployments need rigorous performance testing, compares available testing tools, walks through installing and configuring the official Rally benchmark suite, details hardware recommendations, shows how to run tests against multiple cloud providers, and teaches you how to interpret the resulting metrics to make informed cluster‑selection decisions.

BenchmarkingElasticsearchPerformance Testing
0 likes · 16 min read
How to Benchmark Elasticsearch Clusters with Rally: A Step‑by‑Step Guide
MaGe Linux Operations
MaGe Linux Operations
Jul 2, 2020 · Databases

Mastering MySQL Benchmarking with Sysbench: A Complete Guide

This article explains MySQL benchmark testing, the role and metrics of benchmarking, introduces sysbench, provides step‑by‑step installation and usage instructions, and offers practical tips for interpreting results and optimizing database performance in production environments.

BenchmarkingPerformance TestingSysbench
0 likes · 11 min read
Mastering MySQL Benchmarking with Sysbench: A Complete Guide
Sohu Tech Products
Sohu Tech Products
Jun 10, 2020 · Fundamentals

Using JMH for Java Microbenchmarking: A Comprehensive Guide

This article introduces JMH, explains how to add dependencies, write and run microbenchmarks for string concatenation, describes key annotations, highlights common pitfalls, and shows how to package, visualize, and integrate JMH benchmarks within Java projects.

BenchmarkingJMHJVM
0 likes · 14 min read
Using JMH for Java Microbenchmarking: A Comprehensive Guide
Python Programming Learning Circle
Python Programming Learning Circle
May 6, 2020 · Fundamentals

Python Performance Testing: Comparing Logical Operations, Built‑in Functions, and Class Implementations

This article examines Python execution speed by benchmarking simple arithmetic, logical checks, built‑in functions like max, and three class‑based point‑in‑rectangle implementations, revealing that straightforward if‑statements are often fastest while highlighting practical tips for writing performant Python code.

BenchmarkingCode OptimizationPython
0 likes · 12 min read
Python Performance Testing: Comparing Logical Operations, Built‑in Functions, and Class Implementations
Xiao Lou's Tech Notes
Xiao Lou's Tech Notes
May 4, 2020 · Backend Development

Mastering JMH: Essential Java Microbenchmark Techniques for Accurate Performance Testing

JMH is a Java harness that enables precise, reproducible microbenchmarking through annotations, state management, threading, and profiling, and this guide walks through its core features—benchmark modes, state scopes, setup/teardown, fork control, blackholes, and advanced options—illustrated with sample code and results.

BenchmarkingJMHJava
0 likes · 18 min read
Mastering JMH: Essential Java Microbenchmark Techniques for Accurate Performance Testing
MaGe Linux Operations
MaGe Linux Operations
Jan 8, 2020 · Backend Development

Can Python Handle Millions of Requests per Second? Inside Japronto’s High‑Performance Microframework

This article explores how recent Python performance improvements and the ultra‑fast Japronto microframework enable handling over a million HTTP requests per second, detailing benchmark results, HTTP pipelining optimizations, C extensions, memory management tricks, and the underlying technologies that make such throughput possible.

AsyncBenchmarkingHTTP
0 likes · 9 min read
Can Python Handle Millions of Requests per Second? Inside Japronto’s High‑Performance Microframework
DevOps Coach
DevOps Coach
Aug 29, 2019 · Operations

Benchmark Your DevOps Performance with the 2019 Accelerate Report

This article walks you through the key findings of the 2019 Accelerate DevOps State of the Industry report, explains the four golden metrics, shows how to use Google’s minimal‑ist benchmark tool to compare your organization against industry baselines, and discusses the emerging service‑operations efficiency metric.

Accelerate ReportBenchmarkingDevOps
0 likes · 11 min read
Benchmark Your DevOps Performance with the 2019 Accelerate Report
Aikesheng Open Source Community
Aikesheng Open Source Community
Aug 19, 2019 · Databases

How Network Bandwidth Affects MySQL Performance

This article examines how network bandwidth, protocol compression, and SSL encryption impact MySQL Group Replication performance by running sysbench benchmarks on 1 Gb and 10 Gb networks, revealing bandwidth limits, CPU overhead, and scalability issues.

BenchmarkingSSLcompression
0 likes · 6 min read
How Network Bandwidth Affects MySQL Performance
MaGe Linux Operations
MaGe Linux Operations
Jul 12, 2019 · Fundamentals

Boost Python Performance: 24 Proven Techniques to Speed Up Code

This guide presents 24 practical methods—including timing measurements, faster data structures, loop optimizations, vectorization, and parallel processing—to dramatically accelerate Python code, each illustrated with clear before‑and‑after performance screenshots.

BenchmarkingParallelismProfiling
0 likes · 7 min read
Boost Python Performance: 24 Proven Techniques to Speed Up Code
Architects' Tech Alliance
Architects' Tech Alliance
Feb 22, 2019 · Operations

Performance Monitoring and Analysis in Large‑Scale Data Centers: Challenges and Practices

The article presents Alibaba's experience in large‑scale data‑center performance monitoring, describing the challenges of software and hardware upgrades, the SPEED platform’s estimation‑evaluation‑decision workflow, the RUE metric, and practical insights such as hyper‑threading effects, hardware heterogeneity, and Simpson’s paradox.

BenchmarkingHardware OptimizationJava
0 likes · 16 min read
Performance Monitoring and Analysis in Large‑Scale Data Centers: Challenges and Practices