Tagged articles
2769 articles
Sanyou's Java Diary
Jun 23, 2022 · Backend Development

Master MyBatis: From Basics to Advanced Features and Best Practices

This comprehensive guide explains what MyBatis is, compares it with Hibernate, details its core components, configuration, dynamic SQL, caching, plugins, pagination, and practical usage patterns, providing Java developers with clear examples, code snippets, and visual diagrams to master the framework efficiently.

Dynamic SQL, MyBatis, ORM
0 likes · 38 min read
Baidu Geek Talk
Jun 22, 2022 · Databases

Analyzing Go SQL Connection Pool Resource Release and Transaction Management

The article explains how Go's database/sql pool retains connections until Rows, Stmt, or Tx objects are explicitly closed or a timeout context is cancelled, demonstrates experiments showing blocked goroutines when resources aren't released, and offers practical guidelines—using QueryContext with timeouts, deferring Close/Cancel, and always committing or rolling back transactions—to prevent pool exhaustion and improve MySQL reliability.

Connection Pool, Go, database
0 likes · 19 min read
Qunar Tech Salon
Jun 22, 2022 · Operations

Design and Implementation of Multi‑Cluster HPA Metrics Collection, Analysis, and Reporting in Kubernetes

This article explains the background, benefits, and measurement criteria of the Kubernetes Horizontal Pod Autoscaler (HPA), describes the creation of metric tables and SQL queries for collecting scaling events and CPU usage, and presents a Python-based workflow that aggregates the data, stores daily reports, validates results, and sends automated email summaries.

HPA, Kubernetes, Operations
0 likes · 19 min read
JD Tech
Jun 20, 2022 · Backend Development

Analyzing MyBatis Query Process and Fixing a Pre‑3.4.5 foreach Variable Bug

This article examines a MyBatis bug present in versions prior to 3.4.5 by reproducing the issue, walking through the complete query execution flow from configuration parsing to result handling, identifying the root cause in foreach variable binding, and presenting official fixes and practical work‑arounds.

BugFix, MyBatis, ORM
0 likes · 19 min read
dbaplus Community
Jun 19, 2022 · Databases

Master MySQL Upgrades with pt-upgrade: Detect Compatibility Issues Seamlessly

This article explains why MySQL upgrades can cause syntax, semantic and performance incompatibilities, introduces the Percona Toolkit's pt-upgrade tool, details its detection logic and common usage patterns, provides a full demo with real query results, and lists essential parameters and precautions for safely migrating from MySQL 5.7 to 8.0.

compatibility testing, mysql, percona-toolkit
0 likes · 17 min read
dbaplus Community
Jun 18, 2022 · Operations

Zabbix vs Prometheus: Architecture, Pros, and super_exporter Integration

This article compares the open‑source monitoring systems Zabbix and Prometheus, detailing their architectures, component roles, strengths, and weaknesses, then describes how to integrate Zabbix data into Prometheus using a custom super_exporter and visualise the combined metrics with Grafana.

Grafana, Prometheus, Zabbix
0 likes · 14 min read
Su San Talks Tech
Jun 16, 2022 · Databases

Master MySQL Index Design: Essential Principles for Optimal Performance

This article explains MySQL index fundamentals, covering clustered and secondary indexes, best-practice design rules such as using auto-increment primary keys, avoiding large or low-cardinality columns, and respecting the left-most prefix, and common scenarios in which an index goes unused, with practical SQL examples.

index design, mysql, sql
0 likes · 12 min read
Top Architect
Jun 14, 2022 · Databases

Understanding MySQL Execution Process: Connectors, Cache, Parser, Optimizer, and Executor

This article explains MySQL's internal execution process, including the connector, cache, parser, optimizer, and executor. It describes privilege tables and verification, discusses the removal of the query cache, outlines the logical SQL execution order, and provides practical tips for ordering WHERE conditions to improve performance.

Database Internals, Query Optimizer, database
0 likes · 12 min read
Programmer DD
Jun 13, 2022 · Databases

Master DataGrip: Essential Tips for Efficient Database Management

This guide walks you through installing DataGrip, configuring database connections, customizing the IDE, and leveraging its powerful SQL editing, navigation, and data import/export features, providing a comprehensive overview for developers seeking a streamlined database client experience.

DataGrip, Database Management, Database client
0 likes · 12 min read
Java Architect Essentials
Jun 12, 2022 · Backend Development

MyBatis-Mate: Advanced Features and Usage Guide for MyBatis-Plus in Java Backend Development

This article introduces MyBatis-Mate, an official MyBatis-Plus extension that offers enterprise‑grade capabilities such as dictionary binding, field encryption, data masking, dynamic DDL generation, multi‑datasource sharding, data‑scope filtering, and SQL performance logging, complete with Maven dependency setup and code examples.

DataEncryption, MyBatis, ORM
0 likes · 12 min read
ITPUB
Jun 12, 2022 · Databases

10 Proven Ways to Prevent Dangerous SQL Mistakes in Production

This article presents ten practical techniques—ranging from avoiding chat‑tool SQL transmission to using logical deletes, backups, and temporary tables—to help developers and DBAs safely execute and modify data in production environments while minimizing accidental data loss.

Data Safety, best practices, mysql
0 likes · 14 min read
Java Backend Technology
Jun 11, 2022 · Information Security

How to Mask Sensitive Data in MySQL and Java with MyBatis‑Mate

This article demonstrates practical techniques for data masking, covering MySQL string functions for phone and ID masking, a Java library for various sensitive fields, and a MyBatis‑Mate extension that applies custom and built‑in masking strategies through annotations and REST endpoints.

MyBatis, java, sensitive data
0 likes · 8 min read
Zuoyebang Tech Team
Jun 7, 2022 · Big Data

How Doris Powered Zuoyebang’s Real‑Time Data Warehouse for Faster Insights

Zuoyebang’s data team replaced fragmented, slow query solutions with Apache Doris, building a unified real‑time data warehouse that dramatically cut query latency from hours to seconds, streamlined data modeling, and improved reliability across diverse business scenarios, while integrating with Flink, Kafka, and ES via a unified API.

Apache Doris, Elasticsearch, Flink
0 likes · 20 min read
Snowball Engineer Team
Jun 6, 2022 · Databases

Deep Dive into ClickHouse Join Implementation and Optimization Techniques

This article examines ClickHouse's join mechanisms, detailing the limitations of standard joins, the advantages of Global joins, and optimization strategies such as hash and merge joins, subquery filtering, and memory considerations, illustrated with SQL examples and source‑code analysis.

Distributed Systems, Hash Join, JOIN optimization
0 likes · 16 min read
Ops Development Stories
Jun 5, 2022 · Databases

What Really Happens Inside MySQL When You Run a SELECT Query?

This article walks through the complete lifecycle of a MySQL SELECT statement, from establishing the TCP connection and authentication, through query cache, parsing, optimization, and execution, detailing how the server and storage engine collaborate and explaining concepts like index usage, connection handling, and memory management.

Database Architecture, Index Optimization, Query Execution
0 likes · 20 min read
Big Data Technology & Architecture
Jun 2, 2022 · Operations

Common Operational, Data, and SQL Issues in Apache Doris – FAQs and Solutions

This article compiles frequently asked questions and detailed solutions covering Apache Doris operational problems, data handling errors, and SQL query issues, providing step‑by‑step guidance, configuration tips, and command examples to help administrators troubleshoot and maintain a stable Doris cluster.

Apache Doris, Configuration, Operations
0 likes · 28 min read
Ops Development Stories
Jun 1, 2022 · Databases

Master MySQL Indexes: From B+Tree Basics to Advanced Optimization

This article explains MySQL index fundamentals, covering B+Tree structures, various index types, creation syntax, and practical optimization techniques such as prefix, covering, and composite indexes, while illustrating how to avoid common pitfalls like index misuse and performance‑degrading scenarios.

B+Tree, Database Optimization, index
0 likes · 23 min read
Big Data Technology & Architecture
May 31, 2022 · Databases

Vectorization and Roaring Bitmap Techniques in Database Query Execution

This article explains how classic SQL execution engines use the volcano model and expression trees, discusses their performance drawbacks, introduces vectorized execution to reduce overhead, and describes Roaring Bitmap compression methods with container types for efficient storage and processing of integer sets.

Big Data, Database Engine, Operator Tree
0 likes · 10 min read
Big Data Technology & Architecture
May 30, 2022 · Big Data

Doris Architecture, Principles, and Key Features Overview

This article provides a comprehensive overview of Doris's architecture—including its FE and BE components, metadata management, data organization, execution planning—and details its major features such as adaptive join aggregation, vectorized execution, materialized views, and Elasticsearch integration, supplemented with example DDL and query code.

Big Data, Database Architecture, Elasticsearch
0 likes · 7 min read
Architect's Tech Stack
May 28, 2022 · Big Data

Data Lake Challenges and the Open SPL Computing Engine

The article examines the inherent trade‑offs of data lakes—maintaining raw data, enabling efficient computation, and keeping costs low—explains why traditional data‑warehouse approaches fall short, and introduces the open‑source SPL engine that provides multi‑source, file‑based, high‑performance analytics to overcome these limitations.

Big Data, Data Lake, ETL
0 likes · 12 min read
Big Data Technology & Architecture
May 24, 2022 · Databases

Apache Doris Basics: Creating Databases, Tables, Partitioning, Data Import, and Rollup

This article provides a comprehensive guide to Apache Doris, covering how to create databases and tables with single and composite partitions, import data via broker and routine loads, understand its aggregate, uniq, and duplicate data models, and leverage rollup and prefix index features for optimized querying.

Apache Doris, Partitioning, Rollup
0 likes · 20 min read
Cognitive Technology Team
May 23, 2022 · Backend Development

Custom MyBatis Interceptor Mechanism – Java Dynamic Proxy

This article explains how to create a custom MyBatis Interceptor using Java dynamic proxies, detailing the Interceptor interface, the plugin generation process, the wrap and invoke methods, and common use‑cases such as pagination, SQL analysis, encryption, and performance monitoring.

Dynamic Proxy, Interceptor, MyBatis
0 likes · 4 min read
IT Architects Alliance
May 19, 2022 · Big Data

How Apache Kylin Enables Sub‑Second OLAP on Massive Data Sets

This article explains how Apache Kylin uses pre-computed OLAP cubes on Hadoop/Spark/Flink to deliver sub-second query responses on massive datasets. It covers Kylin's architecture, integration with BI platforms, user security, cube building, monitoring, and HBase-based storage, and shows how these address big-data analytical challenges.

Apache Kylin, Big Data, HBase
0 likes · 12 min read
Big Data Technology & Architecture
May 18, 2022 · Databases

Understanding ClickHouse Distributed JOIN Implementation and Best Practices

This article explains ClickHouse's single‑node and distributed JOIN mechanisms, compares ordinary, GLOBAL, Broadcast, Shuffle and Colocate JOINs, illustrates execution flows with code examples, and provides practical recommendations to reduce join size, avoid query amplification, and leverage data pre‑distribution for optimal performance.

Big Data, clickhouse, performance
0 likes · 10 min read
IT Architects Alliance
May 17, 2022 · Databases

Inside MySQL: How Queries Are Processed from Client to Execution

This article explains the complete MySQL query processing pipeline, covering the connector, authentication, privilege checks, caching, parser, optimizer, executor, and the detailed SQL execution order with examples of each stage, helping readers understand and optimize their SQL statements.

Query Execution, database, mysql
0 likes · 13 min read
IT Xianyu
May 17, 2022 · Databases

SQL Optimization Steps and Common Scenarios

This article explains how to identify slow SQL statements, analyze execution plans with EXPLAIN, use profiling and tracing tools, and apply specific optimization techniques such as index tuning, query rewriting, and alternative storage solutions across a range of real‑world cases.

Profiling, explain, indexing
0 likes · 11 min read
Top Architect
May 15, 2022 · Backend Development

Understanding MyBatis Batch Processing: Common Pitfalls and Optimized Solutions

This article walks through the challenges of inserting massive data sets with MyBatis, explains the inner workings of batch execution, highlights common mistakes with commit, clearCache and flushStatements, and presents progressively refined code examples—including Oracle-specific optimizations—to dramatically improve performance.

Batch Processing, MyBatis, java
0 likes · 12 min read
dbaplus Community
May 12, 2022 · Big Data

How Bilibili Scaled Presto on Hadoop: Architecture, Optimizations, and Performance Gains

This article details Bilibili's end-to-end Presto on Hadoop architecture, covering the multi-engine SQL stack, dispatcher routing, cluster scale, stability enhancements like coordinator HA and real-time punishment, query limits, Hive UDF compatibility, insert-overwrite support, Alluxio caching, multi-datacenter routing, query result caching, RaptorX local cache, JDK upgrades, dynamic filtering, and the future roadmap, illustrating how these innovations boosted query throughput and reduced latency.

Big Data, Cluster Management, Distributed Systems
0 likes · 32 min read
Java High-Performance Architecture
May 12, 2022 · Backend Development

Master MyBatis Batch Processing: Common Pitfalls and Optimized Solutions

This article walks readers through MyBatis batch processing from basics to advanced, highlighting common mistakes, explaining the roles of commit, clearCache, and flushStatements, and presenting three progressively refined code versions plus Oracle-specific optimizations to dramatically improve bulk insert performance.

Batch Processing, MyBatis, backend-development
0 likes · 12 min read
dbaplus Community
May 11, 2022 · Big Data

How JD Logistics Tackled Billion-Scale Data Challenges with Doris

This article details JD Logistics' journey from fragmented, massive‑scale data to a unified, real‑time analytics platform, covering business needs, pain points, tool evaluation, a new Doris‑based architecture, table management, data import procedures, automation scripts, and future roadmap for data engineering.

BI Tools, Big Data, data-warehouse
0 likes · 16 min read
Zuoyebang Tech Team
May 9, 2022 · Big Data

How Flink SQL Powered Real‑Time Learning Analytics at Zuoyebang

Zuoyebang's big-data team shares how they evolved from Spark Streaming to a Flink SQL-centric real-time platform, detailing three development stages, challenges in DAG optimization, Redis-based table design, and platform features for unified deployment, ease of use, and operational governance.

Flink, Real-Time, Streaming
0 likes · 14 min read
Baidu Geek Talk
May 9, 2022 · Big Data

How a Spark Offline Framework Boosts Data Backtracking Efficiency

This article introduces a Spark offline development framework that separates configuration from code, supports SQL and Java applications, and provides fast, automated data backtracking with reduced environment preparation time, lower failure rates, and significant performance gains for large‑scale data warehouses.

Big Data, Data Backtracking, Offline Framework
0 likes · 17 min read
ITPUB
May 5, 2022 · Databases

When Does MySQL Use Temporary Tables and How Does It Choose the Storage Engine?

MySQL creates two kinds of temporary tables: external ones, created by CREATE TEMPORARY TABLE statements, and internal ones, created during query processing. It uses the MEMORY engine by default unless large objects or the big_tables variable force MyISAM or InnoDB. The article gives detailed guidance on scenarios, storage engine selection, conversion thresholds, indexing, and performance tuning.

mysql, query optimization, sql
0 likes · 19 min read
Architecture Digest
May 3, 2022 · Databases

Performance Comparison of Elasticsearch and ClickHouse for Log Search

This article compares Elasticsearch and ClickHouse in architecture, query capabilities, and performance for log search workloads, presenting Docker‑compose setups, data ingestion pipelines, sample queries, and benchmark results that show ClickHouse generally outperforms Elasticsearch, especially in aggregation scenarios.

Vector, log search, sql
0 likes · 11 min read
Senior Brother's Insights
May 1, 2022 · Databases

How to Quickly Diagnose and Resolve MySQL Table Locks in Production

Facing a sudden MySQL table lock? This guide walks through real‑world troubleshooting steps—from checking table usage and process lists to inspecting InnoDB transaction and lock tables, identifying lock‑wait relationships, and safely killing offending sessions, while also explaining lock types and common metadata‑lock scenarios.

InnoDB, database lock, mysql
0 likes · 10 min read
Java Backend Technology
Apr 28, 2022 · Databases

Why MySQL Pagination Slows Down at High Offsets and How to Fix It

This article recounts a production incident caused by a MySQL query with a huge OFFSET, explains why large offsets lead to full‑table scans and severe latency, and presents three practical solutions—including index‑covering subqueries, start‑position redefinition, and offset throttling—along with data‑generation scripts and performance test results.

Index Optimization, mysql, pagination
0 likes · 14 min read
Bilibili Tech
Apr 25, 2022 · Big Data

Optimizing Full Partition Tables with Zipper Tables, Hudi+Flink CDC, and Data Warehouse Strategies

Facing server‑hardware constraints, Bilibili’s data platform replaced wasteful full‑partition tables with a zipper‑table approach—preserving change history while cutting storage from petabytes to terabytes—and complemented it with Hudi + Flink CDC for near‑real‑time updates, dramatically lowering I/O, compute usage and latency.

Big Data, Flink CDC, Hudi
0 likes · 11 min read
Cognitive Technology Team
Apr 25, 2022 · Backend Development

Two Ways to Instantiate org.apache.ibatis.jdbc.SQL in MyBatis

This article explains two techniques for creating an org.apache.ibatis.jdbc.SQL object in MyBatis—using an anonymous inner class with an initializer block and using method‑chaining builder style—provides code examples, explains the underlying Java syntax, and summarizes their differences.

Builder Pattern, MyBatis, java
0 likes · 4 min read
YunZhu Net Technology Team
Apr 21, 2022 · Databases

Common MySQL Query Pitfalls and How to Avoid Them

This article outlines several frequent MySQL query pitfalls—including missing table aliases in multi‑table operations, unsupported FULL JOIN syntax, unexpected REPLACE INTO side effects, auto‑increment edge cases, data‑type mismatches, nondeterministic functions, and warning‑driven query behavior—providing examples and best‑practice recommendations to prevent data loss and performance issues.

BestPractices, DataIntegrity, database
0 likes · 14 min read
Java Captain
Apr 19, 2022 · Databases

Eight Classic MySQL Errors and How to Fix Them

This article presents eight common MySQL error scenarios—including forgotten passwords, password policy violations, case‑sensitivity issues, service startup failures, export/import restrictions, connection limits, binary log overflow, and primary‑key replication errors—along with detailed troubleshooting steps and configuration commands to resolve each problem.

Configuration, Database Errors, mysql
0 likes · 13 min read
Top Architect
Apr 19, 2022 · Databases

Querying Multi‑Business‑Unit Product Sales with Row Comparison in MySQL

This article explains how to retrieve sales statistics for multiple business units and their associated products in MySQL by comparing rows, discusses alternative approaches such as loop queries, OR concatenation, and mixed filtering, and demonstrates why row‑by‑row comparison is the most suitable solution.

Database Optimization, Dynamic SQL, Row Comparison
0 likes · 8 min read
Open Source Linux
Apr 19, 2022 · Databases

8 Common SQL Pitfalls and How to Optimize Them for Faster Queries

This article examines frequent SQL anti‑patterns—such as inefficient LIMIT usage, implicit type conversion, suboptimal UPDATE/DELETE joins, mixed ordering, unnecessary EXISTS clauses, condition push‑down failures, early limiting, and intermediate result handling—and provides rewritten queries that dramatically improve performance.

Query Tuning, database, mysql
0 likes · 14 min read
DaTaobao Tech
Apr 18, 2022 · Fundamentals

Propensity Score Matching: Principles, Implementation, and Evaluation

The article explains Propensity Score Matching as a causal inference method, detailing treatment effect concepts, required assumptions, score estimation, various matching algorithms, SQL implementation, quality assessment metrics, and how to estimate ATT using Difference-in-Differences, while outlining workflow steps, trade-offs, and alternatives.

Propensity Score Matching, causal inference, matching algorithms
0 likes · 13 min read
Programmer DD
Apr 16, 2022 · Databases

Mastering MySQL Hierarchical Queries with the Nested Set Model

This article explains how to replace inefficient recursive MySQL tree queries with the nested‑set (preorder traversal) technique, covering descendant retrieval, subtree counting, leaf detection, insertion, deletion, direct‑child lookup, ancestor path queries, and a JavaScript example for building a tree structure.

Database design, Hierarchical Query, Tree Structure
0 likes · 12 min read
Big Data Technology & Architecture
Apr 15, 2022 · Big Data

Configuring Flink SQL Client with Iceberg: Catalogs, DDL, Data Insertion and Query

This guide explains how to set up the Flink SQL client to work with Apache Iceberg, covering Scala version requirements, downloading and deploying Iceberg jars, configuring Hive and HDFS catalogs, creating databases and tables, performing insert and overwrite operations, and querying data in both batch and streaming modes.

Big Data, Catalog, Flink
0 likes · 18 min read
Architecture & Thinking
Apr 14, 2022 · Databases

Mastering MySQL Indexes: Boost Query Performance with B‑Tree, Prefix, and Composite Strategies

This article explains how MySQL stores indexes using B+ trees, how to evaluate index selectivity and prefix length, the importance of the leftmost principle for composite indexes, and practical techniques such as covering indexes, index condition pushdown, and avoiding common pitfalls that render indexes ineffective, all illustrated with real‑world query examples and performance measurements.

B+Tree, Database Performance, mysql
0 likes · 36 min read
vivo Internet Technology
Apr 13, 2022 · Big Data

Understanding Join Algorithms in Presto: Theory, Implementation, and Engineering Practices

The article explains Presto’s join processing by detailing the business need to limit multi‑table joins, then describing nested‑loop, sort‑merge, and hash join algorithms with Java examples, and finally showing how the Volcano model, columnar pages, and planner integration enable scalable, efficient OLAP join execution.

Big Data, Hash Join, Join Algorithms
0 likes · 17 min read
Big Data Technology & Architecture
Apr 13, 2022 · Databases

Common MySQL Index Failure Scenarios and Optimization Techniques

This article explains typical MySQL index failure cases such as left‑most matching truncation, implicit conversion, IN + ORDER BY, range queries, leading wildcard, OR conditions, function usage, NOT IN/!=, low selectivity, and index fragmentation, and provides practical optimization strategies including index redesign, covering indexes, and pagination improvements.

Database Performance, Index Optimization, Query Tuning
0 likes · 17 min read
Architecture & Thinking
Apr 11, 2022 · Databases

Unlocking MySQL Indexes: How B+ Trees Boost Query Performance

This article explains how MySQL implements indexes using B+ trees in InnoDB and MyISAM, compares primary and secondary indexes, demonstrates performance gains from proper indexing on large tables, and provides practical commands for creating, viewing, and dropping indexes.

B+Tree, mysql, performance
0 likes · 12 min read
Selected Java Interview Questions
Apr 9, 2022 · Databases

Comprehensive SQL Basics, Interview Questions, and Advanced Query Techniques

This article provides a thorough guide to SQL fundamentals—including SELECT, FROM, WHERE, GROUP BY, HAVING, ORDER BY, LIMIT, aggregate functions, joins, set operations, and Oracle-specific objects—followed by a collection of common interview questions with detailed solutions, top‑N queries, and row‑to‑column transformations, while also containing promotional notes for related resources.

Joins, Oracle, aggregation
0 likes · 25 min read
Top Architect
Apr 9, 2022 · Databases

Applying the Nested Set Model for Hierarchical Department Queries in MySQL

This article explains how to use the nested set (left‑right) model to efficiently query, count, and manipulate hierarchical department data in MySQL, providing SQL examples for retrieving descendants, ancestors, leaf detection, as well as insert, delete, and JavaScript tree‑building techniques.

Nested Set Model, Tree Structure, database
0 likes · 11 min read
Bilibili Tech
Apr 9, 2022 · Big Data

Bilibili Presto on Hadoop: Architecture, Scaling, and Performance Enhancements

Bilibili's Presto on Hadoop deployment combines a multi-engine offline platform with Kubernetes-managed YARN scheduling, Ranger security, and a custom dispatcher. It scales to over 400 nodes handling 160k daily queries on 10 PB, and adds coordinator HA, resource-group punishment, query limits, Alluxio caching, dynamic filtering, and numerous SQL-level enhancements; auto-scaling and materialized-view automation are planned for the future.

Big Data, Hadoop, Presto
0 likes · 30 min read
Architect's Tech Stack
Apr 8, 2022 · Databases

Comprehensive Guide to MySQL Query Optimization and Common Pitfalls

This article explains why SQL performance degrades as data grows and outlines a step-by-step optimization workflow: slow-query detection, EXPLAIN analysis, profiling, optimizer tracing, and corrective actions. It also details the type and Extra fields of EXPLAIN output, index usage, and practical scenarios such as large pagination, IN combined with ORDER BY, range queries, and complex joins.

database, indexing, mysql
0 likes · 10 min read
Su San Talks Tech
Apr 8, 2022 · Backend Development

How I Overcame Data Import Pitfalls and Canal Crashes in Production

This article recounts a series of real‑world challenges—including Excel‑to‑SQL imports, temporary table handling, Canal service failures, jar conflicts, and deployment mistakes—and explains the step‑by‑step solutions that restored system stability and delivered the required data to operations.

Backend · Canal · DataImport
0 likes · 17 min read
Liangxu Linux
Apr 7, 2022 · Databases

Essential MySQL Interview Guide: Normal Forms, Permissions, ACID, Indexes & More

This comprehensive guide covers MySQL fundamentals including the three normal forms, permission tables, ACID properties, index design principles, SQL statement categories, sharding, deadlock resolution, isolation anomalies, view characteristics, SQL lifecycle, primary key choices, performance troubleshooting, replication, GTID, backup tools, and backup planning.

Backup · Database design · Replication
0 likes · 13 min read
DataFunTalk
Apr 7, 2022 · Big Data

Apache Kyuubi: Architecture, Use Cases, Community, and Mobile Cloud Deployment

This article introduces Apache Kyuubi—a multi‑tenant Thrift JDBC/ODBC service built on Spark—detailing its architecture, advantages over Spark Thrift Server, real‑world use cases, open‑source community progress, and practical deployment strategies on mobile cloud, Kubernetes, and with Trino.

Apache Spark · Big Data · Kubernetes
0 likes · 16 min read
DataFunSummit
Apr 6, 2022 · Big Data

Real-time Dimension Modeling with Flink SQL: Challenges and Solutions

This article presents a JD.com case study on applying Flink SQL for real‑time dimension modeling, detailing two complex streaming scenarios—full‑join of multiple streams and full‑group aggregation—along with the associated challenges of historical data handling, state management, and performance optimization, and proposes component‑based architectural solutions.

Big Data · Flink · Real-Time
0 likes · 14 min read
Liangxu Linux
Apr 5, 2022 · Databases

Efficiently Query Multi‑Business‑Entity Sales Data with MySQL Row Comparison

This article explains how to retrieve sales statistics for multiple business units and their selected products in MySQL, compares loop‑based, OR‑concatenated, mixed‑filter, and SQL‑92 row‑comparison approaches, and shows why the row‑comparison technique was ultimately chosen for its index usage and simplicity.

Database design · Dynamic SQL · Row Comparison
0 likes · 6 min read
Open Source Linux
Apr 2, 2022 · Databases

Mastering MySQL with Sequel Pro: A Complete Beginner's Guide

Sequel Pro is a user‑friendly MySQL client that lets you connect via standard, socket or SSH methods, configure connection details, test and save them, and then explore databases through panels for selecting schemas, viewing table structures, contents, relationships, triggers, table info, and running queries.

Database Management · GUI · Sequel Pro
0 likes · 3 min read
Laravel Tech Community
Mar 31, 2022 · Databases

SQL Optimization Steps and Common Cases for Improving Query Performance

This article outlines a systematic approach to identifying and fixing slow MySQL queries, explains how to interpret EXPLAIN output and profiling information, and presents multiple real‑world cases illustrating index usage, implicit conversion, large pagination, IN queries, and other optimization techniques.

Database Optimization · Explain Plan · indexing
0 likes · 10 min read
Big Data Technology & Architecture
Mar 30, 2022 · Databases

Understanding ClickHouse AggregatingMergeTree, AggregateFunction, and Materialized Views

This article explains how ClickHouse's AggregatingMergeTree engine uses the special AggregateFunction data type to pre‑aggregate data, demonstrates table creation, data insertion, and querying with state and merge functions, and shows how to combine it with materialized views for efficient analytics.

AggregateFunction · AggregatingMergeTree · MaterializedView
0 likes · 8 min read
21CTO
Mar 30, 2022 · Big Data

What Drives Taobao App Users? Insights from AARRR and RFM Analyses

This article analyzes 2 million Taobao app user‑behavior records using AARRR funnel metrics and RFM segmentation, revealing daily and hourly usage patterns, conversion bottlenecks, product‑search mismatches, and offering data‑driven marketing recommendations to boost retention and sales.

AARRR · Big Data · RFM
0 likes · 25 min read
Qunar Tech Salon
Mar 30, 2022 · Databases

Understanding and Optimizing PostgreSQL VACUUM and AUTOVACUUM

This article explains the purpose and mechanics of PostgreSQL's VACUUM and AUTOVACUUM processes, demonstrates practical examples of table bloat, transaction ID handling, and provides step‑by‑step guidance for precise triggering and performance optimization of vacuum operations.

AUTOVACUUM · Database Tuning · Vacuum
0 likes · 37 min read
Laravel Tech Community
Mar 29, 2022 · Databases

Efficient Methods for Removing Duplicate Records in MySQL Tables

This article explains why a naïve Python‑based row‑deletion approach is slow for large MySQL tables and provides step‑by‑step SQL techniques—including identifying duplicate names, handling MySQL’s update‑from‑same‑table limitation, and deleting duplicates while preserving a single record per group—complete with executable code examples.

data cleaning · duplicate removal · mysql
0 likes · 5 min read
Big Data Technology & Architecture
Mar 28, 2022 · Big Data

Real-time Dimension Modeling with Flink SQL: Problems, Challenges, and Solutions

This article presents JD's real-time dimension modeling case using Flink SQL, detailing two complex streaming scenarios, the difficulties of handling historical data and state management, and a component‑based solution that leverages external KV stores and optimized Flink operators to improve performance and scalability.

Big Data · Flink · Real-Time
0 likes · 13 min read
StarRocks
Mar 28, 2022 · Backend Development

Scaling Microservice Tracing with Zipkin and StarRocks: A Practical Guide

This article explains how Sohu Smart Media built a high‑performance tracing system for microservices by integrating Zipkin for data collection with StarRocks for storage and analytics, covering architecture, data models, SQL queries, Flink processing, and real‑world results that boost observability and engineering efficiency.

Flink · Microservices · StarRocks
0 likes · 31 min read
Java Captain
Mar 25, 2022 · Databases

Common MySQL Interview Questions and Answers

This article compiles essential MySQL interview topics, covering normal forms, privilege tables, ACID properties, SQL statement categories, sharding, deadlocks, isolation anomalies, view characteristics, statement lifecycle, primary key choices, CPU troubleshooting, replication, GTID, backup tools, and backup planning strategies.

Database design · Replication · interview-questions
0 likes · 15 min read
DataFunTalk
Mar 24, 2022 · Big Data

Real‑time Dimension Modeling with Flink SQL: Problems, Challenges, and Solutions

This article presents a JD.com BI engineer's case study on applying Flink SQL to real‑time dimension modeling, detailing two complex streaming scenarios, the technical difficulties of handling historical data and performance, and a component‑based solution architecture with future roadmap considerations.

Big Data · Flink · Real-Time
0 likes · 13 min read
DeWu Technology
Mar 23, 2022 · Databases

Introduction to ORM and Peewee Usage in Python

The article explains how ORM bridges Python objects and relational databases, compares options, and justifies choosing the lightweight Peewee library, then details its workflow—model generation, inserting, updating, deleting, and querying records—showcasing how this approach reduces manual data‑creation time from hours to seconds.

Django · ORM · Peewee
0 likes · 9 min read
Big Data Technology & Architecture
Mar 23, 2022 · Databases

ClickHouse SQL Fundamentals: CREATE, TABLE, Views, ALTER, Partitioning, Import/Export, and Mutation Operations

This article provides a comprehensive guide to ClickHouse SQL, covering database creation, table definitions, column defaults, temporary and partitioned tables, view types, DDL commands, data import/export formats, and mutation‑based update and delete operations with practical code examples.

Mutation · Partitioning · Views
0 likes · 17 min read
Java Interview Crash Guide
Mar 21, 2022 · Databases

Mastering Multi‑Entity Sales Queries with SQL Row Comparison

This article explains how to design a MySQL table for product sales statistics and compares several query strategies—including loop queries, OR concatenation, mixed filtering, and the SQL‑92 row‑by‑row comparison—to efficiently retrieve sales data for dynamic sets of business institutions and their products in a single indexed query.

Database Query · Row Comparison · many-to-many
0 likes · 7 min read
Alibaba Cloud Developer
Mar 18, 2022 · Backend Development

13 Crucial Java Practices Every Developer Should Follow

This article examines thirteen common Java pitfalls and best‑practice guidelines, ranging from property copying and date formatting to HashMap sizing, thread‑pool creation, collection handling, logging, and serialization, explaining why each recommendation exists and how it affects performance and safety.

Collections · concurrency · java
0 likes · 9 min read
Architecture & Thinking
Mar 17, 2022 · Databases

Why Large OFFSETs Slow Down MySQL Queries and How to Fix Them

This article analyzes a production incident where massive OFFSET values in MySQL pagination caused severe query slowdown, demonstrates the problem with simulated data, benchmarks the impact, and presents three practical solutions—including index‑covering subqueries, remembering the last primary key, and limiting offset size—to dramatically improve performance and protect the database.

indexing · mysql · performance
0 likes · 13 min read