Databases 7 min read

MySQL Multi‑Table Join Performance: When to Prefer Service‑Layer Merging Over Database Joins

The article examines MySQL's limited join types and performance drawbacks for multi‑table queries, compares them with PostgreSQL, and explains why many applications split joins into separate service‑layer operations to reduce RPC overhead, improve caching, and handle sharding constraints.

Architecture Digest

Mar 11, 2024

MySQL Multi‑Table Join Performance: When to Prefer Service‑Layer Merging Over Database Joins

When discussing MySQL versus PostgreSQL multi‑table joins, the author notes that MySQL only supports nested‑loop joins, lacking sort‑merge and hash joins, which can make queries involving more than three tables slower than PostgreSQL.

For small tables (tens of thousands of rows) without indexes, a Cartesian product can explode result size, making multiple single‑table queries more efficient than a single massive join due to network I/O constraints.

In typical business scenarios, join conditions are indexed, producing a small intermediate result that can be fetched with a single RPC when using a database join, saving two extra RPCs compared to service‑layer merging.

However, many systems still prefer to perform the merge in the service layer for three main reasons:

Database CPU is expensive; moving computation to horizontally scalable services improves overall throughput.

Complex architectures often involve multiple databases behind middleware, making cross‑database joins impossible, so a service layer abstracts the data source.

Sharding and distributed transactions limit join feasibility; service‑layer logic can handle eventual consistency and compensate for failed writes.

To improve performance, high‑throughput applications often decompose joins: execute separate single‑table queries and combine results in the application. This approach enhances cache reuse, reduces lock contention, eases sharding, and can even outperform nested‑loop joins by implementing hash‑style association in code.

Example of a full join query:

select * from tag<br>join tag_post on tag_post.tag_id=tag.id<br>join post on tag_post.post_id=post.id<br>where tag.tag='mysql';

Decomposed into three queries:

Select * from tag where tag='mysql';<br>Select * from tag_post where tag_id=1234;<br>Select * from post where id in (123,456,567,9989,8909);

By splitting the query, applications can cache individual table results, reduce lock competition, and achieve better scalability, especially in sharded environments.

END

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactand we will review it promptly.

mysql Service Layer

Written by

Architecture Digest

Focusing on Java backend development, covering application architecture from top-tier internet companies (high availability, high performance, high stability), big data, machine learning, Java architecture, and other popular fields.

0 followers

Reader feedback

How this landed with the community

Rate this article

Was this worth your time?

Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.