Databases 16 min read

Understanding MySQL Slow Queries, Index Optimization, ElasticSearch Basics, and HBase Overview

This article explains why MySQL queries become slow, how proper indexing—including B+‑tree, left‑most prefix, index push‑down, and covering indexes—can improve performance, outlines common causes of index failure, and then introduces ElasticSearch search capabilities and HBase column‑family storage as complementary solutions for large‑scale data handling.

Python Programming Learning Circle

Aug 12, 2021

Understanding MySQL Slow Queries, Index Optimization, ElasticSearch Basics, and HBase Overview

MySQL slow queries often stem from missing or misused indexes; the engine relies on B+‑tree structures, where left‑most prefix rules, index push‑down, and covering indexes can dramatically reduce the need for table lookups.

Typical reasons for index inefficiency include using !=, <>, OR, functions on indexed columns, leading‑wildcard LIKE, missing quotes around strings, low‑cardinality fields (e.g., gender), and not matching the leftmost prefix of a composite index.

These patterns break the ordered nature of the index or cause implicit type/character‑set conversions, prompting MySQL to skip the index for safety.

Best practices for effective indexing are:

Leverage index push‑down by ensuring query predicates are fully covered by a composite index.

Use covering indexes so all required columns reside in the index, eliminating table lookups.

Apply prefix indexes for long string columns to reduce index size.

Avoid functions on indexed columns and consider maintenance cost for frequently updated fields.

When an index appears to be chosen incorrectly, possible causes are inaccurate statistics (fixable with ANALYZE TABLE) or optimizer mis‑prediction (addressable with FORCE INDEX or query rewriting).

Additional performance bottlenecks include MDL locks, flush waits, row locks, current‑read overhead, and large‑table challenges such as I/O or CPU limits; solutions involve sharding (horizontal or vertical), read‑write splitting, and careful key design.

ElasticSearch, built on Lucene, serves as a near‑real‑time distributed search engine suitable for full‑text search, log analysis, and NoSQL document storage. A typical search request looks like:

GET yourIndex/_search
{
  "from": 0,
  "size": 10,
  "query": {
    "match_phrase": { "log": "xxx" }
  }
}

ES stores data as indices with mappings (similar to table schemas) and settings (shard and replica configuration). It uses an inverted index with a term dictionary and an in‑memory term index (FST) to achieve fast term lookups.

ES excels at tokenized full‑text queries and can complement relational databases; common integration patterns include:

Store searchable fields and document IDs in ES while keeping full records in MySQL.

Combine ES with HBase for massive write‑intensive workloads.

HBase is a column‑family NoSQL store where rows are identified by a lexicographically ordered row key. Data is organized into column families, allowing dynamic columns and high write throughput, making it suitable for OLTP‑style write‑heavy scenarios rather than complex analytical queries.

Overall, choosing the right tool—optimizing MySQL indexes, leveraging ElasticSearch for full‑text or hybrid queries, and employing HBase for write‑intensive workloads—depends on data size, query patterns, and operational constraints.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactand we will review it promptly.

Indexing Search Engine Elasticsearch MySQL HBase database optimization

Written by

Python Programming Learning Circle

A global community of Chinese Python developers offering technical articles, columns, original video tutorials, and problem sets. Topics include web full‑stack development, web scraping, data analysis, natural language processing, image processing, machine learning, automated testing, DevOps automation, and big data.

0 followers

Reader feedback

How this landed with the community

Rate this article

Was this worth your time?

Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.