MySQL聚类与非聚类索引性能

发布于 2025-02-13 22:20:16 字数 776 浏览 2 评论 0原文

我正在对MySQL群集与非聚类索引进行几个测试，其中我有一个表100GB_Table包含〜6000万行的：

100gb_table schema:
CREATE TABLE 100gb_table (
    id int PRIMARY KEY NOT NULL AUTO_INCREMENT,
    c1 int,
    c2 text,
    c3 text,
    c4 blob NOT NULL,
    c5 text,
    c6 text,
    ts timestamp NOT NULL default(CURRENT_TIMESTAMP)
);

我正在执行一个查询，该查询只读取群集索引：

SELECT id FROM 100gb_table ORDER BY id;

i'' m看到这个查询完成了几乎 〜55分钟，这很慢。我通过在主键列的顶部添加另一个索引来修改表，并运行以下查询，该查询迫使要使用的非群集索引：

SELECT id FROM 100gb_table USE INDEX (non_clustered_key) ORDER BY id;

在＆lt; 10分钟中完成，比阅读快得多。与群集索引。为什么这两者之间存在如此巨大的差异？我的理解是，这两个索引都将索引列的值存储在树结构中，除了群集索引包含叶子节点中的表数据，因此我希望两个查询都表现出类似的性能。 Blob列可能会扭曲聚类索引结构吗？

原文

I'm running a couple tests on MySQL Clustered vs Non Clustered indexes where I have a table 100gb_table which contains ~60 million rows:

100gb_table schema:
CREATE TABLE 100gb_table (
    id int PRIMARY KEY NOT NULL AUTO_INCREMENT,
    c1 int,
    c2 text,
    c3 text,
    c4 blob NOT NULL,
    c5 text,
    c6 text,
    ts timestamp NOT NULL default(CURRENT_TIMESTAMP)
);

and I'm executing a query that only reads the clustered index:

SELECT id FROM 100gb_table ORDER BY id;

I'm seeing that it takes almost an ~55 min for this query to complete which is strangely slow. I modified the table by adding another index on top of the Primary Key column and ran the following query which forces the non-clustered index to be used:

SELECT id FROM 100gb_table USE INDEX (non_clustered_key) ORDER BY id;

This finished in <10 minutes, much faster than reading with the clustered index. Why is there such a large discrepancy between these two? My understanding is that both indexes store the index column's values in a tree structure, except the clustered index contains table data in the leaf nodes so I would expect both queries to be similarly performant. Could the BLOB column possibly be distorting the clustered index structure?

分享到QQ

分享到微博