How much performance can using LIMIT in a SQL statement gain?

Published 2024-11-02 08:08:06


Let's suppose I have a table in my database with 1.000.000 records.

If I execute:

SELECT * FROM [Table] LIMIT 1000

Will this query take the same time as if I have that table with 1000 records and just do:

SELECT * FROM [Table]

?

I'm not asking whether it will take exactly the same time. I just want to know if the first one will take much more time to execute than the second one.

I said 1.000.000 records, but it could be 20.000.000. That was just an example.

Edit:
Of course, on the same table, a query using LIMIT should execute faster than one without it, but that's not what I'm asking...

To make it generic:

Table1: X records
Table2: Y records

(X << Y)

What I want to compare is:

SELECT * FROM Table1

and

SELECT * FROM Table2 LIMIT X
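One way to sanity-check this is to time both query shapes yourself. Below is a minimal sketch using Python's built-in sqlite3 module (not SQL Server CE, so the absolute numbers will differ, but the relative behaviour is the point); the table names and row counts are made up for the experiment:

```python
import sqlite3
import time

conn = sqlite3.connect(":memory:")
cur = conn.cursor()

X, Y = 1_000, 200_000  # X << Y, stand-ins for the 1000 vs 1000000 case
cur.execute("CREATE TABLE Table1 (id INTEGER, val TEXT)")
cur.execute("CREATE TABLE Table2 (id INTEGER, val TEXT)")
cur.executemany("INSERT INTO Table1 VALUES (?, ?)", ((i, "x") for i in range(X)))
cur.executemany("INSERT INTO Table2 VALUES (?, ?)", ((i, "x") for i in range(Y)))

def run(sql):
    """Return (row count, elapsed seconds) for a query."""
    t0 = time.perf_counter()
    rows = cur.execute(sql).fetchall()
    return len(rows), time.perf_counter() - t0

n1, t1 = run("SELECT * FROM Table1")             # full scan of the small table
n2, t2 = run("SELECT * FROM Table2 LIMIT 1000")  # early-terminated scan of the big one

# Both fetch exactly 1000 rows; without an ORDER BY, the LIMIT lets the
# engine stop after the first 1000, so the timings are in the same ballpark.
print(n1, t1)
print(n2, t2)
```

The exact timings depend on the engine and storage layout, but on any reasonable engine the LIMIT query does not pay for the rows it never reads.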

Edit 2:
Here is why I'm asking this:

I have a database, with 5 tables and relationships between some of them. One of those tables will (I'm 100% sure) contain about 5.000.000 records. I'm using SQL Server CE 3.5, Entity Framework as the ORM and LINQ to SQL to make the queries.

I need to perform basically three kinds of non-simple queries, and I was thinking about showing the user a limited number of records (just like a lot of websites do). If the user wants to see more records, the option he/she has is to restrict the search further.

So, the question came up because I was thinking about doing this (limiting to X records per query), or alternatively storing only the X most recent results in the database, which would require doing some deletions, but I was just thinking...

So, that table could contain 5.000.000 records or more, and what I don't want is to show the user only 1000 or so and still have the query run as slowly as if it were returning all 5.000.000 rows.

Comments (3)

旧竹 2024-11-09 08:08:06


TAKE 1000 from a table of 1000000 records - it will be roughly 1000000/1000 (= 1000) times faster, because it only needs to look at (and return) 1000 of the 1000000 records. Since it does less, it is naturally faster.

The result will be pretty much (pseudo-)random, since you haven't specified any order in which to TAKE. However, if you do introduce an order, then one of the two cases below applies:

  1. The ORDER BY clause follows an index - the above statement is still true.
  2. The ORDER BY clause cannot use any index - it will be only marginally faster than without the TAKE, because
    • it has to inspect ALL records, and sort by ORDER BY
    • deliver only a subset (TAKE count)
    • so it is not faster in the first step, but the 2nd step involves less IO/network than ALL records

If you TAKE 1000 records from a table of 1000 records, it will be equivalent (with no significant difference) to taking 1000 records from 1 billion, as long as there is either no ORDER BY at all, or an ORDER BY that follows an index (case 1 above).
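The two ORDER BY cases above show up directly in the query planner. A small sketch using Python's sqlite3 (SQLite's plan output, not SQL Server CE's; the table and the index name `idx_created` are made up):

```python
import sqlite3

conn = sqlite3.connect(":memory:")
cur = conn.cursor()
cur.execute("CREATE TABLE t (id INTEGER, created TEXT, body TEXT)")
cur.execute("CREATE INDEX idx_created ON t (created)")

def plan(sql):
    """Return SQLite's EXPLAIN QUERY PLAN output as one string."""
    return " ".join(row[-1] for row in cur.execute("EXPLAIN QUERY PLAN " + sql))

# Case 1: ORDER BY follows an index -- rows stream out of the index
# already sorted, so the LIMIT can stop after 1000 rows.
p1 = plan("SELECT * FROM t ORDER BY created LIMIT 1000")

# Case 2: ORDER BY cannot use any index -- every row must be sorted
# first; the LIMIT only trims what is delivered afterwards.
p2 = plan("SELECT * FROM t ORDER BY body LIMIT 1000")

print(p1)  # mentions the index (idx_created)
print(p2)  # mentions a temporary sort (TEMP B-TREE FOR ORDER BY)
```

The same distinction exists in most engines, only the plan-inspection syntax differs (e.g. `EXPLAIN` in MySQL, execution plans in SQL Server).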

坦然微笑 2024-11-09 08:08:06


Assuming both tables are equivalent in terms of index, row-sizing and other structures. Also assuming that you are running that simple SELECT statement. If you have an ORDER BY clause in your SQL statements, then obviously the larger table will be slower. I suppose you're not asking that.

If X = Y, then obviously they should run in similar speed, since the query engine will be going through the records in exactly the same order -- basically a table scan -- for this simple SELECT statement. There will be no difference in query plan.

If Y > X only by a little bit, then also similar speed.

However, if Y >> X (meaning Y has many more rows than X), then the LIMIT version MAY be slower. Not because of the query plan -- again, it should be the same -- but simply because the internal structure of the data layout may have several more levels. For example, if data is stored as leaves on a tree, there may be more tree levels, so it may take slightly more time to access the same number of pages.

In other words, 1000 rows may be stored in 1 tree level across 10 pages, say, while 1000000 rows may be stored in 3-4 tree levels across 10000 pages. Even when reading only 10 of those 10000 pages, the storage engine still has to traverse 3-4 tree levels, which may take slightly longer.

Now, if the storage engine stores data pages sequentially or as a linked list, say, then there will be no difference in execution speed.

只等公子 2024-11-09 08:08:06


It would be approximately linear, as long as you specify no fields, no ordering, and all the records. But that doesn't buy you much. It falls apart as soon as your query wants to do something useful.

This would be quite a bit more interesting if you intended to draw some useful conclusion and tell us about the way it would be used to make a design choice in some context.

Thanks for the clarification.

In my experience, real applications with real users seldom have interesting or useful queries that return entire million-row tables. Users want to know about their own activity, or a specific forum thread, etc. So unless yours is an unusual case, by the time you've really got their selection criteria in hand, you'll be talking about reasonable result sizes.

In any case, users wouldn't be able to do anything useful with more than a few hundred rows: transporting them would take a long time, and they couldn't scroll through them in any reasonable way.

MySQL has the LIMIT and OFFSET (starting record #) modifiers primarily for the exact purpose of creating chunks of a list for paging, as you describe.

It's way counterproductive to start thinking about schema design and record purging until you've used up this and a bunch of other strategies. In this case don't solve problems you don't have yet. Several-million-row tables are not big, practically speaking, as long as they are correctly indexed.
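A minimal sketch of that LIMIT/OFFSET paging pattern, again using Python's sqlite3 (SQLite accepts the same `LIMIT ... OFFSET` syntax as MySQL; the `posts` table, `fetch_page` helper, and page size are all hypothetical):

```python
import sqlite3

conn = sqlite3.connect(":memory:")
cur = conn.cursor()
cur.execute("CREATE TABLE posts (id INTEGER PRIMARY KEY, title TEXT)")
cur.executemany("INSERT INTO posts (title) VALUES (?)",
                ((f"post {i}",) for i in range(10_000)))

PAGE_SIZE = 50

def fetch_page(page):
    """Fetch one page of results; page numbers start at 0.

    The ORDER BY is essential: without a stable order, LIMIT/OFFSET
    pages can overlap or skip rows between requests.
    """
    return cur.execute(
        "SELECT id, title FROM posts ORDER BY id LIMIT ? OFFSET ?",
        (PAGE_SIZE, page * PAGE_SIZE),
    ).fetchall()

first = fetch_page(0)   # rows 1..50
third = fetch_page(2)   # rows 101..150
print(first[0], third[0])
```

Note that large OFFSET values still make the engine walk past all the skipped rows, so for deep paging a keyset approach (`WHERE id > last_seen ORDER BY id LIMIT n`) scales better.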
