当前位置：文江博客话题详情

“已提交读”和“已提交读”之间的区别和“可重复读取”在 SQL Server 中

发布于 2024-09-29 13:47:43 字数 46 浏览 6 评论 0原文

我认为上面的隔离级别非常相似。有人可以用一些很好的例子来描述主要区别是什么吗？

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

九厘米的零° 2024-10-06 13:47:43

已提交读是一种隔离级别，可保证读取时已提交任何数据读取。它只是限制读者看到任何中间的、未提交的“脏”读物。它不承诺如果事务重新发出读取，将找到相同数据，数据在读取后可以自由更改。

可重复读是一种更高的隔离级别，除了保证读已提交级别之外，还保证读到的任何数据不能改变，如果事务再次读取相同的数据，就会发现先前读取的数据就位，未更改且可供读取。

下一个隔离级别（可序列化）提供了更强大的保证：除了所有可重复读取保证之外，它还保证后续读取无法看到新数据。

假设您有一个表 T，其中包含 C 列，其中包含一行，假设它的值为“1”。假设您有一个如下所示的简单任务：

BEGIN TRANSACTION;
SELECT * FROM T;
WAITFOR DELAY '00:01:00'
SELECT * FROM T;
COMMIT;

这是一个从表 T 中发出两次读取的简单任务，两次读取之间有 1 分钟的延迟。

在 READ COMMITTED 下，第二个 SELECT 可能返回任何数据。并发事务可以更新记录、删除记录、插入新记录。第二次选择将始终看到新数据。
在 REPEATABLE READ 下，第二个 SELECT 保证至少显示从第一个 SELECT 返回的行未更改。并发事务可能会在那一分钟内添加新行，但无法删除或更改现有行。
在 SERIALIZABLE 读取下，第二个选择保证看到与第一个选择完全相同的行。并发事务不能更改、删除任何行，也不能插入新行。

如果您遵循上面的逻辑，您很快就会意识到，SERIALIZABLE 事务虽然可以让您的生活变得轻松，但总是完全阻止每个可能的并发操作，因为它们要求任何人都不能修改、删除或插入任何行。 .Net System.Transactions 作用域的默认事务隔离级别是可序列化的，这通常可以解释所导致的糟糕性能。

最后，还有 SNAPSHOT 隔离级别。 SNAPSHOT 隔离级别提供与可串行化相同的保证，但不要求任何并发事务都可以修改数据。相反，它迫使每个读者看到自己版本的世界（自己的“快照”）。这使得编程变得非常容易，并且非常可扩展，因为它不会阻止并发更新。然而，这种好处是有代价的：额外的服务器资源消耗。

补充阅读：

Read committed is an isolation level that guarantees that any data read was committed at the moment is read. It simply restricts the reader from seeing any intermediate, uncommitted, 'dirty' read. It makes no promise whatsoever that if the transaction re-issues the read, will find the Same data, data is free to change after it was read.

Repeatable read is a higher isolation level, that in addition to the guarantees of the read committed level, it also guarantees that any data read cannot change, if the transaction reads the same data again, it will find the previously read data in place, unchanged, and available to read.

The next isolation level, serializable, makes an even stronger guarantee: in addition to everything repeatable read guarantees, it also guarantees that no new data can be seen by a subsequent read.

Say you have a table T with a column C with one row in it, say it has the value '1'. And consider you have a simple task like the following:

BEGIN TRANSACTION;
SELECT * FROM T;
WAITFOR DELAY '00:01:00'
SELECT * FROM T;
COMMIT;

That is a simple task that issue two reads from table T, with a delay of 1 minute between them.

under READ COMMITTED, the second SELECT may return any data. A concurrent transaction may update the record, delete it, insert new records. The second select will always see the new data.
under REPEATABLE READ the second SELECT is guaranteed to display at least the rows that were returned from the first SELECT unchanged. New rows may be added by a concurrent transaction in that one minute, but the existing rows cannot be deleted nor changed.
under SERIALIZABLE reads the second select is guaranteed to see exactly the same rows as the first. No row can change, nor deleted, nor new rows could be inserted by a concurrent transaction.

If you follow the logic above you can quickly realize that SERIALIZABLE transactions, while they may make life easy for you, are always completely blocking every possible concurrent operation, since they require that nobody can modify, delete nor insert any row. The default transaction isolation level of the .Net System.Transactions scope is serializable, and this usually explains the abysmal performance that results.

And finally, there is also the SNAPSHOT isolation level. SNAPSHOT isolation level makes the same guarantees as serializable, but not by requiring that no concurrent transaction can modify the data. Instead, it forces every reader to see its own version of the world (its own 'snapshot'). This makes it very easy to program against as well as very scalable as it does not block concurrent updates. However, that benefit comes with a price: extra server resource consumption.

Supplemental reads:

回复收藏 0 原文

葵雨 2024-10-06 13:47:43

可重复读取

数据库的状态从事务开始时就得到维护。如果您在 session1 中检索一个值，然后在 session2 中更新该值，在 session1 中再次检索该值将返回相同的结果。读取是可重复的。

session1> BEGIN;
session1> SELECT firstname FROM names WHERE id = 7;
Aaron

session2> BEGIN;
session2> SELECT firstname FROM names WHERE id = 7;
Aaron
session2> UPDATE names SET firstname = 'Bob' WHERE id = 7;
session2> SELECT firstname FROM names WHERE id = 7;
Bob
session2> COMMIT;

session1> SELECT firstname FROM names WHERE id = 7;
Aaron

已提交读

在事务上下文中，您将始终检索最近提交的值。如果您在 session1 中检索一个值，在 session2 中更新它，然后再次在 session1 中检索它，您将获得在 session2 中修改的值。它读取最后提交的行。

session1> BEGIN;
session1> SELECT firstname FROM names WHERE id = 7;
Aaron

session2> BEGIN;
session2> SELECT firstname FROM names WHERE id = 7;
Aaron
session2> UPDATE names SET firstname = 'Bob' WHERE id = 7;
session2> SELECT firstname FROM names WHERE id = 7;
Bob
session2> COMMIT;

session1> SELECT firstname FROM names WHERE id = 7;
Bob

有道理吗？

Repeatable Read

The state of the database is maintained from the start of the transaction. If you retrieve a value in session1, then update that value in session2, retrieving it again in session1 will return the same results. Reads are repeatable.

session1> BEGIN;
session1> SELECT firstname FROM names WHERE id = 7;
Aaron

session2> BEGIN;
session2> SELECT firstname FROM names WHERE id = 7;
Aaron
session2> UPDATE names SET firstname = 'Bob' WHERE id = 7;
session2> SELECT firstname FROM names WHERE id = 7;
Bob
session2> COMMIT;

session1> SELECT firstname FROM names WHERE id = 7;
Aaron

Read Committed

Within the context of a transaction, you will always retrieve the most recently committed value. If you retrieve a value in session1, update it in session2, then retrieve it in session1again, you will get the value as modified in session2. It reads the last committed row.

session1> BEGIN;
session1> SELECT firstname FROM names WHERE id = 7;
Aaron

session2> BEGIN;
session2> SELECT firstname FROM names WHERE id = 7;
Aaron
session2> UPDATE names SET firstname = 'Bob' WHERE id = 7;
session2> SELECT firstname FROM names WHERE id = 7;
Bob
session2> COMMIT;

session1> SELECT firstname FROM names WHERE id = 7;
Bob

Makes sense?

回复收藏 0 原文

拧巴小姐 2024-10-06 13:47:43

根据我对这个线程的阅读和理解，简单的答案是基于这个简单的场景：

有两个事务 A 和 B。它们按以下顺序执行以下操作。

事务 B 首先从表 X 读取
事务 A，然后写入表 X
事务 B 然后再次从表 X 读取。

读取未提交：事务 B 可以从事务 A 中读取未提交的数据，并且可以基于不同的行看到不同的行一篇写作。 根本没有锁
ReadCommited：事务 B 只能从事务 A 中读取已提交的数据，并且它可以基于仅 A 写入的 COMMITTED 看到不同的行。 我们可以称之为简单锁吗？
RepeatableRead：无论事务 A 正在做什么，事务 B 都会读取相同的数据（行）。但事务 A 可以更改其他行。 行级别块
可串行：事务 B 将读取与之前相同的行，事务 A 无法在表中读取或写入。 表级块
快照：每个交易都有自己的副本，并且正在处理它。 每个人都有自己的看法

回复收藏 0 原文

赠意 2024-10-06 13:47:43

老问题已经有一个公认的答案，但我喜欢考虑这两个隔离级别如何改变 SQL Server 中的锁定行为。这对于那些像我一样正在调试死锁的人可能会有所帮助。

READ COMMITTED（默认）

共享锁在 SELECT 中获取，然后当 SELECT 语句完成时释放。这就是系统如何保证未提交数据不会被脏读的原因。在 SELECT 完成之后和事务完成之前，其他事务仍然可以更改基础行。

可重复读取

共享锁在 SELECT 中获取，然后仅在事务完成后释放。这就是系统如何保证您读取的值在事务期间不会更改（因为它们在事务完成之前保持锁定状态）。

回复收藏 0 原文

世俗缘 2024-10-06 13:47:43

尝试用简单的图表来解释这个疑惑。

读取已提交：在此隔离级别中，事务 T1 将读取事务 T2 提交的 X 的更新值。

可重复读：在此隔离级别下，事务 T1 不会考虑事务 T2 提交的更改。

回复收藏 0 原文

你另情深 2024-10-06 13:47:43

这里还有其他答案，但它们没有提供有关底层数据库体系结构的任何详细信息，这使得很难理解事务隔离级别为何以它们的方式运行以及解决了哪些问题。

并发环境中常见问题的概述

数据库系统允许多个并发连接。这会导致在其他并发系统中出现相同类型的问题。例如，在多线程环境中，互斥锁可以防止对内存的并发访问，从而解决可能导致数据损坏或无效的竞争条件问题。

同样，由于数据库系统允许并发 CRUD 操作（更新、插入、删除、选择），因此多个连接的并发操作可能会导致不良的观察行为。

请注意，数据库行操作的原子性可防止彻底的数据损坏或不一致，因此始终强制执行基本级别的事务隔离。

有关详细信息，请参阅 ACID。（原子性、一致性、隔离性、持久性。）简短的解释是，在每行的基础上，操作是原子的。这意味着通过防止出现一种情况来防止数据损坏：在另一个连接通过将其数据部分写入同一行而损坏该数据之前，一个连接会写入部分行数据。（这对于熟悉多线程环境的人来说会更直观。）

上述问题与多线程编程中出现的问题类似，其中一个线程开始写入内存块，然后另一个线程到来并将其数据部分写入同一内存块。在第一个线程完成之前阻塞。这会导致数据不一致。

首先了解行操作的原子性质非常重要，因为这已经提供了基本级别的保护。

事务隔离级别的类型

我们将了解以下事务隔离级别，这些级别在 MariaDB 和许多其他 SQL 数据库实现中可用。

我们首先需要知道不同的隔离级别是什么：

读取未提交
读取已提交
可重复读取
可序列化

事务隔离级别解决了哪些问题？

在解释这些不同选项的作用之前，了解每个选项解决的问题非常重要。以下是潜在不良行为的列表。

脏读
不可重复读
幻读

脏读：

数据库操作通常组合在一起形成一个事务。然后，该事务要么作为一组操作提交到数据库，要么执行回滚以丢弃该组操作。

如果一个连接作为事务启动一系列操作，然后第二个连接开始从同一个表读取数据，则第二个连接可以从已提交的数据库中读取数据。，或者它也可以读取作为开放但未提交的事务的一部分所做的更改。

这定义了已提交的读和未提交的读之间的区别。

这在概念上是不寻常的，因为读取未提交的数据通常没有多大意义。事务的全部目的是确保数据库中的数据不会以这种方式更改。

总结一下：

连接 A 打开事务并开始排队修改（写入）操作
连接 B 以读未提交模式打开，并从数据库读取数据
连接 A 继续排队进一步修改
如果连接 B 再次读取数据，它将有已更改
如果连接 A 执行回滚，然后连接 B 执行另一次读取，则读取的数据将由于未提交数据的回滚而发生更改
这称为脏读
这不是一种情况你通常需要担心，因为一般来说你可能不应该在读未提交模式下工作，因为它使数据看起来好像事务不存在，这很

奇怪

。如果读操作模式设置为“读已提交”，则可能会出现重复读。这种模式解决了脏读的问题，因为只读取已提交的数据。

可能的写操作有：

更新
插入
删除

当读操作读取一组行时，就会发生不可重复读，然后重复该操作，返回同一组行（相同的键），但返回非键数据已经改变了。

例如：

连接 A 可以从表中读取一组行。这些行是根据某些选择标准（例如 where 子句）选择的。
连接 B 可能会更改该组中的一行或多行，以及其他行。
如果连接 A 重复相同的读取查询，则将返回相同的行集，但不属于“键”的数据可能会发生变化。改变了。
“键”由 where 或其他过滤器子句中的选择标准定义。

幻读

幻读是不可重复读的扩展。 插入或删除操作可能会更改返回行集中返回的行。插入操作可以将新行添加到返回的行集中。删除操作可能会执行相反的操作，并从返回的行集中删除行。

总结一下：

连接 A 执行读取操作
连接 B 执行插入或删除操作
连接 A 执行相同的读取操作，导致返回一组不同的行。可能会出现新行。现有的行可能会消失。因此读《幻影》。

隔离级别

鉴于我们对潜在不良行为的理解，隔离级别的作用如下：

未提交的读取无法防止任何这些问题。然而，由于原子行操作，仍然存在基本级别的保护。
已提交读仅防止脏读。
可重复读可以防止脏读和不可重复读，但不能防止幻读。
可序列化可以防止上述所有情况。

更高级别的隔离需要“锁定”数据库中的更多数据以防止并发访问。这可能是不可取的，因为如果 DMBS 持有整个表的锁，则其他连接都无法修改该数据。这可能会导致需要访问数据库的其他进程挂起。

提交读通常是最明智的选择。它确保您（数据库管理员）只能看到已提交的数据（持久数据，而不是瞬态数据），并且不会导致其他进程挂起。

参考书目

进一步阅读：

维基百科 ACID 操作

MariaDB 中的隔离级别

Geeks for Geeks 隔离级别。（请注意，有些信息没有任何意义，例如对已提交读的解释，它指出这会导致提交读取的任何未提交的数据。这是不正确的，也没有意义。未提交的数据是仅通过显式提交操作提交。）

不可重复读取与幻像读取< /a>

There are other answers here, but they don't give any details about the underlying database architecture, which makes it difficult to understand why the transaction isolation levels function the way they do, and what problems are solved.

General Overview of Common Problems in Concurrent Environments

Database systems permit multiple concurrent connections. This leads to the same kinds of problems seen in other concurrent systems. For example, in multithreaded environments, mutexes prevent concurrent access to memory, thus solving the problem of race conditions which can lead to corrupt or invalid data.

In a similar way, because database systems permit concurrent CRUD operations (update, insert, delete, select), concurrent operations by multiple connections can lead to undesirable observed behaviour.

Note that the atomicity of database row operations prevents outright data corruption or inconsistency, so there is always a base level of transaction isolation enforced at all times.

For more information on this, see ACID. (Atomic, Consistent, Isolation, Durability.) The short explanation is that on a per-row basis, operations are atomic. This means prevents data corruption by preventing a situation whereby one connection would write part of a rows data, before another connection corrupted that data by partially writing its data to the same row. (This will be more intuitive to those familiar with multithreaded environments.)

The above described problem is analagous to problem seen in multithreaded programming whereby one thread begins writing to a block of memory, and then another thread comes and partially writes its data to the same block, before the first thread is finished. This results in inconsistent data.

It is important to understand the atomic nature of row operations first, because this already provides a base level of protection.

Types of Transaction Isolation Levels

We will look at the following Transaction Isolation levels, whcih are available in MariaDB and many other SQL database implementations.

We first need to know what the different isolation levels are:

Read uncommitted
Read committed
Repeatable read
Serializable

What problems are solved by the Transaction Isolation Levels?

Before explaining what these different options do, it is important to understand the problems solved by each of them. Here is a list of potential unwanted behaviours.

Dirty read
Non-repeatable read
Phantom read

Dirty Read:

Database operations are often grouped together into a transaction. This transaction is then either committed to the database as a group of operations, or a rollback is perfomed to discard the group of operations.

If one Connection starts a series of operations as a transaction, and then a second Connection begins reading data from the same table(s), the second Connection could either read the data from the database which has been committed, or it could also read the changes made as part of the open, and yet uncommitted, transaction.

This defines the difference between read committed and read uncommitted.

This is conceptually unusual, because it often doesn't make much sense to read uncommitted data. The whole point of transactions is to ensure data in the database does not change in this way.

To summarize:

Connection A opens a transaction and begins queueing up modifying (write) operations
Connection B opens in read-uncommitted mode, and reads data from the database
Connection A continues queueing up further modifications
If Connection B reads the data again, it will have changed
If Connection A performs a rollback, and then Connection B performs another read, the data read will have changed due to the rollback of the uncommitted data
This is known as a dirty read
This isn't a situation you usually have to worry about because you probably shouldn't be working in read-uncommitted mode as a general rule because it makes data appear as if transactions didn't exist, which is weird

Non-Repeatable Read

Given the above, a Non-Repeatable read may occur if the read operation mode is set to "Read Committed". This mode solves the problem of Dirty Read because only committed data is read.

The possible write operations are:

update
insert
delete

A Non-Repeatable Read occurs when a read operation reads a set of rows, and then the operation is repeated and the same set of rows (same keys) are returned, but the non-key data has changed.

For example:

Connection A may read a set of rows from a table. These rows are selected based on some selection criteria, for example a where clause.
Connection B may alter one or more of the rows in this set, as well as others
If Connection A repeats the same read query, then the same set of rows will be returned, but data which is not part of the "key" may have changed.
The "key" is defined by the selection criteria in the where, or other filter, clause.

Phantom Reads

Phantom Reads are an extension of the Non-Repeatable read. An insert or delete operation may change the rows returned in the set of returned rows. An insert operation may add new rows to the set of returned rows. A delete operation may do the opposite and remove rows from the set of returned rows.

To summarize:

Connection A perfoms a read operation
Connection B performs an insert, or delete, operation
Connection A performs the same read operation resulting in a different set of rows being returned. New rows may appear. Existing rows may dissappear. Hence "Phantom" Read.

Isolation Levels

Given our understanding of the potential undesirable behaviours, this is what the isolation levels do:

Read uncommitted does nothing to prevent any of these problems. However there is still a base level of protection due to atomic row operations.
Read committed prevents Dirty Reads only.
Repeatable Read prevents Dirty and Non-Repeatable Reads, but not Phantom Reads.
Serializable prevents all of the above.

Higher levels of isolation require more data in the database to be "locked" preventing concurrent access. This can be undesirable, because if a DMBS is holding a lock over a whole table, then no other connections can modify that data. This might cause other processes which need access to the database to hang.

Read Committed is typically the most sensible choice. It ensures that you, the Database Administrator, see only the committed data (data which is persistent, not transient) and it will not cause other processes to hang.

Bibliography

关于作者

没有你我更好

暂无简介

文章

27 人气

关注发私信

友情链接

文江博客

“已提交读”和“已提交读”之间的区别和“可重复读取”在 SQL Server 中

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（9）

可重复读取

已提交读

Repeatable Read

Read Committed

并发环境中常见问题的概述

事务隔离级别的类型

事务隔离级别解决了哪些问题？

脏读：

奇怪

幻读

隔离级别

参考书目

General Overview of Common Problems in Concurrent Environments

Types of Transaction Isolation Levels

What problems are solved by the Transaction Isolation Levels?

Dirty Read:

Non-Repeatable Read

Phantom Reads

Isolation Levels

Bibliography

关于作者

相关话题

热门标签

推荐作者

佚名

羁客

天天爱笑的徐老师

星

夏日落

隐诗

友情链接

“已提交读”和“已提交读”之间的区别和“可重复读取”在 SQL Server 中

如果你对这篇内容有疑问，欢迎到本站社区发帖提问 参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（9）

可重复读取

已提交读

Repeatable Read

Read Committed

并发环境中常见问题的概述

事务隔离级别的类型

事务隔离级别解决了哪些问题？

脏读：

奇怪

幻读

隔离级别

参考书目

General Overview of Common Problems in Concurrent Environments

Types of Transaction Isolation Levels

What problems are solved by the Transaction Isolation Levels?

Dirty Read:

Non-Repeatable Read

Phantom Reads

Isolation Levels

Bibliography

关于作者

相关话题

热门标签

推荐作者

佚名

羁客

天天爱笑的徐老师

星

夏日落

隐诗

友情链接

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。