Preventing race conditions with if-exists-update-else-insert in Entity Framework

Posted 2024-11-10 13:00:24

I've been reading other questions on how to implement if-exists-update-else-insert semantics in EF, but either I'm not understanding how the answers work, or they do not in fact address the issue. A common solution offered is to wrap the work in a transaction scope (e.g. "Implementing if-not-exists-insert using Entity Framework without race conditions"):

using (var scope = new TransactionScope()) // default isolation level is serializable
using (var context = new MyEntities())
{
    var user = context.Users.SingleOrDefault(u => u.Id == userId); // *
    if (user != null)
    {
        // update the user
        user.property = newProperty;
        context.SaveChanges();
    }
    else
    {
        user = new User
        {
             // etc
        };
        context.Users.AddObject(user);
        context.SaveChanges();
    }

    // Without this call the transaction is rolled back when the scope is disposed
    scope.Complete();
}

But I fail to see how this solves anything. For this to work, the line I have starred above would have to block when a second thread tries to access the same user ID, and unblock only once the first thread has finished its work. Using a transaction does not cause this, however, so an UpdateException is thrown because of the key violation that occurs when the second thread attempts to create the same user a second time.

Instead of catching the exception caused by the race condition, it would be better to prevent the race condition from happening in the first place. One way to do this would be for the starred line to take out an exclusive lock on the database row that matches its condition, meaning that in the context of this block, only one thread at a time could work with a user.

This must be a common problem for EF users, so I'm looking for a clean, generic solution that I can use everywhere.

I'd really like to avoid using a stored procedure to create my user if possible.

Any ideas?

EDIT: I tried executing the above code concurrently on two different threads using the same user ID, and despite taking out serializable transactions, they were both able to enter the critical section (*) concurrently. This led to an UpdateException being thrown when the second thread attempted to insert the same user ID that the first had just inserted. This is because, as pointed out by Ladislav below, a serializable transaction takes exclusive locks only once it starts modifying data, not when it reads.

卷耳 2024-11-17 13:00:24

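When you use a serializable transaction, SQL Server issues shared locks on the records and tables that are read. Shared locks do not allow other transactions to modify the locked data (those transactions will block), but they do allow other transactions to read the data until the transaction that issued the locks starts modifying it. That is why the example doesn't work: concurrent reads are allowed under shared locks until the first transaction starts modifying data.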

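What you want is isolation in which the select command locks the whole table exclusively for a single client. It must lock the whole table, because otherwise it does not solve concurrency for inserting "the same" record. Granular control over locking records or tables from select commands is possible with hints, but you must write direct SQL queries to use them; EF has no support for that. I have described an approach for exclusively locking the table elsewhere, but it amounts to creating sequential access to the table, and it affects every other client accessing that table.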
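As a rough illustration of what "direct SQL with hints" could look like, here is a minimal sketch that bypasses EF and uses plain ADO.NET. The table name `Users`, the columns `Id` and `Property`, and the `UserUpsert` class are assumptions made for the example, not something taken from the question. `TABLOCKX` and `HOLDLOCK` are SQL Server table hints; the first reader takes an exclusive table lock, so a second caller should block on its own existence check until the first transaction commits.

```csharp
using System.Data.SqlClient; // Microsoft.Data.SqlClient on newer stacks

public static class UserUpsert
{
    // Hypothetical schema: Users(Id int primary key, Property nvarchar(...)).
    public const string ExistsSql =
        "SELECT COUNT(*) FROM Users WITH (TABLOCKX, HOLDLOCK) WHERE Id = @id";
    public const string UpdateSql =
        "UPDATE Users SET Property = @property WHERE Id = @id";
    public const string InsertSql =
        "INSERT INTO Users (Id, Property) VALUES (@id, @property)";

    public static void Upsert(string connectionString, int userId, string newProperty)
    {
        using (var connection = new SqlConnection(connectionString))
        {
            connection.Open();
            using (var transaction = connection.BeginTransaction())
            {
                bool exists;
                // The hinted read takes the exclusive table lock for this transaction.
                using (var check = new SqlCommand(ExistsSql, connection, transaction))
                {
                    check.Parameters.AddWithValue("@id", userId);
                    exists = (int)check.ExecuteScalar() > 0;
                }

                // Safe to decide now: no other transaction can read or insert into Users.
                using (var write = new SqlCommand(exists ? UpdateSql : InsertSql, connection, transaction))
                {
                    write.Parameters.AddWithValue("@id", userId);
                    write.Parameters.AddWithValue("@property", newProperty);
                    write.ExecuteNonQuery();
                }

                transaction.Commit(); // releases the table lock
            }
        }
    }
}
```

The cost is the one described above: every client that touches the table queues behind the lock, so this only suits tables with low write contention.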

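If you are really sure that this operation happens only in your single method, and no other application uses your database, you can simply place the code in a critical section (.NET synchronization, for example with lock) and ensure on the .NET side that only a single thread can enter it. That is not a very reliable solution, but any playing with locks and transaction levels has a big impact on database performance and throughput. You can combine this approach with optimistic concurrency (unique constraints, timestamps, etc.).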
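A minimal sketch of the in-process critical section, assuming a single process is the only writer. `UserGate` and `Demo` are hypothetical names, and the dictionary is only a stand-in for the Users table so the example runs without a database; a real caller would put the SingleOrDefault / AddObject / SaveChanges sequence inside the delegate.

```csharp
using System;
using System.Collections.Generic;
using System.Threading.Tasks;

public static class UserGate
{
    // One lock per process: serializes every caller of RunExclusive.
    private static readonly object Sync = new object();

    public static void RunExclusive(Action checkThenWrite)
    {
        lock (Sync)
        {
            checkThenWrite();
        }
    }
}

public static class Demo
{
    // Many concurrent callers all try to upsert the same key.
    public static (int Inserts, int Updates) Run(int callers)
    {
        var users = new Dictionary<int, string>(); // stand-in for the Users table
        int inserts = 0, updates = 0;

        Parallel.For(0, callers, i =>
        {
            UserGate.RunExclusive(() =>
            {
                if (users.ContainsKey(42))
                {
                    users[42] = "updated by caller " + i;
                    updates++;
                }
                else
                {
                    users.Add(42, "created by caller " + i);
                    inserts++;
                }
            });
        });

        return (inserts, updates);
    }
}
```

Because the check and the write happen under the same lock, only the first caller to enter ever takes the insert branch. The lock gives no protection across processes or machines, which is why pairing it with a unique constraint is still worthwhile.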


梨涡 2024-11-17 13:00:24

Just to add my approach. It does not remove the annoyance of exceptions being thrown, and transactions still don't quite cut it as a scalable solution, but it does stop race conditions from causing problems where lock-based solutions are not possible (or not easily managed), such as in distributed systems.

I simply try the insert first and rely on the exception. Here is a modification of your original code as an example:

using (var context = new MyEntities())
{
    EntityEntry entityUser = null;
    try
    {
        var newUser = new User
        {
             // etc
        };
        entityUser = context.Users.Add(newUser);
        context.SaveChanges(); // Will throw if the entity already exists
    }
    catch (DbUpdateException x)
    when (x.InnerException != null && x.InnerException.Message.StartsWith("Cannot insert duplicate key row in object"))
    {
        if (entityUser != null)
        {
            // Detach the entity to stop it hanging around on the context
            entityUser.State = EntityState.Detached;
        }
        var user = context.Users.Find(userId);
        if (user != null) // just in case someone deleted it in the meantime
        {
            // update the user
            user.property = newProperty;
            context.SaveChanges();
        }
    }
}

It's not pretty, but it works and might be of use to someone.
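One possible refinement, offered as a sketch rather than as part of the answer above: matching on the message prefix depends on the server's language setting and only covers unique-index violations. On SQL Server, error number 2601 is the "Cannot insert duplicate key row" unique-index error and 2627 is the unique or primary key constraint violation, so checking `SqlException.Number` covers both. The `DuplicateKey` helper below is a hypothetical name, and it assumes the provider surfaces a `System.Data.SqlClient.SqlException` (newer EF Core versions use `Microsoft.Data.SqlClient` instead).

```csharp
using System;
using System.Data.SqlClient; // Microsoft.Data.SqlClient on newer stacks

public static class DuplicateKey
{
    public static bool IsDuplicateKeyNumber(int sqlErrorNumber)
    {
        // 2601: duplicate key row in a unique index
        // 2627: unique or primary key constraint violation
        return sqlErrorNumber == 2601 || sqlErrorNumber == 2627;
    }

    // Walks the InnerException chain looking for a SQL Server duplicate-key error.
    public static bool IsDuplicateKey(Exception ex)
    {
        for (var e = ex; e != null; e = e.InnerException)
        {
            if (e is SqlException sql && IsDuplicateKeyNumber(sql.Number))
            {
                return true;
            }
        }
        return false;
    }
}
```

With this helper the filter in the code above could be written as `catch (DbUpdateException x) when (DuplicateKey.IsDuplicateKey(x))`.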
