Best way to test for a duplicate key in a database

Posted on 2024-07-23 18:32:10

This is more of a correctness question. Say I have a table with a primary key column in my database. In my DAO code I have a function called insertRow(string key) that will return true if the key doesn't exist in the table and insert a new row with the key. Otherwise, if a row already exists with that key it returns false. Is it better/worse to have insertRow first check for the existence of the key or just go ahead and do the insert and catch the duplicate key error? Or is saving on a single select statement too trivial an optimization to even bother worrying about?

So in pseudocode:

boolean insertRow(String key){
    // potentially a select + insert
    if((select count(*) from mytable where key = "somekey") == 0){
        insert into mytable values("somekey")
        return true;
    }
    return false;
}

or

boolean insertRow(String key){
    try{
        // always just 1 insert
        insert into mytable values("somekey")
        return true;
    } catch (DuplicateKeyException ex){}
    return false;
}

_畞蕅 2024-07-30 18:32:10

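Insert the row and catch the duplicate key error. That is my personal choice.

I think this will probably perform better, depending on the cost of throwing an exception versus the cost of hitting the database twice.

Only by testing both scenarios can you know for sure.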
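One way to run that comparison is sketched below. It is only an illustration under stated assumptions: Python's built-in sqlite3 module with an in-memory table stands in for the real database, and the function names and the 50% duplicate workload are made up for the example. Timings from SQLite will not transfer to a networked database, so treat this as a harness shape rather than a result.

```python
import sqlite3
import time


def make_db():
    # Assumption: an in-memory SQLite table stands in for the asker's `mytable`.
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE mytable (key TEXT PRIMARY KEY)")
    return conn


def insert_check_first(conn, key):
    # Strategy 1: SELECT first, INSERT only if the key is absent.
    found = conn.execute("SELECT 1 FROM mytable WHERE key = ?", (key,)).fetchone()
    if found is not None:
        return False
    conn.execute("INSERT INTO mytable (key) VALUES (?)", (key,))
    return True


def insert_and_catch(conn, key):
    # Strategy 2: always INSERT and treat a constraint violation as "duplicate".
    try:
        conn.execute("INSERT INTO mytable (key) VALUES (?)", (key,))
        return True
    except sqlite3.IntegrityError:
        return False


def time_strategy(strategy, keys):
    # Returns (elapsed seconds, number of rows actually inserted).
    conn = make_db()
    start = time.perf_counter()
    inserted = sum(strategy(conn, k) for k in keys)
    elapsed = time.perf_counter() - start
    conn.close()
    return elapsed, inserted


if __name__ == "__main__":
    # Hypothetical workload: 10,000 attempts, half of them duplicates.
    keys = [f"key{i % 5000}" for i in range(10_000)]
    for strategy in (insert_check_first, insert_and_catch):
        elapsed, inserted = time_strategy(strategy, keys)
        print(f"{strategy.__name__}: {elapsed:.4f}s, {inserted} rows inserted")
```

The share of duplicates in the workload is the knob worth varying, since it decides how often the exception path is taken.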

辞取 2024-07-30 18:32:10

Try the insert, then catch the error.

Otherwise, you could still have a concurrency issue between two active SPIDs (let's say two web users on the system at the same time), in which case you'd have to catch the error anyway:

User1: Check for key "newkey"? Not in database.
User2: Check for key "newkey"? Not in database.
User1: Insert key "newkey". Success.
User2: Insert key "newkey". Duplicate Key Error.

You can mitigate this by using explicit transactions or setting the transaction isolation level, but it's just easier to use the second technique, unless you are sure only one application thread is running against the database at all times.
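The interleaving above can be replayed deterministically. The sketch below is an illustration only: it assumes SQLite through Python's sqlite3 module in place of the real server, uses two connections to a temporary database file to play the two users, and the helper names (`key_exists`, `replay_race`, `run_demo`) are invented for the example.

```python
import os
import sqlite3
import tempfile


def key_exists(conn, key):
    row = conn.execute("SELECT 1 FROM mytable WHERE key = ?", (key,)).fetchone()
    return row is not None


def replay_race(db_path):
    # isolation_level=None puts sqlite3 in autocommit mode, so each statement
    # is visible to the other connection as soon as it finishes.
    user1 = sqlite3.connect(db_path, isolation_level=None)
    user2 = sqlite3.connect(db_path, isolation_level=None)
    log = []
    try:
        user1.execute("CREATE TABLE mytable (key TEXT PRIMARY KEY)")
        # Both users check first, and both see that the key is absent.
        log.append(("user1 sees key", key_exists(user1, "newkey")))
        log.append(("user2 sees key", key_exists(user2, "newkey")))
        # User1 inserts first and succeeds.
        user1.execute("INSERT INTO mytable (key) VALUES (?)", ("newkey",))
        log.append(("user1 insert", "success"))
        # User2 passed the same check, yet its insert still fails.
        try:
            user2.execute("INSERT INTO mytable (key) VALUES (?)", ("newkey",))
            log.append(("user2 insert", "success"))
        except sqlite3.IntegrityError:
            log.append(("user2 insert", "duplicate key error"))
    finally:
        user1.close()
        user2.close()
    return log


def run_demo():
    # A temporary file is used so that both connections share one database.
    with tempfile.TemporaryDirectory() as tmp:
        return replay_race(os.path.join(tmp, "race.db"))


if __name__ == "__main__":
    for step in run_demo():
        print(step)
```

The point of the replay is that the existence check does not remove the need for the error handler; it only adds a query in front of it.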

〃温暖了心ぐ 2024-07-30 18:32:10

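In my opinion, this is a good case for using exceptions (because a duplicate is exceptional), unless you expect that, most of the time, a row will already be there (i.e., you are really doing "insert, but update if it exists").

If the purpose of the code is to update, then you should use a select, or an INSERT ... ON DUPLICATE KEY UPDATE clause (if your database engine supports it), or create a stored procedure to handle this logic for you.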
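As a rough sketch of the "insert, but update" case: the example below assumes SQLite (3.24 or later) through Python's sqlite3 module, where the equivalent clause is spelled ON CONFLICT ... DO UPDATE rather than MySQL's ON DUPLICATE KEY UPDATE. The `value` column and the `upsert` and `demo` helpers are hypothetical and exist only for the illustration.

```python
import sqlite3


def upsert(conn, key, value):
    # SQLite's upsert clause; `excluded` refers to the row that failed to insert.
    conn.execute(
        "INSERT INTO mytable (key, value) VALUES (?, ?) "
        "ON CONFLICT(key) DO UPDATE SET value = excluded.value",
        (key, value),
    )


def demo():
    conn = sqlite3.connect(":memory:")
    # Assumption: a second column `value` is added so there is something to update.
    conn.execute("CREATE TABLE mytable (key TEXT PRIMARY KEY, value TEXT)")
    upsert(conn, "somekey", "first")   # no row yet, so this inserts
    upsert(conn, "somekey", "second")  # key exists, so this updates in place
    rows = conn.execute("SELECT key, value FROM mytable").fetchall()
    conn.close()
    return rows


if __name__ == "__main__":
    print(demo())
```

Because the insert and the update happen in one statement, there is no gap between a check and a write for another session to slip into.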

小嗲 2024-07-30 18:32:10

The second, because the first option hits the database twice while the second hits it only once.

ぽ尐不点ル 2024-07-30 18:32:10

The short answer is that you need to test it for yourself. My gut feeling is that doing a small select to check for existence will perform better, but you need to verify that for yourself at volume and see which performs better.

In general, I don't like to leave my error checking entirely to the exception engine of whatever it is I'm doing. In other words, if I can check to see if what I'm doing is valid rather than just having an exception thrown, that's generally what I do.

I would suggest, however, using an EXISTS query rather than count(*):

if(exists (select 1 from mytable where key = "somekey"))
    return false
else
    insert the row

All that being said (from an abstract, engine-neutral perspective), I'm pretty sure that MySQL has some keywords that can be used to insert a row into a table only if the primary key doesn't exist. This may be your best bet, assuming you're OK with using MySQL-specific keywords.

Another option would be to place the logic entirely in the SQL statement.
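A minimal sketch of placing the logic entirely in the SQL statement, assuming SQLite via Python's sqlite3 module as a stand-in engine: the existence test moves into the INSERT itself as a NOT EXISTS subquery, and the affected row count tells the caller which branch ran. The name `insert_if_absent` is made up for the example. Depending on the engine and isolation level, concurrent sessions may still hit the primary key constraint with this form, so an error handler remains a sensible safety net.

```python
import sqlite3


def insert_if_absent(conn, key):
    # One statement: the row is inserted only when no row with this key exists.
    cur = conn.execute(
        "INSERT INTO mytable (key) "
        "SELECT ? WHERE NOT EXISTS (SELECT 1 FROM mytable WHERE key = ?)",
        (key, key),
    )
    # rowcount is 1 when a row was inserted and 0 when the key was already there.
    return cur.rowcount == 1


def demo_insert_if_absent():
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE mytable (key TEXT PRIMARY KEY)")
    first = insert_if_absent(conn, "somekey")
    second = insert_if_absent(conn, "somekey")
    total = conn.execute("SELECT count(*) FROM mytable").fetchone()[0]
    conn.close()
    return first, second, total


if __name__ == "__main__":
    print(demo_insert_if_absent())
```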

断舍离 2024-07-30 18:32:10

Another two options in MySQL are to use

insert ignore into....

and

insert into .... on duplicate key update field=value

including on duplicate key update field=field

See: http://dev.mysql.com/doc/refman/5.0/en/insert.html

Edit:
You can check affected_rows to determine whether the insert had an effect.
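The same idea can be sketched outside MySQL. The example below is an assumption-laden illustration: it uses SQLite through Python's sqlite3 module, where the closest spelling is INSERT OR IGNORE and the affected row count is exposed as `cursor.rowcount`. The `insert_row` and `demo_insert_ignore` names are hypothetical.

```python
import sqlite3


def insert_row(conn, key):
    # A duplicate key is silently skipped instead of raising an error.
    cur = conn.execute("INSERT OR IGNORE INTO mytable (key) VALUES (?)", (key,))
    # rowcount plays the role of affected_rows: 1 if inserted, 0 if ignored.
    return cur.rowcount == 1


def demo_insert_ignore():
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE mytable (key TEXT PRIMARY KEY)")
    results = [insert_row(conn, k) for k in ("somekey", "somekey", "otherkey")]
    conn.close()
    return results


if __name__ == "__main__":
    print(demo_insert_ignore())
```

This keeps the boolean contract of insertRow from the question without a separate select and without an exception handler.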

黑凤梨 2024-07-30 18:32:10

Now that I have found Martin Fowler's book online: a nice approach is to use a key table. See page 222 for more information.

About the author

护你周全
