MySQL 和 PHP：执行两个后续查询的 PHP 代码块的原子性和可重入性 - 有多危险？

发布于 2024-09-06 11:02:16 字数 2394 浏览 9 评论 0原文

在 MySQL 中，我必须检查 select 查询是否返回任何记录，如果没有，我插入一条记录。但我担心 PHP 脚本中的整个 if-else 操作并不像我希望的那样原子，即在某些情况下会中断，例如，如果在需要处理相同记录的情况下调用脚本的另一个实例：

if(select returns at least one record)
{
    update record;
}
else
{
    insert record;
}

我这里没有使用事务，并且自动提交已打开。我正在使用 MySQL 5.1 和 PHP 5.3。该表是InnoDB。我想知道上面的代码是否不是最优的并且确实会崩溃。我的意思是两个实例重新输入相同的脚本，并发生以下查询序列：

实例 1 尝试选择记录，未找到任何记录，进入插入查询块
实例 2 尝试选择记录，未找到任何记录，进入该块对于插入查询，
实例 1 尝试插入记录，成功，
实例 2 尝试插入记录，失败，自动中止脚本

这意味着实例 2 将中止并返回错误，跳过插入查询语句后面的任何内容。我可以使错误不致命，但我不喜欢忽略错误，我更想知道我的恐惧是否真实存在。

更新：我最终做了什么（这样可以吗？）

相关表有助于限制（实际上是允许/拒绝）应用程序发送给每个收件人的消息量。系统不应在 Z 周期内向接收者 Y 发送多于 X 条消息。该表[概念上]如下：

create table throttle
(
    recipient_id integer unsigned unique not null,
    send_count integer unsigned not null default 1,
    period_ts timestamp default current_timestamp,
    primary key (recipient_id)
) engine=InnoDB;

[稍微简化/概念上的] PHP 代码块应该执行原子事务，以维护表中的正确数据，并根据节流状态允许/拒绝发送消息：

function send_message_throttled($recipient_id) /// The 'Y' variable
{
    query('begin');

    query("select send_count, unix_timestamp(period_ts) from throttle where recipient_id = $recipient_id for update");

    $r = query_result_row();

    if($r)
    {
        if(time() >= $r[1] + 60 * 60 * 24) /// The numeric offset is the length of the period, the 'Z' variable
        {/// new period
            query("update throttle set send_count = 1, period_ts = current_timestamp where recipient_id = $recipient_id");
        }
        else
        {
            if($r[0] < 5) /// Amount of messages allowed per period, the 'X' variable
            {
                query("update throttle set send_count = send_count + 1 where recipient_id = $recipient_id");
            }
            else
            {
                trigger_error('Will not send message, throttled down.', E_USER_WARNING);
                query('rollback');
                return 1;
            }
        }
    }
    else
    {
        query("insert into throttle(recipient_id) values($recipient_id)");
    }

    if(failed(send_message($recipient_id)))
    {
        query('rollback');
        return 2;
    }

    query('commit');
}

好吧，忽略 InnoDB 死锁发生的事实，这很好不是吗？我并不是在拍胸脯或做任何事情，但这只是我能做的性能/稳定性的最佳组合，缺少使用 MyISAM 并锁定整个表，我不想这样做，因为与更新/插入相比，我不想这样做选择。

原文

In MySQL I have to check whether select query has returned any records, if not I insert a record. I am afraid though that the whole if-else operation in PHP scripts is NOT as atomic as I would like, i.e. will break in some scenarios, for example if another instance of the script is called where the same record needs to be worked with:

if(select returns at least one record)
{
    update record;
}
else
{
    insert record;
}

I did not use transactions here, and autocommit is on. I am using MySQL 5.1 with PHP 5.3. The table is InnoDB. I would like to know if the code above is suboptimal and indeed will break. I mean the same script is re-entered by two instances and the following query sequence occurs:

instance 1 attempts to select the record, finds none, enters the block for insert query
instance 2 attempts to select the record, finds none, enters the block for insert query
instance 1 attempts to insert the record, succeeds
instance 2 attempts to insert the record, fails, aborts the script automatically

Meaning that instance 2 will abort and return an error, skipping anything following the insert query statement. I could make the error not fatal, but I don't like ignoring errors, I would much rather know if my fears are real here.

Update: What I ended up doing (is this ok for SO?)

The table in question assists in a throttling (allow/deny, really) amount of messages the application sends to each recipient. The system should not send more than X messages to a recipient Y within a period Z. The table is [conceptually] as follows:

create table throttle
(
    recipient_id integer unsigned unique not null,
    send_count integer unsigned not null default 1,
    period_ts timestamp default current_timestamp,
    primary key (recipient_id)
) engine=InnoDB;

And the block of [somewhat simplified/conceptual] PHP code that is supposed to do an atomic transaction that maintains the right data in the table, and allows/denies sending message depending on the throttle state:

function send_message_throttled($recipient_id) /// The 'Y' variable
{
    query('begin');

    query("select send_count, unix_timestamp(period_ts) from throttle where recipient_id = $recipient_id for update");

    $r = query_result_row();

    if($r)
    {
        if(time() >= $r[1] + 60 * 60 * 24) /// The numeric offset is the length of the period, the 'Z' variable
        {/// new period
            query("update throttle set send_count = 1, period_ts = current_timestamp where recipient_id = $recipient_id");
        }
        else
        {
            if($r[0] < 5) /// Amount of messages allowed per period, the 'X' variable
            {
                query("update throttle set send_count = send_count + 1 where recipient_id = $recipient_id");
            }
            else
            {
                trigger_error('Will not send message, throttled down.', E_USER_WARNING);
                query('rollback');
                return 1;
            }
        }
    }
    else
    {
        query("insert into throttle(recipient_id) values($recipient_id)");
    }

    if(failed(send_message($recipient_id)))
    {
        query('rollback');
        return 2;
    }

    query('commit');
}

Well, disregarding the fact that InnoDB deadlocks occur, this is pretty good no? I am not pounding my chest or anything, but this is simply the best mix of performance/stability I can do, short of going with MyISAM and locking entire table, which I don't want to do because of more frequent updates/inserts vs selects.

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

み青杉依旧 2024-09-13 11:02:16

看起来您已经知道问题的答案以及如何解决您的问题。这是一个真正的问题，您可以使用以下方法之一来解决它：

SELECT ... FOR UPDATE
INSERT ... ON DUPLICATE KEY UPDATE
事务（不要使用 MyIsam）
表锁

回复收藏 0 原文

旧时模样 2024-09-13 11:02:16

这种情况可能和可能发生取决于此页面的执行频率。

安全的选择是使用交易。您写的同样的事情仍然会发生，除了您可以安全地检查事务内的错误（如果插入涉及多个查询，并且只有最后一个插入中断），允许您回滚该错误变得无效。

所以（伪）：

#start transaction
if (select returns at least one record)
{
    update record;
}
else
{
    insert record;
}

if (no constraint errors)
{
    commit; //ends transaction
}
else
{
    rollback; //ends transaction
}

您也可以锁定表，但根据您正在执行的工作，您必须获得整个表的独占锁（您不能SELECT ... FOR UPDATE 不存在的行，抱歉）但这也会阻止从表中读取数据，直到完成为止。

This can and might happen depending on how often this page is executed.

The safe bet would be to use transactions. The same thing you wrote would still happen, except that you could safely check for the error inside the transaction (In case the insertion involves several queries, and only the last insert breaks) allowing you to rollback the one that became invalid.

So (pseudo):

#start transaction
if (select returns at least one record)
{
    update record;
}
else
{
    insert record;
}

if (no constraint errors)
{
    commit; //ends transaction
}
else
{
    rollback; //ends transaction
}

You could lock the table as well but depending on the work you're doing, you'd have to get an exclusive lock on the entire table (you cannot SELECT ... FOR UPDATE non-existing rows, sorry) but that would also block reads from your table until you are finished.

回复收藏 0 原文

~没有更多了~