Minimax with Alpha-Beta pruning: class variables or sending them through the recursion?

Posted 2024-12-17 12:58:53

When using Minimax with Alpha-Beta pruning, is it possible to have alpha and beta as class variables instead of sending them through the recursion?

Instead of:

private ValuedMove AlphaBetaSearch(Board state)
{
    return MaxValue(state, 0, int.MinValue, int.MaxValue);
}

private ValuedMove MaxValue(Board state, int d, int alpha, int beta)
{
    if (d == depth || state.GameRunning == false)
        return new ValuedMove(Heuristic.BoardValue(state, Player));

    ValuedMove v = new ValuedMove(int.MinValue);
    foreach (Move move in LegalMoves)
    {
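        ValuedMove minCheck = MinValue(state.ImagineMove(move), d + 1, alpha, beta);
        // Keep the best reply found so far (this update was missing from the sketch;
        // recording which move produced it is left out, as in the original).
        if (minCheck.Value > v.Value)
            v = minCheck;
        if (v.Value >= beta)
            return v; // beta cutoff: the minimizing parent will never allow this line
        alpha = Max(alpha, v.Value);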
    }

    return v;
}

private ValuedMove MinValue(Board state, int d, int alpha, int beta)
{
    //Minimax and Alpha-Beta logic here
}

Can I write:

int alpha, beta;

private ValuedMove AlphaBetaSearch(Board state)
{
    alpha = int.MinValue;
    beta = int.MaxValue;
    return MaxValue(state, 0);
}

private ValuedMove MaxValue(Board state, int d)
{
    //Minimax and Alpha-Beta logic here
}

private ValuedMove MinValue(Board state, int d)
{
    //Minimax and Alpha-Beta logic here
}

I am asking because when I tried to optimize the code this way (my thought was that if I didn't need to pass the ints into each recursive call, I might be able to shave off a little time), my chess player suddenly became an idiot, sacrificing his queen to capture a pawn and making other silly mistakes.

He consistently performs much worse than his "regular Alpha-Beta" opponent, which I guess is because he searches only a small percentage of the tree compared to his opponent (they both use the same depth, but the modified player seems to prune more aggressively, thereby reducing the number of nodes visited). I have done this twice now, to make sure, and I did not change anything other than what I sketch out here.

If I have understood the Alpha-Beta algorithm correctly, this shouldn't make any difference, but for my chess player, it does. Am I doing anything wrong?

So, my main question now is not whether this is a good thing to do in terms of optimization or coding practice, but rather whether it should be possible at all.
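To make the difference between the two variants concrete, here is a minimal sketch on a toy tree. It is not the chess engine from the question: the `Node` class, the `AlphaBetaDemo` class, the `shared*` names and the leaf values are all hypothetical, chosen only for illustration. It runs the same search once with alpha and beta passed by value and once with them stored in shared fields, on the assumption that the field-based version simply reads and writes the fields where the parameter-based version reads and writes its local copies.

```csharp
using System;

// Hypothetical toy tree; not the Board/ValuedMove types from the question.
sealed class Node
{
    public readonly int Value;        // meaningful only at leaves
    public readonly Node[] Children;  // empty at leaves
    public Node(int value) { Value = value; Children = Array.Empty<Node>(); }
    public Node(params Node[] children) { Children = children; }
}

static class AlphaBetaDemo
{
    // Variant 1: bounds are parameters, so every call tightens its own copy.
    public static int MaxValue(Node n, int alpha, int beta)
    {
        if (n.Children.Length == 0) return n.Value;
        int v = int.MinValue;
        foreach (Node child in n.Children)
        {
            v = Math.Max(v, MinValue(child, alpha, beta));
            if (v >= beta) return v;
            alpha = Math.Max(alpha, v);
        }
        return v;
    }

    public static int MinValue(Node n, int alpha, int beta)
    {
        if (n.Children.Length == 0) return n.Value;
        int v = int.MaxValue;
        foreach (Node child in n.Children)
        {
            v = Math.Min(v, MaxValue(child, alpha, beta));
            if (v <= alpha) return v;
            beta = Math.Min(beta, v);
        }
        return v;
    }

    // Variant 2: bounds are shared fields, so a bound tightened inside one
    // subtree is still in effect after that subtree returns.
    static int sharedAlpha, sharedBeta;

    public static int SharedSearch(Node root)
    {
        sharedAlpha = int.MinValue;
        sharedBeta = int.MaxValue;
        return SharedMax(root);
    }

    static int SharedMax(Node n)
    {
        if (n.Children.Length == 0) return n.Value;
        int v = int.MinValue;
        foreach (Node child in n.Children)
        {
            v = Math.Max(v, SharedMin(child));
            if (v >= sharedBeta) return v;
            sharedAlpha = Math.Max(sharedAlpha, v);
        }
        return v;
    }

    static int SharedMin(Node n)
    {
        if (n.Children.Length == 0) return n.Value;
        int v = int.MaxValue;
        foreach (Node child in n.Children)
        {
            v = Math.Min(v, SharedMax(child));
            if (v <= sharedAlpha) return v;
            sharedBeta = Math.Min(sharedBeta, v);
        }
        return v;
    }

    // MAX root over two MIN nodes: min(3, 5) = 3 and min(10, 12) = 10.
    public static Node BuildTree() =>
        new Node(new Node(new Node(3), new Node(5)),
                 new Node(new Node(10), new Node(12)));

    static void Main()
    {
        Node root = BuildTree();
        Console.WriteLine(MaxValue(root, int.MinValue, int.MaxValue)); // prints 10
        Console.WriteLine(SharedSearch(root));                         // prints 3
    }
}
```

In this sketch the first MIN node lowers the shared beta to 3, and that value is still in place when control returns to the root, so the root cuts off before looking at its second child and reports 3 instead of the true minimax value 10. That is consistent with the over-aggressive pruning described above, although whether it is the exact cause in the original engine depends on code not shown here.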
