当前位置：文江博客话题详情

C++ 中的尾递归

发布于 2024-08-29 17:40:53 字数 83 浏览 7 评论 0原文

有人可以向我展示一个简单的 C++ 尾递归函数吗？

如果尾递归更好的话，为什么它更好呢？

除了尾递归之外，还有哪些递归？

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

芯好空 2024-09-05 17:40:53

一个简单的尾递归函数：

unsigned int f( unsigned int a ) {
   if ( a == 0 ) {
      return a;
   }
   return f( a - 1 );   // tail recursion
}

尾递归基本上是这样的：

只有一个递归调用，
该调用是函数中的最后一条语句

并且它不是“更好”，除非一个好的编译器可以删除递归，将其转换进入一个循环。这可能会更快，并且肯定会节省堆栈使用量。 GCC 编译器可以进行这种优化。

A simple tail recursive function:

unsigned int f( unsigned int a ) {
   if ( a == 0 ) {
      return a;
   }
   return f( a - 1 );   // tail recursion
}

Tail recursion is basically when:

there is only a single recursive call
that call is the last statement in the function

And it's not "better", except in the sense that a good compiler can remove the recursion, transforming it into a loop. This may be faster and will certainly save on stack usage. The GCC compiler can do this optimisation.

回复收藏 0 原文

孤独患者 2024-09-05 17:40:53

C++ 中的尾部递归看起来与 C 或任何其他语言相同。

void countdown( int count ) {
    if ( count ) return countdown( count - 1 );
}

尾递归（以及一般的尾调用）需要在执行尾调用之前清除调用者的堆栈帧。对于程序员来说，尾递归类似于循环，其中 return 简化为像 goto first_line; 一样工作。不过，编译器需要检测您正在做什么，如果没有检测到，仍然会有一个额外的堆栈帧。大多数编译器都支持它，但编写循环或 goto 通常更容易且风险更小。

非递归尾部调用可以启用随机分支（例如转到其他函数的第一行），这是一种更独特的功能。

请注意，在 C++ 中，return 语句的范围内不能有任何具有非平凡析构函数的对象。函数结束清理将要求被调用者返回到调用者，从而消除尾部调用。

另请注意（在任何语言中）尾递归要求算法的整个状态在每一步都通过函数参数列表传递。（这一点从要求在下一次调用开始之前消除函数的堆栈帧的要求中可以清楚地看出......您不能在局部变量中保存任何数据。）此外，在尾部返回之前不能对函数的返回值应用任何操作。

int factorial( int n, int acc = 1 ) {
    if ( n == 0 ) return acc;
    else return factorial( n-1, acc * n );
}

Tail recusion in C++ looks the same as C or any other language.

void countdown( int count ) {
    if ( count ) return countdown( count - 1 );
}

Tail recursion (and tail calling in general) requires clearing the caller's stack frame before executing the tail call. To the programmer, tail recursion is similar to a loop, with return reduced to working like goto first_line;. The compiler needs to detect what you are doing, though, and if it doesn't, there will still be an additional stack frame. Most compilers support it, but writing a loop or goto is usually easier and less risky.

Non-recursive tail calls can enable random branching (like goto to the first line of some other function), which is a more unique facility.

Note that in C++, there cannot be any object with a nontrivial destructor in the scope of the return statement. The end-of-function cleanup would require the callee to return back to the caller, eliminating the tail call.

Also note (in any language) that tail recursion requires the entire state of the algorithm to be passed through the function argument list at each step. (This is clear from the requirement that the function's stack frame be eliminated before the next call begins… you can't be saving any data in local variables.) Furthermore, no operation can be applied to the function's return value before it's tail-returned.

int factorial( int n, int acc = 1 ) {
    if ( n == 0 ) return acc;
    else return factorial( n-1, acc * n );
}

回复收藏 0 原文

白昼 2024-09-05 17:40:53

尾递归是尾调用的一种特殊情况。尾部调用是编译器可以看到从被调用函数返回时不需要执行的任何操作 - 本质上是将被调用函数的返回转换为它自己的返回。编译器通常可以执行一些堆栈修复操作，然后跳转（而不是调用）到被调用函数的第一条指令的地址。

除了消除一些返回调用之外，这样做的一大好处是您还可以减少堆栈的使用。在某些平台或操作系统代码中，堆栈可能非常有限，并且在高级计算机（例如台式机中的 x86 CPU）上，像这样减少堆栈使用将提高数据缓存性能。

尾递归是指被调用函数与调用函数相同。这个可以变成循环，和上面提到的尾调用优化中的跳转一模一样。由于这是同一个函数（被调用者和调用者），因此在跳转之前需要完成的堆栈修复较少。

下面显示了执行递归调用的常见方法，这对于编译器来说更难变成循环：

int sum(int a[], unsigned len) {
     if (len==0) {
         return 0;
     }
     return a[0] + sum(a+1,len-1);
}

这很简单，许多编译器可能无论如何都能弄清楚，但正如您所看到的，需要添加一个内容发生在被调用的 sum 返回一个数字之后，因此简单的尾部调用优化是不可能的。

如果您这样做：

static int sum_helper(int acc, unsigned len, int a[]) {
     if (len == 0) {
        return acc;
     }
     return sum_helper(acc+a[0], len-1, a+1);
}
int sum(int a[], unsigned len) {
     return sum_helper(0, len, a);
}

您将能够利用两个函数中的调用作为尾部调用。这里 sum 函数的主要工作是移动一个值并清除寄存器或堆栈位置。 sum_helper 完成所有的数学运算。

既然您在问题中提到了 C++，我将提到一些与此相关的特殊内容。
C++ 向您隐藏了一些 C 不会的东西。这些析构函数是阻碍尾部调用优化的主要因素。

int boo(yin * x, yang *y) {
    dharma z = x->foo() + y->bar();
    return z.baz();
}

在此示例中，对 baz 的调用实际上并不是尾部调用，因为 z 需要在 baz 返回后被销毁。我相信，即使在调用期间不需要变量的情况下，C++的规则也可能使优化变得更加困难，例如：

int boo(yin * x, yang *y) {
    dharma z = x->foo() + y->bar();
    int u = z.baz();
    return qwerty(u);
}

这里从qwerty返回后可能必须破坏z。

另一件事是隐式类型转换，这也可能发生在 C 中，但在 C++ 中可能更复杂和常见。
例如：

static double sum_helper(double acc, unsigned len, double a[]) {
     if (len == 0) {
        return acc;
     }
     return sum_helper(acc+a[0], len-1, a+1);
}
int sum(double a[], unsigned len) {
     return sum_helper(0.0, len, a);
}

这里 sum 对 sum_helper 的调用不是尾部调用，因为 sum_helper 返回一个 double 并且 sum 需要将其转换为 int 。

在 C++ 中，返回一个对象引用是很常见的，它可能有各种不同的解释，每种解释都可能是不同的类型转换，
例如：

bool write_it(int it) {
      return cout << it;
}

这里有一个对 cout.operator<< 的调用作为最后的陈述。 cout 将返回对其自身的引用（这就是为什么您可以将很多东西串在一起在一个由 << 分隔的列表中），然后强制将其计算为 bool，这最终会调用 cout 的另一个方法，运算符布尔（）。在这种情况下，这个 cout.operator bool() 可以作为尾调用来调用，但是operator<<不能。

编辑：

值得一提的是，C 中尾调用优化成为可能的一个主要原因是编译器知道被调用函数将其返回值存储在与调用函数必须确保其返回值相同的位置值存储在.

Tail recursion is a special case of a tail call. A tail call is where the compiler can see that there are no operations that need to be done upon return from a called function -- essentially turning the called function's return into it's own. The compiler can often do a few stack fix-up operations and then jump (rather than call) to the address of the first instruction of the called function.

One of the great things about this besides eliminating some return calls is that you also cut down on stack usage. On some platforms or in OS code the stack can be quite limited and on advanced machines like the x86 CPUs in our desktops decreasing the stack usage like this will improve data cache performance.

Tail recursion is where the called function is the same as the calling function. This can be turned into loops, which is exactly the same as the jump in the tail call optimization mentioned above. Since this is the same function (callee and caller) there are fewer stack fixups that need to be done before the jump.

The following shows a common way to do a recursive call which would be more difficult for a compiler to turn into a loop:

int sum(int a[], unsigned len) {
     if (len==0) {
         return 0;
     }
     return a[0] + sum(a+1,len-1);
}

This is simple enough that many compilers could probably figure it out anyway, but as you can see there is an addition that needs to happen after the return from the called sum returns a number, so a simple tail call optimization is not possible.

If you did:

static int sum_helper(int acc, unsigned len, int a[]) {
     if (len == 0) {
        return acc;
     }
     return sum_helper(acc+a[0], len-1, a+1);
}
int sum(int a[], unsigned len) {
     return sum_helper(0, len, a);
}

You would be able to take advantage of the calls in both functions being tail calls. Here the sum function's main job is to move a value and clear a register or stack position. The sum_helper does all of the math.

Since you mentioned C++ in your question I'll mention some special things about that.
C++ hides some things from you which C does not. Of these destructors are the main thing that will get in the way of tail call optimization.

int boo(yin * x, yang *y) {
    dharma z = x->foo() + y->bar();
    return z.baz();
}

In this example the call to baz is not really a tail call because z needs to be destructed after the return from baz. I believe that the rules of C++ may make the optimization more difficult even in cases where the variable is not needed for the duration of the call, such as:

int boo(yin * x, yang *y) {
    dharma z = x->foo() + y->bar();
    int u = z.baz();
    return qwerty(u);
}

z may have to be destructed after the return from qwerty here.

Another thing would be implicit type conversion, which can happen in C as well, but can more complicated and common in C++.
For instance:

static double sum_helper(double acc, unsigned len, double a[]) {
     if (len == 0) {
        return acc;
     }
     return sum_helper(acc+a[0], len-1, a+1);
}
int sum(double a[], unsigned len) {
     return sum_helper(0.0, len, a);
}

Here sum's call to sum_helper is not a tail call because sum_helper returns a double and sum will need to convert that into an int.

In C++ it is quite common to return an object reference which may have all kinds of different interpretations, each of which could be a different type conversion,
For instance:

bool write_it(int it) {
      return cout << it;
}

Here there is a call made to cout.operator<< as the last statement. cout will return a reference to itself (which is why you can string lots of things together in a list separated by << ), which you then force to be evaluated as a bool, which ends up calling another of cout's methods, operator bool(). This cout.operator bool() could be called as a tail call in this case, but operator<< could not.

EDIT:

One thing that is worth mentioning is that a major reason that tail call optimization in C is possible is that the compiler knows that the called function will store it's return value in the same place as the calling function would have to ensure that its return value is stored in.

回复收藏 0 原文

静谧 2024-09-05 17:40:53

尾递归是一种同时处理两个问题的技巧。第一种是当很难知道要执行的迭代次数时执行循环。

虽然这可以通过简单的递归来解决，但出现了第二个问题，即由于递归调用执行次数过多而导致堆栈溢出。当与“计算和进位”技术相结合时，尾部调用就是解决方案。

在基础计算机科学中，您会了解到计算机算法需要具有不变量和终止条件。这是构建尾递归的基础。

所有计算都发生在参数传递中。
所有结果都必须传递给函数调用。
尾部调用是最后一次调用，发生在终止时。

简而言之，函数的返回值不得进行任何计算。

以计算 10 的幂为例，这很简单，可以用循环编写。

应该看起来像

template<typename T> T pow10(T const p, T const res =1)
{
return p ? res: pow10(--p,10*res);
}

这样给出了一个执行，例如 4:

ret,p,res

-,4,1
-,3,10
-,2,100
-,1,1000
-,0,10000
10000,-,-

很明显，编译器只需在不更改堆栈指针的情况下以及发生尾调用时复制值即可只是为了返回结果。

尾递归非常重要，因为它可以提供现成的编译时评估，例如上面的可以实现。

template<int N,int R=1> struct powc10
{
int operator()() const
{
return  powc10<N-1, 10*R>()();
}
};

template<int R> struct powc10<0,R>
{
    
int operator()() const
{
return  R;
}

};

这可以用作 powc10<10>()() 在编译时计算 10 次方。

大多数编译器都有嵌套调用的限制，因此尾调用技巧会有所帮助。显然，没有元编程循环，因此必须使用递归。

Tail recursion is a trick to actually cope with two issues at the same time. The first is executing a loop when it is hard to know the number of iterations to do.

Though this can be worked out with simple recursion, the second problem arises which is that of stack overflow due to the recursive call being executed too many times. The tail call is the solution, when accompanied by a "compute and carry" technique.

In basic CS you learn that a computer algorithm needs to have an invariant and a termination condition. This is the base for building the tail recursion.

All computation happens in the argument passing.
All results must be passed onto function calls.
The tail call is the last call, and occurs at termination.

To simply put it, no computation must happen on the return value of your function .

Take for example the computation of a power of 10, which is trivial and can be written by a loop.

Should look something like

template<typename T> T pow10(T const p, T const res =1)
{
return p ? res: pow10(--p,10*res);
}

This gives an execution, e.g 4:

ret,p,res

-,4,1
-,3,10
-,2,100
-,1,1000
-,0,10000
10000,-,-

It is clear that the compiler just has to copy values without changing the stack pointer and when the tail call happens just to return the result.

Tail recursion is very important because it can provide ready made compile time evaluations, e.g. The above can be made to be.

template<int N,int R=1> struct powc10
{
int operator()() const
{
return  powc10<N-1, 10*R>()();
}
};

template<int R> struct powc10<0,R>
{
    
int operator()() const
{
return  R;
}

};

this can be used as powc10<10>()() to compute the 10th power at compile time.

Most compilers have a limit of nested calls so the tail call trick helps. Evidently,there are no meta programming loops, so have to use recursion.

回复收藏 0 原文