将双精度型转换为整数以提高速度

发布于 2024-09-01 19:34:09 字数 1060 浏览 4 评论 0原文

在 Redis (http://code.google.com/p/redis) 中，有分数关联到元素，以便对该元素进行排序。即使许多用户实际上按整数排序（例如 unix 时间），该分数也是双精度的。

当数据库保存时我们需要将这个双打ok写入磁盘。这是当前使用的：

  snprintf((char*)buf+1,sizeof(buf)-1,"%.17g",val);

另外检查无穷大和非数字条件，以便也在最终的数据库文件中表示这一点。

不幸的是，将双精度数转换为字符串表示形式非常慢。虽然 Redis 中有一个函数可以更快地将整数转换为字符串表示形式。所以我的想法是检查是否可以将双精度数转换为整数而不丢失数据，然后使用该函数将整数转换为字符串（如果是这样）。

为了提供良好的加速，当然整数“等价性”的测试必须很快。所以我使用了一个可能是未定义行为的技巧，但在实践中效果很好。类似这样的：

double x = ... some value ...
if (x == (double)((long long)x))
    use_the_fast_integer_function((long long)x);
else
    use_the_slow_snprintf(x);

在我的推理中，上面的双精度转换将双精度转换为长整型，然后再转换回整数。如果范围合适，并且没有小数部分，则该数字将在转换后保留下来，并且与初始数字完全相同。

因为我想确保这不会破坏某些系统中的东西，所以我在 freenode 上加入了#c，但我受到了很多侮辱；）所以我现在在这里尝试。

有没有一种标准方法可以在不超出 ANSI C 的情况下完成我想做的事情？否则，上面的代码是否应该在当前 Redis 目标的所有 Posix 系统中工作？也就是说，现在运行 Linux / Mac OS X / *BSD / Solaris 的拱门？

为了使代码更清晰，我可以添加的是在尝试强制转换之前对双精度数的范围进行显式检查。

感谢您的帮助。

原文

in Redis (http://code.google.com/p/redis) there are scores associated to elements, in order to take this elements sorted. This scores are doubles, even if many users actually sort by integers (for instance unix times).

When the database is saved we need to write this doubles ok disk. This is what is used currently:

  snprintf((char*)buf+1,sizeof(buf)-1,"%.17g",val);

Additionally infinity and not-a-number conditions are checked in order to also represent this in the final database file.

Unfortunately converting a double into the string representation is pretty slow. While we have a function in Redis that converts an integer into a string representation in a much faster way. So my idea was to check if a double could be casted into an integer without lost of data, and then using the function to turn the integer into a string if this is true.

For this to provide a good speedup of course the test for integer "equivalence" must be fast. So I used a trick that is probably undefined behavior but that worked very well in practice. Something like that:

double x = ... some value ...
if (x == (double)((long long)x))
    use_the_fast_integer_function((long long)x);
else
    use_the_slow_snprintf(x);

In my reasoning the double casting above converts the double into a long, and then back into an integer. If the range fits, and there is no decimal part, the number will survive the conversion and will be exactly the same as the initial number.

As I wanted to make sure this will not break things in some system, I joined #c on freenode and I got a lot of insults ;) So I'm now trying here.

Is there a standard way to do what I'm trying to do without going outside ANSI C? Otherwise, is the above code supposed to work in all the Posix systems that currently Redis targets? That is, archs where Linux / Mac OS X / *BSD / Solaris are running nowaday?

What I can add in order to make the code saner is an explicit check for the range of the double before trying the cast at all.

Thank you for any help.

分享到QQ

分享到微博