phpass's custom base-64 encoder: does it have a name, or any advantage over Base64?

Posted 2024-12-04 22:55:52

phpass uses a strange (to me) algorithm in encode64() for its base-64 encoding. Base64 and Uuencode take the input bits in order, 6 at a time, and map each 6-bit group to a printable character. encode64 shuffles the bits around:

input bit location:    abcdefgh ijklmnop qrstuvwx
base64 bit location:   ..abcdef ..ghijkl ..mnopqr ..stuvwx
encode64 bit location: ..cdefgh ..mnopab ..wxijkl ..qrstuv

Is this algorithm commonly known? And besides backward compatibility, why choose it over Base64?

Below I've rewritten it to clarify the algorithm:

function encode64($input, $bytesToProcess)
{
    // convert the input string to an array of byte values
    $bytes = array();
    for ($i = 0; $i < $bytesToProcess; $i++) {
        $bytes[] = ord($input[$i]);
    }

    // despite the name, each entry holds a 6-bit value
    $octets = array();
    $i = 0;
    do {
        // low 6 bits of the first byte
        $value = $bytes[$i++];
        $octets[] = $value & 0x3f;
        // the second byte, if present, goes into bits 8..15
        if ($i < $bytesToProcess) {
            $value |= $bytes[$i] << 8;
        }
        $octets[] = ($value >> 6) & 0x3f;
        if ($i++ >= $bytesToProcess) {
            break;
        }
        // the third byte, if present, goes into bits 16..23
        if ($i < $bytesToProcess) {
            $value |= $bytes[$i] << 16;
        }
        $octets[] = ($value >> 12) & 0x3f;
        if ($i++ >= $bytesToProcess) {
            break;
        }
        $octets[] = ($value >> 18) & 0x3f;
    } while ($i < $bytesToProcess);

    // show each 6-bit value as a zero-padded binary string
    return array_map(function ($sextet) {
        return str_pad(base_convert($sextet, 10, 2), 6, '0', STR_PAD_LEFT);
    }, $octets);
}

var_export(encode64("Man", 3));

(updated to indicate exactly where each input bit is moved)
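
For comparison with real output, the 6-bit values can be mapped through an alphabet to get the encoded string. The following is a minimal sketch, not phpass's actual source: the function name phpass_encode64 is hypothetical, the alphabet is assumed to be the `./0-9A-Za-z` string that phpass keeps in its itoa64 property, and the type declarations assume PHP 7 or later.

```php
<?php
// Hypothetical standalone version of the encoder, for illustration only.
function phpass_encode64(string $input, int $count): string
{
    // Alphabet assumed from phpass's itoa64 property.
    $itoa64 = './0123456789ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz';

    $output = '';
    $i = 0;
    do {
        // Build up to 24 bits with the FIRST byte in the LOW bits.
        $value = ord($input[$i++]);
        $output .= $itoa64[$value & 0x3f];
        if ($i < $count) {
            $value |= ord($input[$i]) << 8;
        }
        $output .= $itoa64[($value >> 6) & 0x3f];
        if ($i++ >= $count) {
            break;
        }
        if ($i < $count) {
            $value |= ord($input[$i]) << 16;
        }
        $output .= $itoa64[($value >> 12) & 0x3f];
        if ($i++ >= $count) {
            break;
        }
        $output .= $itoa64[($value >> 18) & 0x3f];
    } while ($i < $count);

    return $output;
}

echo phpass_encode64("Man", 3), "\n"; // B3aP
echo base64_encode("Man"), "\n";      // TWFu
```

Unlike base64_encode, this sketch emits no `=` padding: a trailing group of one byte yields two characters and a group of two bytes yields three.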

放手` 2024-12-11 22:55:52

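encode64() looks like an implementation of standard Base64 that counts bits in the opposite order and uses a different character set. If you squint at it the right way, it takes the last 6 bits of the first byte for the first output character, for example. This is probably just a mistake; there is no security or performance benefit to doing it this way (nor relative to PHP's native base64_encode).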
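
The "opposite order" reading can be checked for a full 3-byte group: reversing the bytes, running standard Base64, reversing the resulting characters, and swapping alphabets should give the same four characters as encode64. The sketch below is only an illustration under assumptions: encode64_via_base64 is a hypothetical helper, the second alphabet is assumed to be phpass's `./0-9A-Za-z` string, and the equivalence is only claimed for complete 3-byte groups, since trailing partial groups are padded differently.

```php
<?php
// Standard Base64 alphabet (RFC 4648) and the alphabet assumed for phpass.
$b64    = 'ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz0123456789+/';
$itoa64 = './0123456789ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz';

// Hypothetical helper: reproduce encode64 for ONE full 3-byte group
// using nothing but base64_encode and string reversal.
function encode64_via_base64(string $group, string $b64, string $itoa64): string
{
    // Reversing the bytes makes Base64's big-endian 24-bit value equal to
    // the little-endian value that encode64 builds from the original bytes.
    $std = base64_encode(strrev($group));

    // Base64 emits the highest 6 bits first, encode64 the lowest first,
    // so reverse the characters, then translate index-for-index.
    return strtr(strrev($std), $b64, $itoa64);
}

echo encode64_via_base64("Man", $b64, $itoa64), "\n"; // B3aP
```

If that holds, the scheme is Base64 read in little-endian order with a different alphabet, which fits the answer's description.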
