确定PHP中二进制数据的未知数据格式

发布于 2024-09-17 01:02:46 字数 521 浏览 3 评论 0原文

我有混合了 uint32 和 null 终止字符串的二进制数据。我知道单个数据集的大小（每组数据共享相同的格式），但不知道实际的格式。

我一直在使用 unpack 使用以下函数读取数据：

function read_uint32( $fh ){
  $return_value = fread($fh, 4 );
  $return_value = unpack( 'L', $return_value );
  return $return_value[1];
}

function read_string( $fh ){
  do{
    $char = fread( $fh, 1 );
    $return_string .= $char;
  }while( ord( $char ) != 0 );
  return substr($return_string, 0, -1);
}

然后基本上尝试这两个函数并查看数据作为字符串是否有意义，如果不是，则可能是 int，是否有更简单的方法可以执行此操作？

谢谢。

原文

I have binary data with a mix of uint32 and null terminated strings. I know the size of an individual data set ( each set of data shares the same format ), but not the actual format.

I've been using unpack to read the data with the following functions:

function read_uint32( $fh ){
  $return_value = fread($fh, 4 );
  $return_value = unpack( 'L', $return_value );
  return $return_value[1];
}

function read_string( $fh ){
  do{
    $char = fread( $fh, 1 );
    $return_string .= $char;
  }while( ord( $char ) != 0 );
  return substr($return_string, 0, -1);
}

and then basically trying both functions and seeing if the data makes sense as a string, and if not it's probably an int, is there an easier way to go about doing this?

Thanks.

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

你怎么这么可爱啊 2024-09-24 01:02:46

嗯，我认为你的方法没问题。
好吧，如果你只得到 ascii 字符串，那么它很容易，因为最高位总是 0 或 1（在一些奇怪的情况下......）分析文件中的一些字节，然后查看分布会告诉你可能是它的 ascii 还是其他东西二进制。
如果你有不同的编码，比如 utf8 或其他编码，那真的很痛苦。
您可能可以寻找重复出现的 CR/LF 字符或过滤掉 0-31 ，只让 tab、cr、lf、ff 滑过。当您分析前 X 个字节并比较非制表符、cr、lf、ff 字符和其他字符的比率时。这适用于任何编码，因为 ascii 范围是规范的......
要定义实际的文件类型，最好将其交给操作系统层，然后简单地从 shell 调用文件或使用 php 函数来获取 mimetype...

回复收藏 0 原文

~没有更多了~