"Tail" a binary file based on string location using bash?

Posted 2024-08-27 10:55:25 · 214 characters · 11 views · 0 comments

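I have a bunch of binary files, and each one contains an embedded string near the end of the file, at a different position in each file (it occurs only once per file). I need to extract the part of the file from the string's position to the end of the file and dump it into a new file.

For example, if the contents of the file are "AWREDEDEDEXXXERESSDSDS" and the string of interest is "XXX", then the part of the file I need is "XXXERESSDSDS".

What is the easiest way to do this in bash?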

高冷爸爸 2024-09-03 10:55:25

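Perl has a built-in variable that refers specifically to the part of a string that follows a regular-expression match. That is the approach I would use. It goes beyond plain Bash and the standard utilities, but Perl is common enough that you should be fine.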

本王不退位尔等都是臣 2024-09-03 10:55:25

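The following is a small shell hack that does not perform very well, but it works.

Write the script file tail.sh as follows:

#!/bin/sh
# Find the byte offset of the first match of PATTERN ($3) in INPUTNAME ($1), then copy from there to the end.
offset=$(grep --binary-files=text -m1 -b -o -- "$3" "$1" | head -n 1 | cut -d ':' -f 1)
dd bs=1 if="$1" of="$2" skip="$offset"

Call it as: tail.sh INPUTNAME OUTPUTNAME PATTERN

P.S.: Sorry, I forgot one grep option in my first post.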
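Since dd with bs=1 copies one byte at a time, a possibly faster variant is to keep the grep offset lookup and let tail -c do the copy. This is only a sketch: the function name extract_from_marker is hypothetical, -F (treat the pattern as a fixed string) is my own assumption about what is wanted, and grep's -a, -b, -o and -m options are assumed to behave as in GNU grep.

```shell
#!/bin/sh
# extract_from_marker FILE MARKER OUT   (hypothetical helper name)
extract_from_marker() {
    # -a: treat binary data as text; -b -o: print "OFFSET:MATCH" with the
    # 0-based byte offset of the match itself; -m1: stop after the first
    # matching line; -F: fixed string, not a regex (assumption).
    offset=$(grep -a -b -o -m1 -F -- "$2" "$1" | head -n 1 | cut -d: -f1)
    [ -n "$offset" ] || return 1    # marker not found
    # tail -c +N starts output at byte N counting from 1, hence the +1.
    tail -c "+$((offset + 1))" "$1" > "$3"
}

# Demo with the sample data from the question.
printf 'AWREDEDEDEXXXERESSDSDS' > sample.bin
extract_from_marker sample.bin XXX sample.out
cat sample.out
```

The function returns a non-zero status when the marker is absent, so a caller can skip such files instead of copying the whole input.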

最好是你 2024-09-03 10:55:25

Would strings and grep do what you want?

For example:

strings -n 3 myfilename | grep XXX

黑凤梨 2024-09-03 10:55:25

 strings -n3 file_binary | awk '/XXX/{gsub(/.*XXX/,"");print}'

且行且努力 2024-09-03 10:55:25

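I came up with this solution:

ls -1 *.bin | xargs strings -n4 --radix=d -f | grep "string" | awk '{sub(/:/, ""); print $2 " " $1 " " $1".";}' | xargs -l1 split -b && rm *.aa

ls -1 *.bin prints only the filenames with the extension "bin", one per line.

xargs strings -n4 --radix=d -f lists all the strings in each file with their decimal positions and includes the filename in the output.

grep "string" prints the lines containing "string" (it occurs only once in each file).

awk '{sub(/:/, ""); print $2 " " $1 " " $1".";}' removes the colon that strings adds after the filename, then prints the position of the string, the filename, and the filename followed by a period (this line is used as the arguments for the split command).

xargs -l1 split -b runs the split command once per line, using awk's output as the remaining arguments.

rm *.aa deletes the first part of each split file. "aa" is the default suffix for the first part produced by split.

There are probably better, faster, or safer ways of doing this, but it is fine for my purposes.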
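For comparison, here is one way the same batch job might be written as a plain loop, which avoids parsing ls output and does not leave *.aa pieces to clean up. It is a sketch under assumptions, not a drop-in replacement: the function name extract_all, the ".tail" output suffix and the demo directory are all made up, the marker is treated as a fixed string via -F, and the grep options are assumed to behave as in GNU grep.

```shell
#!/bin/sh
# extract_all DIR MARKER   (hypothetical helper name)
# For every DIR/*.bin, write the bytes from MARKER to end of file into
# DIR/<name>.bin.tail (the ".tail" suffix is an arbitrary choice).
extract_all() {
    dir=$1
    marker=$2
    for f in "$dir"/*.bin; do
        [ -f "$f" ] || continue    # no matching files: the glob stays literal
        # 0-based byte offset of the first match, taken from "OFFSET:MATCH".
        offset=$(grep -a -b -o -m1 -F -- "$marker" "$f" | head -n 1 | cut -d: -f1)
        if [ -z "$offset" ]; then
            echo "marker not found in $f" >&2
            continue
        fi
        # tail -c +N is 1-based, so start at offset + 1.
        tail -c "+$((offset + 1))" "$f" > "$f.tail"
    done
}

# Demo on two made-up files, one of which has newlines around the marker.
mkdir -p demo_bins
printf 'AWREDEDEDEXXXERESSDSDS' > demo_bins/a.bin
printf 'QQ\nQXXXtail\nmore' > demo_bins/b.bin
extract_all demo_bins XXX
```

Quoting "$f" throughout keeps the loop working for filenames that contain spaces, which the ls | xargs pipeline would split.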

简单爱 2024-09-03 10:55:25

Try this:

grep -ao 'string.*' filename

Since you have binary data, you might want to redirect the output to a file:

grep -ao 'string.*' filename > binary.out

Or pipe it through hexdump or similar for testing:

grep -ao 'string.*' filename | hd
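One thing that may be worth checking before relying on this: grep works line by line and "." does not match a newline, so if the bytes after the marker include a newline, the output stops at the end of that line rather than at the end of the file. The small demo below uses made-up sample data and file names to show the effect.

```shell
#!/bin/sh
# Made-up sample: marker "XXX" followed by bytes that include a newline.
printf 'AAAXXXBBB\nCCC' > nl_sample.bin

# grep prints only up to the end of the line that contains the match.
grep -ao 'XXX.*' nl_sample.bin > nl_grep.out

# The real tail ("XXXBBB", newline, "CCC") is 10 bytes long;
# compare that with the size of what grep produced.
wc -c < nl_grep.out
```

If the two sizes differ for your files, an offset-based approach (find the byte offset, then copy from there) is the safer route.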

About the author

用心笑
