Can I search for a string in all the text files (but not the binary files) in a tree?

Posted on 2024-12-10 11:25:37

My best shot so far (for finding a string in a directory containing a large C program) is:

find ~/example_directory -type f \( -name "*.mk" -or -name "*.[sch]" \) -print0 | xargs -0 -e grep "example_string"

This works pretty well, but it relies on everything interesting being in .mk makefiles, .c or .h source files, or .s assembler files.

I was thinking of adding things like "all files called Makefile" or "all *.py Python scripts", but it occurs to me that it would be much easier if there were some way to tell find to return only text files.

If you just run grep on every file, it takes ages, and you get lots of uninteresting hits in object files.
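
For illustration, the extension described above (adding Makefile and *.py to the name list) might look like the sketch below. The temporary directory and its file names are made up for the demo. The sketch uses -o, the POSIX spelling of -or, and leaves out -e, which should be redundant next to -0 in GNU xargs.

```shell
# Build a throwaway tree (all file names here are hypothetical).
demo_dir=$(mktemp -d)
printf 'all: example_string\n' > "$demo_dir/Makefile"
printf 'print("example_string")\n' > "$demo_dir/tool.py"
printf 'int example_string;\n' > "$demo_dir/main.c"
printf 'example_string\0\n' > "$demo_dir/main.o"   # object-file stand-in, never selected by find

# Same pipeline as in the question, with two more name patterns added.
# grep -l prints only the names of the files that contain the string.
found=$(find "$demo_dir" -type f \
    \( -name "*.mk" -o -name "*.[sch]" -o -name "Makefile" -o -name "*.py" \) \
    -print0 | xargs -0 grep -l "example_string" | LC_ALL=C sort)
echo "$found"

rm -rf "$demo_dir"
```

Every new kind of text file needs another -name clause, which is the maintenance burden the question is trying to avoid.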

扶醉桌前 2024-12-17 11:25:37

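GNU grep supports the -I option, which makes it treat binary files (detected by inspecting the first few bytes) as non-matching, so it essentially skips them.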
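
A small demonstration of the effect, as a sketch only: the directory and file names are invented, and the "object file" is just a file with a NUL byte near the start, which should be enough for grep to classify it as binary.

```shell
# Throwaway tree with one text file and one binary-looking file (hypothetical names).
demo_dir=$(mktemp -d)
printf 'int x; /* example_string */\n' > "$demo_dir/main.c"
printf 'example_string\0\001\002\n' > "$demo_dir/main.o"

# -r recurses, -I treats binary files as non-matching, -l lists matching file names.
matches=$(grep -rIl "example_string" "$demo_dir")
echo "with -I:    $matches"

# Without -I the binary file is reported as a match too.
all_matches=$(grep -rl "example_string" "$demo_dir" | LC_ALL=C sort)
echo "without -I: $all_matches"

rm -rf "$demo_dir"
```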

黑白记忆 2024-12-17 11:25:37

grep -rI <pattern> <path>

The -r switch makes grep recurse, and -I makes it ignore binary files.

There are additional switches to exclude certain files and directories (I frequently use them to exclude svn metadata, for example).
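
The answer does not name the exclusion switches. In GNU grep they are presumably --exclude-dir and --exclude, and a sketch of using them could look like this (the tree layout and the *.orig pattern are assumptions made for the demo):

```shell
# Throwaway tree with a source file, a backup copy, and svn metadata (hypothetical names).
demo_dir=$(mktemp -d)
mkdir -p "$demo_dir/src" "$demo_dir/.svn"
printf 'example_string in source\n' > "$demo_dir/src/main.c"
printf 'example_string in backup\n' > "$demo_dir/src/main.c.orig"
printf 'example_string in metadata\n' > "$demo_dir/.svn/entries"

# --exclude-dir skips directories whose name matches the glob;
# --exclude skips files whose name matches the glob.
filtered=$(grep -rIl --exclude-dir=.svn --exclude='*.orig' "example_string" "$demo_dir")
echo "$filtered"

rm -rf "$demo_dir"
```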

溺ぐ爱和你が 2024-12-17 11:25:37

Have you looked at ack?

From its top 10 reasons to use ack:

ack ignores most of the junk you don't want to search
...
Binary files, core dumps, etc.

丑丑阿 2024-12-17 11:25:37

You can use grep -I to ignore binary files. Using GNU Parallel instead of xargs lets you split the work across multiple processes, exploiting some parallelism for a speedup.

The documentation has an example of how to perform a parallel grep:
http://www.gnu.org/s/parallel/man.html#example__parallel_grep

find -type f | parallel -k -j150% -n 1000 -m grep -I "example_string"
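
A variant of the command above, as a sketch: it passes NUL-delimited names (-print0 with parallel -0) so that file names containing newlines are handled safely. If GNU Parallel is not installed, the sketch falls back to xargs -P, which is a swapped-in technique and not part of this answer. The directory, the file names, and the choice of 4 processes are assumptions.

```shell
# Throwaway tree (hypothetical names); one name contains a space on purpose.
demo_dir=$(mktemp -d)
printf 'example_string one\n' > "$demo_dir/a file.txt"
printf 'example_string two\n' > "$demo_dir/b.txt"
printf 'example_string\0\n' > "$demo_dir/c.o"   # binary-looking, skipped by -I

if parallel --version 2>/dev/null | grep -q 'GNU parallel'; then
    # Same options as in the answer, plus -0 for NUL-delimited input.
    hits=$(find "$demo_dir" -type f -print0 |
        parallel -0 -k -j150% -n 1000 -m grep -lI "example_string" | LC_ALL=C sort)
else
    # Fallback, not from the answer: xargs -P runs up to 4 grep processes at once.
    hits=$(find "$demo_dir" -type f -print0 |
        xargs -0 -n 1000 -P 4 grep -lI "example_string" | LC_ALL=C sort)
fi
echo "$hits"

rm -rf "$demo_dir"
```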

About the author: 顾忌
