Grep and extract specific data from multiple log files

Posted on 2025-01-19 12:57:32

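I have multiple log files in a directory and am trying to extract only the timestamp and one part of each log line, namely the value of the fulltext query parameter. Each query parameter in a request is separated by an ampersand (&), as shown below.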

Input

30/Mar/2022:00:27:36 +0000 [59823] -> GET /libs/granite/omnisearch?p.guesstotal=1000&fulltext=798&savedSearches%40delete=&
31/Mar/2022:00:27:36 +0000 [59823] -> GET /libs/granite/omnisearch?p.guesstotal=1000&fulltext=dyson+v7&savedSearches%40delete=&

Expected output

30/Mar/2022:00:27:36 -> 798
31/Mar/2022:00:27:36 -> dyson+v7

I have this command, which recursively searches all files in the directory.

grep -rn "/libs/granite/omnisearch" ~/Downloads/ReqLogs/ > output.txt

It prints the whole log line, prefixed with the file path and line number, like so

/Users/****/Downloads/ReqLogs/logfile1_2022-03-31.log:6020:31/Mar/2022:00:27:36 +0000 [59823] -> GET /libs/granite/omnisearch?p.guesstotal=1000&fulltext=798&savedSearches%4

Please enlighten me: how can I manipulate this to achieve the expected output?

剩一世无双 2025-01-26 12:57:32

grep can return the whole line or the string which matched. For extracting a different piece of data from the matching lines, turn to sed or Awk.

awk -v search="/libs/granite/omnisearch" '$0 ~ search { s = $0; sub(/.*fulltext=/, "", s); sub(/&.*/, "", s); print $1, "->", s }' ~/Downloads/ReqLogs/*

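or

sed -n '\%/libs/granite/omnisearch%s/ .*fulltext=\([^&]*\)&.*/ -> \1/p' ~/Downloads/ReqLogs/*

The sed version is more succinct, but also somewhat more oblique.

\%...% uses the alternate delimiter % so that we can use literal slashes in our search expression.

The s/ .../ -> \1/p then says to match everything on the matching lines from the first space onward, capturing whatever sits between fulltext= and the next &, replace that whole match with " -> " followed by the captured substring, and print the resulting line. The timestamp before the first space is left untouched; the " -> " in the replacement is needed because the match consumes that first space, so without it the timestamp and the value would run together.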
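
To see both approaches side by side, here is a minimal sketch that runs them against a temporary file. The sample lines are modeled on the question's input; the middle line is a made-up non-matching request, added only to show that it gets skipped. Both commands here print " -> " between the timestamp and the value so that the result matches the expected output in the question.

```shell
#!/bin/sh
# Sample data modeled on the question's log lines. The middle line is a
# hypothetical non-matching request, included to show that it is skipped.
log=$(mktemp)
cat > "$log" <<'EOF'
30/Mar/2022:00:27:36 +0000 [59823] -> GET /libs/granite/omnisearch?p.guesstotal=1000&fulltext=798&savedSearches%40delete=&
30/Mar/2022:00:27:37 +0000 [59824] -> GET /content/home.html
31/Mar/2022:00:27:36 +0000 [59823] -> GET /libs/granite/omnisearch?p.guesstotal=1000&fulltext=dyson+v7&savedSearches%40delete=&
EOF

# Awk: copy the line, strip everything up to "fulltext=", then everything
# from the next "&" onward; print the first field (the timestamp) and the value.
awk_out=$(awk -v search="/libs/granite/omnisearch" '
  $0 ~ search { s = $0; sub(/.*fulltext=/, "", s); sub(/&.*/, "", s); print $1, "->", s }
' "$log")

# sed: on matching lines, replace everything from the first space onward
# with " -> " plus the captured value, then print.
sed_out=$(sed -n '\%/libs/granite/omnisearch%s/ .*fulltext=\([^&]*\)&.*/ -> \1/p' "$log")

printf '%s\n' "$awk_out"
rm -f "$log"
```

With this sample data, the two commands should produce the same two lines, one per omnisearch request. Note that the Awk version assumes every matching line actually contains fulltext=; a line without it would be printed largely unmodified, so adding a second condition such as /fulltext=/ may be worthwhile on real logs.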

The -n flag turns off the default printing action, so that we only print the lines where the search expression matched.
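
The effect of -n is easiest to see by running the same script with and without it. The sketch below uses two hypothetical, shortened sample lines (not taken from the real logs) and a simplified substitution that keeps only the captured value.

```shell
#!/bin/sh
# Two hypothetical sample lines; only the first contains the search path.
sample='GET /libs/granite/omnisearch?fulltext=798&x=1
GET /content/home.html'

# With -n, the only output comes from the p flag, so only lines where the
# substitution succeeded are printed.
with_n=$(printf '%s\n' "$sample" |
  sed -n '\%/libs/granite/omnisearch%s/.*fulltext=\([^&]*\)&.*/\1/p')

# Without -n, sed prints every line once by default, and the p flag prints
# the substituted line a second time.
without_n=$(printf '%s\n' "$sample" |
  sed '\%/libs/granite/omnisearch%s/.*fulltext=\([^&]*\)&.*/\1/p')

printf 'with -n:\n%s\n\nwithout -n:\n%s\n' "$with_n" "$without_n"
```

Without -n, the extracted value shows up twice and the unrelated line leaks through as well, which is why -n and p are normally used together for this kind of extraction.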

The wildcard ~/Downloads/ReqLogs/* matches all files in that directory; if you really need to traverse subdirectories, too, perhaps add find to the mix.

find ~/Downloads/ReqLogs -type f -exec sed -n '\%/libs/granite/omnisearch%s/ .*fulltext=\([^&]*\)&.*/ -> \1/p' {} +

or similarly with the Awk command after -exec. The placeholder {} tells find where to add the name of the found file(s) and + says to put as many as possible in one go, rather than running a separate -exec for each found file. (If you want that, use \; instead of +.)
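
As a rough illustration of the Awk variant, the following sketch builds a throwaway directory tree (the file names a.log and archive/b.log are made up for the example), then lets find hand every file to a single awk invocation. The awk program prints " -> " between the timestamp and the value to match the expected output in the question.

```shell
#!/bin/sh
# Build a temporary tree with one log at the top level and one in a
# subdirectory; the file and directory names are hypothetical.
dir=$(mktemp -d)
mkdir -p "$dir/archive"
printf '%s\n' '30/Mar/2022:00:27:36 +0000 [59823] -> GET /libs/granite/omnisearch?p.guesstotal=1000&fulltext=798&savedSearches%40delete=&' > "$dir/a.log"
printf '%s\n' '31/Mar/2022:00:27:36 +0000 [59823] -> GET /libs/granite/omnisearch?p.guesstotal=1000&fulltext=dyson+v7&savedSearches%40delete=&' > "$dir/archive/b.log"

# find appends the names of all files it finds in place of {} and, because
# of +, runs awk as few times as possible. Sorting makes the output order
# independent of the directory traversal order.
found=$(find "$dir" -type f -exec awk -v search="/libs/granite/omnisearch" '
  $0 ~ search { s = $0; sub(/.*fulltext=/, "", s); sub(/&.*/, "", s); print $1, "->", s }
' {} + | sort)

printf '%s\n' "$found"
rm -rf "$dir"
```

Both files are picked up even though one sits in a subdirectory, which the plain ~/Downloads/ReqLogs/* wildcard would not do.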
