Extracting file names (without the extension) from an XML file

Posted 2024-09-15 11:20:14


I have the following XML output when I grep for "Server":

<Server id="1" src="/other/Server/PRX01/PRX01.xml"/>
<Server id="2" src="/other/Server/PRX01/PRX02.xml"/>
<Server id="3" src="/other/Server/PRX01/PRX03.xml"/>
<Server id="4" src="/other/Server/PRX01/PRX04.xml"/>

I need to take this output and, using sed/awk or some other tool, get just the file name, without the path or extension. So my output would need to be (for this example):

PRX01
PRX02
PRX03
PRX04


巴黎夜雨 2024-09-22 11:20:14

For the example input data, the following sed script will work:

sed -e 's/.*\/\(.*\)\.xml.*/\1/g' t.tmp

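The .*\/ greedily matches everything up to the last forward slash that is still followed by a name ending in .xml. Then \(.*\)\.xml captures the base file name (the text before .xml) in a group, and the trailing .* consumes the rest of the line. Because the pattern matches the whole line, \1 replaces the entire line with just the captured name, so the g flag is redundant here.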
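
As a rough sketch of the same idea, the pattern can be anchored on the src attribute so that lines without a matching src="….xml" print nothing instead of passing through unchanged. The function name extract_basenames is made up for illustration, and the sketch assumes each Server element sits on one line with a double-quoted src value:

```shell
# Hypothetical helper (not from the original answer): reads XML lines on
# stdin and prints the base name of each src="....xml" value.
# Using | as the s-command delimiter avoids escaping the slashes in the path.
extract_basenames() {
  sed -n 's|.*src="[^"]*/\([^"/]*\)\.xml".*|\1|p'
}

# Demo with the sample data from the question.
extract_basenames <<'EOF'
<Server id="1" src="/other/Server/PRX01/PRX01.xml"/>
<Server id="2" src="/other/Server/PRX01/PRX02.xml"/>
<Server id="3" src="/other/Server/PRX01/PRX03.xml"/>
<Server id="4" src="/other/Server/PRX01/PRX04.xml"/>
EOF
```

With -n and the p flag, only lines where the substitution succeeded are printed, so the earlier grep for "Server" is probably no longer needed.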


风渺 2024-09-22 11:20:14

This is simple to do with awk and sed, assuming the data is in the file "test.data":

cat test.data | awk 'BEGIN{FS="/"}{print $5}'  | sed 's/\..*//g'
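
To see why $5 is the right field, here is a small sketch that prints how awk splits one of the sample lines when the field separator is /. The helper name show_fields is made up for illustration:

```shell
# Hypothetical helper: prints every field awk sees when splitting on "/".
show_fields() {
  awk -F/ '{ for (i = 1; i <= NF; i++) printf "$%d=[%s]\n", i, $i }'
}

# Demo with the first sample line from the question.
printf '%s\n' '<Server id="1" src="/other/Server/PRX01/PRX01.xml"/>' | show_fields
```

Field 5 comes out as PRX01.xml" (with the trailing quote), which is why the sed step strips everything from the first dot onward. Note that the approach depends on the path depth: a src with more or fewer directories would put the file name in a different field.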


入怼 2024-09-22 11:20:14

The accepted answer can be simplified to a single awk call, dropping the useless cat and sed. The field separator still has to be set to /, otherwise $5 is empty:

awk -F/ '{sub(/\..*/,"",$5); print $5}' file
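
If the directory depth of src can vary, a single awk call can still do the job by splitting on the double quote first and then on the slash. This is only a sketch under the assumption that src is the second quoted attribute on the line, as in the sample data; the function name src_basename is made up for illustration:

```shell
# Hypothetical helper: with FS set to the double quote, $4 is the value of
# the second quoted attribute (src in the sample data). The path is then
# split on "/" and the last extension is removed from the final component.
src_basename() {
  awk -F'"' '{
    n = split($4, parts, "/")        # parts[n] is the file name
    sub(/\.[^.]*$/, "", parts[n])    # drop the last extension only
    print parts[n]
  }'
}

# Demo with two of the sample lines from the question.
src_basename <<'EOF'
<Server id="1" src="/other/Server/PRX01/PRX01.xml"/>
<Server id="2" src="/other/Server/PRX01/PRX02.xml"/>
EOF
```

Unlike the $5 version, this removes only the last extension, so a name containing other dots keeps them.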


Smile简单爱 2024-09-22 11:20:14

>gawk -F"/" "{ split($5,a,\".\"); print a[1]}" 1.t
PRX01
PRX02
PRX03
PRX04
