如何存储 /url 的实际名称？

发布于 2024-10-27 14:49:53 字数 459 浏览 2 评论 0原文

我正在将脚本转换为 HDFS (Hadoop)，并且我有这个 cmd：

    tail -n+$indexedPlus1 $seedsDir/*url* | head -n$it_size > $it_seedsDir/urls

使用 HDFS，我需要使用 -get 获取文件，这可以工作。

bin/hadoop dfs -get $seedsDir/*url* .

但是我不知道下载的文件名是什么，更不用说我想存储在 $local_seedsDir/url 中。我可以知道吗？

KISS 告诉我：

bin/hadoop dfs -get $seedsDir/*url* $local_seedsDir/urls

即只需在本地将文件命名为 url 即可。

原文

I'm converting a script to HDFS (Hadoop) and I have this cmd:

    tail -n+$indexedPlus1 $seedsDir/*url* | head -n$it_size > $it_seedsDir/urls

With HDFS I need to get the file using -get and this works.

bin/hadoop dfs -get $seedsDir/*url* .

However I don't know what downloaded file name is, let alone that I wanted to store in $local_seedsDir/url.
Can I know?

KISS tells me:

bin/hadoop dfs -get $seedsDir/*url* $local_seedsDir/urls

i.e. just name the file as urls locally.

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

痞味浪人 2024-11-03 14:49:53

url=`echo bin/hadoop dfs -get urls-input/MR6/*url* .`

然后使用 tail 和 head 从 url 中提取实际文件名并将其存储在 $urls 中

rm $urls

，否则，只需 KISS

url=`echo bin/hadoop dfs -get urls-input/MR6/*url* .`

then tail and head to extract from url the actual file name and store it in $urls

rm $urls

But otherwise, just KISS

回复收藏 0 原文

~没有更多了~

关于作者

千柳

暂无简介

0 文章

0 评论

22 人气

关注发私信

胡图图

文章 0 评论 0

关注

zt006

文章 0 评论 0

关注

z祗昰~

文章 0 评论 0

关注

冰葑

文章 0 评论 0

关注

野の

文章 0 评论 0

关注

天空

文章 0 评论 0

友情链接

文江博客

如何存储 /url 的实际名称？

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（1）

关于作者

相关话题

热门标签

推荐作者

胡图图

zt006

z祗昰~

冰葑

野の

天空

友情链接

如何存储 /*url* 的实际名称？

如果你对这篇内容有疑问，欢迎到本站社区发帖提问 参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（1）

关于作者

相关话题

热门标签

推荐作者

胡图图

zt006

z祗昰~

冰葑

野の

天空

友情链接

如何存储 /url 的实际名称？

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。