合并GCP中的CSV文件

发布于 2025-02-07 22:25:39 字数 132 浏览 1 评论 0原文

我在GCP上工作的数据集为CSV格式,对于每个功能,都有一个没有标头的CSV文件。大约有20个文件,想为所有这些变量创建一个带标头的文件。但是,仅当我尝试打开顶点AI工作台时,我才能访问数据存储桶,这表明我没有许可。 有什么方法可以组合所有这些文件?

The dataset on which I am working on GCP is in csv format and for each feature there is a separate csv file with no header. There is around 20 files and want to create a single file for all these variables with headers. However, I have access on the data bucket only when I try to open Vertex AI Workbench, it shows I don't have permission for that.
Is there any way to combine all these files?

如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。

扫码二维码加入Web技术交流群

发布评论

需要 登录 才能够评论, 你可以免费 注册 一个本站的账号。

评论(1

小苏打饼 2025-02-14 22:25:39

i 写了一篇文章关于如何使用BigQuery播放CSV文件。我没有提到如何合并正确。但是通常我会在row_number上进行加入,类似的事情:

With table_left as (select *, ROW_NUMBER() OVER () as row_id from <tableLeft>),
table_right as (select *, ROW_NUMBER() OVER () as row_id from <tableRight>)
select tl.* except(row_id), tr.*  except(row_id) from table_left tl join table_right tr on tl.row_id = tr.row_id

I wrote an article on how to use BigQuery to play with CSV files. I didn't mentioned how to merge to right. But typically I will do a join on row_number, something like that:

With table_left as (select *, ROW_NUMBER() OVER () as row_id from <tableLeft>),
table_right as (select *, ROW_NUMBER() OVER () as row_id from <tableRight>)
select tl.* except(row_id), tr.*  except(row_id) from table_left tl join table_right tr on tl.row_id = tr.row_id

~没有更多了~
我们使用 Cookies 和其他技术来定制您的体验包括您的登录状态等。通过阅读我们的 隐私政策 了解更多相关信息。 单击 接受 或继续使用网站,即表示您同意使用 Cookies 和您的相关数据。
原文