如何在加载镶木quet文件时单独添加标头行？

发布于 2025-01-25 03:24:59 字数 387 浏览 3 评论 0原文

在处理CSV文件时，我们可以说：

df = pd.read_csv("test.csv", names=header_list, dtype=dtype_dict)

以上将在dtype_dict中以header_list和dtypes创建一个数据框，

我们可以使用pd._read_parquet（）做类似的事情吗？

我的问题涉及单独传递标题，因此在“ test.csv”中不可用
绕过的另一种方法可能是将DF中的整个数据向下移动1（包括将标题转换为行），然后用header_list替换标题（甚至可能吗？）

是否有最佳解决方案？我对镶木木不太熟悉，因此任何指导都将不胜感激，谢谢。

原文

While handling csv files we can say:

df = pd.read_csv("test.csv", names=header_list, dtype=dtype_dict)

Above would create a dataframe with headers as header_list and dtypes as of the dtype_dict

Can we do something similar with pd.read_parquet() ?

My issue involves passing in headers separately and would thus not be available in the "test.csv"
Another way to bypass it could be to move the entire data in df downwards by 1 (including shifting headers into rows) and then replacing the header with header_list (if it's even possible?)

Is there an optimal solution to my issue?
I'm not too familiar with parquet so any guidance would be appreciated, thanks.

分享到QQ

分享到微博