我的输入如下
我想要以下
我正在尝试
Sales External?HR?Purchase Department
使用Listagg,因为最终我想要在单独的列中
查询输出就像以下一样,
这意味着它应该搜索第一次出现分隔符(在这种情况下“?”,它可以是任何东西,而不是常见的,例如“ - ”或“/”,因为分离器需要分开,然后提取短语在第一个分离器之前,创建一个列并放置值。然后,它应该寻找分隔符的第二次出现,然后提取单词并继续创建列,可以有多个分离器。
我尝试使用split_part尝试,但它在实际数据方案中无法维护序列,并且数据不正确。
我还尝试了RegexP_Instr,但无法使用特殊字符作为分离器。
有什么想法吗?
I have input like below
data:image/s3,"s3://crabby-images/b543b/b543b4a5837e4af52b303763acc4d2aa0350f94b" alt="enter image description here"
I want like below
data:image/s3,"s3://crabby-images/a8df0/a8df011037ba1c5f6310b75e0f245bd1bc444eab" alt="enter image description here"
I was trying with
Sales External?HR?Purchase Department
I did LISTAGG because finally i want in separate columns
Query Output would be like below,
meaning it should search for first occurrence of the separator (in this case "?", it can be anything but not common ones like "-" or "/" as the separator needs to be separate than sting value) and then extract the phrase before the first separator and create one column and put the value. Then it should look for second occurrence of the separator and then extract the word and keep creating columns, there can be multiple separators.
I tried with SPLIT_PART but it does not maintain the sequence in real data scenario and data does not come correct as per sequence.
I also tried with REGEXP_INSTR, but unable to use special characters as separators.
Any thought?
data:image/s3,"s3://crabby-images/ff0e2/ff0e2b335ff4d61a358596d7cf8bfb6ad9589e05" alt="enter image description here"
发布评论
评论(3)
regex提取器
href =“ https://docs.snowflake.com/en/sql-reference/functions/flatten.html” rel =“ nofollow noreferrer”>侧面扁平将您的阵列转化为行:
更深的潜水:更多案例: https://dwgeek.com/snowflake-convert-array -to-lows-methods and-examples.html/
Regex Extract should work for you:
You can use LATERAL FLATTEN to convert your array into rows:
Deeper dive into some more cases: https://dwgeek.com/snowflake-convert-array-to-rows-methods-and-examples.html/
这是数据的简化版本。它使用带有array_agg的CTE来分组行。然后,它从数组更改为列。要添加更多列,您可以使用max(),min()或any_value()函数来通过聚合使它们通过。 (请注意,使用Any_value()将不允许使用结果集缓存的缓存结果,因为它被标记为非确定性。)
将行分类为阵列似乎没有什么特别的东西,以便将行分解为cool1,cole2,cole2,cool3,cool3,etc。是确定性的。我显示了简单地对角色的名称进行排序,但这可能是该组中的任何顺序。
Here's a simplified version of the data. It uses a CTE with array_agg to group the rows. It then changes from arrays to columns. To add more columns, you can use max(), min(), or any_value() functions to get them through the aggregation. (Note that use of any_value() will not allow use of cached results from the result set cache since it's flagged as nondeterministic.)
There's nothing particular that seems interesting to sort the rows into the array so that ROLE1, ROLE2, ROLE3, etc. are deterministic. I showed simply sorting on the name of the role, but it could be any order by within that group.
这是一个存储的POC,它将根据输入字符串和指定的定界符产生一个动态列的表结果。
如果您正在寻找一种基于值生成动态列名称的方法,我建议您在此处访问Felipe Hoffa的博客条目:
https://medium.com/medium.com/snowflake/snowflake/dynemic-pivots-pivots-pivots-pivots-pivots-po- snowflake-c76393987c in-sql-with-with-snowflake
调用:
调用pivot_dyn_results([string],[DeLimiter]);
结果:
Here's a stored proc that will produce a table result with a dynamic set of columns based on the input string and specified delimiter.
If you are looking for a way to generate dynamic column names based on values, I recommend visiting Felipe Hoffa's blog entry here:
https://medium.com/snowflake/dynamic-pivots-in-sql-with-snowflake-c763933987c
Invocation:
call pivot_dyn_results([string],[delimiter]);
Results: