排除最少 6 位数字并替换 Hive 中的尾随数字
有人可以帮我在配置单元中编写以下逻辑吗? 我在列中有一个值,其中的数字尾随 0。我需要将所有这些 0 替换为 9,同时将 0 替换为 9 ,我还需要考虑在 9 之前至少应该有 6 位数字,否则需要排除一些 0,以便 9 之前至少可以有 6 位数字。 PFB 一些场景。
- 1234506600000000000
这里我们可以看到尾随 0 之前的位数是 8 (12345066),所以我只需要把 0 保留 9 个,输出就会像这样。 1234506699999999999。
- 1234500000000000000
这里我在尾随 0 之前只有 5 位数字,所以我需要考虑第 6 个位置的 0 也是一个数字,并且需要在用 9 替换 0 时排除它,因此输出将为 1234509999999999999。
- 1000000000000000000
这里我在尾随 0 之前只有 1 位数字,所以我需要排除 5 个额外的 0,并且需要将剩余的 0 替换为 9,所以最终输出将类似于 1000009999999999999。
Input Output
1234506600000000000 1234506699999999999
1234500000000000000 1234509999999999999
1000000000000000000 1000009999999999999
Can someone help me to write below logic in hive.
I have one value in column in which I have digits with trailing 0's. I need to Replace all these 0's by 9,while replacing the 0's by 9 ,I need to also consider that before 9 minimun 6 digits should be there, else need to exclude some 0's so that before 9 can have at-least 6 digits.PFB some Scenarios.
- 1234506600000000000
Here we can see the number of digits before trailing 0's is 8 (12345066) so i Just need to remaining 0's by 9 and the output will be like. 1234506699999999999.
- 1234500000000000000
Here I have only 5 digits before trailing 0's so I need to consider 6th Position's 0 also a digit and need to exclude this while replacing the 0's by 9 so the output will be 1234509999999999999.
- 1000000000000000000
Here I have only 1 digit before trailing 0's ,so I need to exclude 5 extra 0's and need to replace remaining 0's by 9, so final output will be like 1000009999999999999.
Input Output
1234506600000000000 1234506699999999999
1234500000000000000 1234509999999999999
1000000000000000000 1000009999999999999
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
data:image/s3,"s3://crabby-images/d5906/d59060df4059a6cc364216c4d63ceec29ef7fe66" alt="扫码二维码加入Web技术交流群"
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
如果您想修改 另一个问题中的 leftjoin 技术,我们可以调整正则表达式以匹配至少 6 位数字(包括 0)
如果您想走我建议的替换/填充/替换路线,你会在 rtrim'd 后检查数字的长度,如果它小于 6,则用零将其填充到 6。如果字符串长度超过 6 个字符,rpad 的大多数实现都会将其截断为 6 个字符 - 如果不这样做,那么在 rtrim 之后调用 rpad 会很好而且很简单。如果 hive 的 rpad 执行子字符串操作,那么创建自己的 rpad 函数可能值得,该函数可以单独留下比 N 长的字符串
If you want to modify leftjoin's technique from the other question we can tweak the Regex to match at least 6 digits including 0s
If you want to go the replace/pad/replace route I proposed, you'd check the length of the number after it's rtrim'd and if it's less than 6, rpad it out to 6 with zeroes. Most implementations of rpad would chop the string off at 6 chars if it were longer than 6 - if they didn't it would be nice and simple to just call rpad after rtrim. It might be worth making your own rpad function that leaves strings longer than N alone, if hive's rpad performs a substring op