How to remove leading zeros from alphanumeric text?
I've seen questions here on SO about how to prefix zeros, but not the other way around!
Can you suggest how to remove the leading zeros from alphanumeric text? Are there any built-in APIs, or do I need to write a method to trim the leading zeros?
Example:
01234 converts to 1234
0001234a converts to 1234a
001234-a converts to 1234-a
101234 remains as 101234
2509398 remains as 2509398
123z remains as 123z
000002829839 converts to 2829839
Regex is the best tool for the job; what it should be depends on the problem specification. The following removes leading zeroes, but leaves one if necessary (i.e. it wouldn't just turn "0" to a blank string).
The ^ anchor will make sure that the 0+ being matched is at the beginning of the input. The (?!$) negative lookahead ensures that not the entire string will be matched.
Test harness:
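A minimal sketch of that replacement with a small harness around it, built from the pattern pieces described above (^, 0+ and (?!$)). The class name, method name, and sample inputs are illustrative assumptions rather than the answer's own code.

```java
class LeadingZeroRegex {
    // ^     anchors the match at the start of the input
    // 0+    matches one or more zeros
    // (?!$) rejects a match that would consume the whole string,
    //       so an all-zero input keeps its last zero
    static String stripLeadingZeros(String s) {
        return s.replaceFirst("^0+(?!$)", "");
    }

    public static void main(String[] args) {
        // Sample inputs chosen for illustration
        String[] inputs = {"01234", "0001234a", "001234-a", "101234", "123z", "0", "0000"};
        for (String in : inputs) {
            System.out.println("[" + in + "] -> [" + stripLeadingZeros(in) + "]");
        }
    }
}
```

With this pattern, "0" and "0000" both come out as "0" instead of an empty string.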
See also: String.replaceFirst(String regex, String replacement)
You can use the StringUtils class from Apache Commons Lang like this:
If you are using Kotlin, this is the only code that you need:
How about the regex way:
The ^ anchors to the start of the string (I'm assuming from context that your strings are not multi-line here; otherwise you may need to look into \A for start of input rather than start of line). The 0* means zero or more 0 characters (you could use 0+ as well). The replaceFirst just replaces all those 0 characters at the start with nothing.
And if, like Vadzim, your definition of leading zeros doesn't include turning "0" (or "000" or similar strings) into an empty string (a rational enough expectation), simply put it back if necessary:
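The two steps can be sketched as follows: strip with ^0*, then restore a single zero when nothing is left. The class and method names are hypothetical, and the guard that leaves an empty input empty is an added assumption.

```java
class LeadingZeroPutBack {
    // Remove every leading zero; an all-zero input becomes "".
    static String stripAll(String s) {
        return s.replaceFirst("^0*", "");
    }

    // Same, but put one zero back if stripping emptied a non-empty input.
    static String stripKeepOne(String s) {
        String stripped = stripAll(s);
        // Assumption: an input that was already empty stays empty.
        return (stripped.isEmpty() && !s.isEmpty()) ? "0" : stripped;
    }
}
```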
A clear way without any need of regExp and any external libraries.
To go with thelost's Apache Commons answer: using guava-libraries (Google's general-purpose Java utility library which I would argue should now be on the classpath of any non-trivial Java project), this would use CharMatcher:
You could just do:
String s = Integer.valueOf("0001007").toString();
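A small sketch of how this one-liner behaves on the inputs from the question. It parses the text as an int, so it only applies to purely numeric strings that fit in an int; anything like "0001234a" throws NumberFormatException. The wrapper class and the fallback method are hypothetical additions for illustration.

```java
class LeadingZeroParse {
    // Works only for plain decimal integers within the int range.
    static String stripViaParse(String s) {
        return Integer.valueOf(s).toString();
    }

    // Hypothetical fallback: return the input unchanged when it is not
    // a plain integer (e.g. it contains letters or a hyphen).
    static String stripViaParseOrKeep(String s) {
        try {
            return stripViaParse(s);
        } catch (NumberFormatException e) {
            return s;
        }
    }
}
```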
Use this:
Use the Apache Commons StringUtils class:
Using Regexp with groups:
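One possible group-based sketch, assuming the goal is to capture everything after the run of zeros. The pattern 0+(.+) is an assumption chosen so that at least one character always remains; the class and method names are illustrative.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class LeadingZeroGroups {
    // 0+ consumes the leading zeros; group 1 must hold at least one
    // character, so "000" backtracks and leaves "0" in the group.
    private static final Pattern LEADING_ZEROS =
            Pattern.compile("0+(.+)", Pattern.DOTALL);

    static String stripLeadingZeros(String s) {
        Matcher m = LEADING_ZEROS.matcher(s);
        // matches() requires the whole input to fit the pattern;
        // inputs without leading zeros are returned unchanged.
        return m.matches() ? m.group(1) : s;
    }
}
```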
Using regex as some of the answers suggest is a good way to do that. If you don't want to use regex then you can use this code:
If you (like me) need to remove all the leading zeros from each "word" in a string, you can modify @polygenelubricants' answer to the following:
which results in:
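A sketch of such a per-word variant, assuming the ^ anchor is swapped for a word boundary and the end-of-input lookahead for a "not at a word boundary" lookahead. The class name, method name, and sample sentence are illustrative.

```java
class LeadingZeroWords {
    // \b0+    a run of zeros starting at a word boundary
    // (?!\b)  the run must not end at a word boundary, so a word made
    //         only of zeros keeps its last zero
    static String stripPerWord(String s) {
        return s.replaceAll("\\b0+(?!\\b)", "");
    }
}
```

For the input "003 d0g 00ss 00 0 00" this returns "3 d0g ss 0 0 0": zeros inside a word (d0g) are left alone, and all-zero words collapse to a single 0.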
Using Kotlin, it is easy:
I think this is easy to do. You can just loop over the string from the start, removing zeros until you find a non-zero character.
Without using Regex or the substring() function on String, which would be inefficient:
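One way to do this with neither regex nor String.substring() is to edit a StringBuilder in place. This is a sketch under that assumption; the class and method names are hypothetical, and keeping a final "0" for all-zero input is a design choice made here.

```java
class LeadingZeroBuilder {
    static String stripLeadingZeros(String s) {
        StringBuilder sb = new StringBuilder(s);
        // Drop zeros from the front, but stop while one character
        // remains so that "000" ends up as "0".
        while (sb.length() > 1 && sb.charAt(0) == '0') {
            sb.deleteCharAt(0);
        }
        return sb.toString();
    }
}
```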
You could replace "^0*(.*)" with "$1" using regex.
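In Java this replacement could look like the sketch below (the class and method names are illustrative). Note that an all-zero input becomes an empty string with this pattern.

```java
class LeadingZeroBackref {
    static String stripLeadingZeros(String s) {
        // ^0* consumes the leading zeros, (.*) captures the rest of the
        // line, and "$1" writes the captured text back.
        return s.replaceFirst("^0*(.*)", "$1");
    }
}
```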
If you don't want to use regex or external library.
You can do with "for":
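A minimal sketch of such a loop: scan for the first character that is not '0' and return the text from there. The names are hypothetical, and returning "0" for an all-zero input is an assumption.

```java
class LeadingZeroFor {
    static String stripLeadingZeros(String s) {
        for (int i = 0; i < s.length(); i++) {
            if (s.charAt(i) != '0') {
                // First non-zero character found: keep everything from here.
                return s.substring(i);
            }
        }
        // Only zeros (or nothing) in the input.
        // Assumption: all-zero input maps to "0", empty input stays empty.
        return s.isEmpty() ? "" : "0";
    }
}
```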
I ran some benchmark tests and found that the fastest way (by far) is this solution:
Regular expressions in particular are very slow over a long iteration. (I needed to find the fastest way for a batch job.)
And what about just searching for the first non-zero character?
This regex finds the first digit between 1 and 9 followed by any number of digits, so for "00012345" it returns "12345".
It can be easily adapted for alphanumeric strings.
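A sketch of this search, assuming the digit pattern is [1-9]\d*. The alphanumeric variant ([^0].*) is only one possible adaptation, and all names here are illustrative. Note that the digit-only pattern drops any non-digit suffix, so "0001234a" yields "1234".

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class FirstNonZero {
    // First digit 1-9 followed by any number of digits.
    private static final Pattern DIGITS = Pattern.compile("[1-9]\\d*");
    // Assumed alphanumeric adaptation: first character that is not '0',
    // followed by the rest of the input.
    private static final Pattern ANY = Pattern.compile("[^0].*", Pattern.DOTALL);

    private static String firstMatch(Pattern p, String s) {
        Matcher m = p.matcher(s);
        // Returns "" when nothing matches (e.g. an all-zero input).
        return m.find() ? m.group() : "";
    }

    static String digitsOnly(String s) {
        return firstMatch(DIGITS, s);
    }

    static String alphanumeric(String s) {
        return firstMatch(ANY, s);
    }
}
```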