Regex to split BBCode into pieces

Posted 2024-09-24 16:05:38, 196 characters, 7 views, 0 comments

I have this:

str = "some html code [img]......[/img] some html code [img]......[/img]"

and I want to get this:

["[img]......[/img]","[img]......[/img]"]

鹤仙姿 2024-10-01 16:05:38

Please don't use BBCode. It's evil.

BBCode came to life when developers
were too lazy to parse HTML correctly
and decided to invent their own markup
language. As with all products of
laziness, the result is completely
inconsistent, unstandardized, and
widely adopted.

Try a more user-friendly markup language, like Markdown (that's what Stack Overflow uses) or Textile.
Both of them have parsers for Ruby:

Maruku for Markdown
RedCloth for Textile

If you still don't want to heed my advice and choose to go with BBCode, don't reinvent the wheel: use a BBCode parser. To answer your question directly, here is the least desirable option, a regex:

/\[img\].*?\[\/img\]/

You can try it on Rubular. I would use /\[img\](.*?)\[\/img\]/ instead, so that it extracts the contents inside the img tags. Note that this is fairly fragile and will break if there are nested img tags, hence the advice to use a parser.
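To make the difference between the two patterns concrete, here is a minimal sketch using String#scan. The file names a.png and b.png are placeholders invented for illustration; they are not from the question.

```ruby
# Placeholder input: a.png and b.png are made-up tag contents.
str = "some html code [img]a.png[/img] some html code [img]b.png[/img]"

# Without a capture group, scan returns each whole match, tags included.
whole = str.scan(/\[img\].*?\[\/img\]/)

# With one capture group, scan returns an array of one-element arrays,
# so flatten leaves just the text between the tags.
inner = str.scan(/\[img\](.*?)\[\/img\]/).flatten

p whole
p inner
```

The first pattern answers the question as asked (whole tags); the second is the variant for when only the tag contents are needed.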

梦里°也失望 2024-10-01 16:05:38

irb(main):001:0> str = "some html code [img]......[/img] some html \
code [img]......[/img]"
"some html code [img]......[/img] some html code [img]......[/img]"
irb(main):002:0> str.scan(/\[img\].*?\[\/img\]/)
["[img]......[/img]", "[img]......[/img]"]

Keep in mind that this is a very specific answer that is based on your exact question. Change str by, say, adding an image tag within an image tag, and all Hell will break loose.
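As a rough illustration of that failure mode, the sketch below feeds the same pattern a nested tag, and also shows why simply switching to a greedy quantifier is not a fix. Both input strings are invented for this example.

```ruby
# Made-up inputs for illustration only.
nested   = "x [img]outer [img]inner[/img] tail[/img] y"
separate = "[img]a[/img] mid [img]b[/img]"

# The lazy pattern stops at the first closing tag, cutting the outer tag short.
lazy_on_nested = nested.scan(/\[img\].*?\[\/img\]/)

# A greedy pattern runs to the last closing tag, merging separate tags into one.
greedy_on_separate = separate.scan(/\[img\].*\[\/img\]/)

p lazy_on_nested
p greedy_on_separate
```

Neither quantifier can count nesting depth, which is why the pattern is only safe for flat, non-nested input like the string in the question.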

断肠人 2024-10-01 16:05:38

There is a Ruby BBCode parser on Google Code.

Don't use regex for this.
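For a sense of what a parser does that a single regex cannot, here is a minimal sketch that tracks nesting depth with StringScanner from Ruby's standard library. It is not a BBCode parser and is not the Google Code project mentioned above: the helper name top_level_img_spans is hypothetical, it handles only the [img] tag, and it silently drops unbalanced tags.

```ruby
require "strscan"

# Hypothetical helper: returns each top-level [img]...[/img] span,
# keeping any nested [img] tags inside the span they belong to.
def top_level_img_spans(text)
  scanner = StringScanner.new(text)
  spans = []
  depth = 0
  start = nil

  # scan_until advances to just past the next opening or closing tag.
  while scanner.scan_until(/\[\/?img\]/)
    tag = scanner.matched
    if tag == "[img]"
      # Remember where the outermost opening tag began (character index).
      start = scanner.charpos - tag.length if depth.zero?
      depth += 1
    elsif depth > 0
      depth -= 1
      spans << text[start...scanner.charpos] if depth.zero?
    end
    # A stray closing tag at depth 0 is ignored.
  end
  spans
end

p top_level_img_spans("x [img]outer [img]inner[/img] tail[/img] y [img]b[/img]")
```

Even this small amount of state (a depth counter) is more than the lazy regex can express, and a full parser would also need to handle other tags, attributes, and malformed input.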

贵在坚持 2024-10-01 16:05:38

str = "some html code [img]......[/img] some html code [img]......[/img]"
# Split on the closing tag, then strip everything up to and including "[img]".
# Note: this yields only the tag contents, without the surrounding [img] tags.
p str.split("[/img]").map { |x| x.sub(/.*\[img\]/, "") }

About the author

靑春怀旧
