删除
之间的 html 换行符标签

发布于 2024-11-17 21:38:32 字数 993 浏览 4 评论 0原文

我有一个CMS系统，允许人们也使用HTML代码，但是在函数末尾提供了一个nl2br，这使得：

<ul>
<li></li>
</ul>

变成这样：

<ul><br/>
<li></li><br/>
</ul>

现在我想删除这些< ;br/> 存在于

我已经发现另一个问题几乎相同，但换行符。我已将其集成到我的 CMS 中，但对于一个客户端，所有内容都已填写，因此我必须在将 \n 替换为 之后修复此问题。 的。

其他问题将此作为正则表达式提供以匹配 \

中的 n：

/(?<=<ul>|<\/li>)\s*?(?=<\/ul>|<li>)/is

我认为这样的事情：

/(?<=<ul>|<\/li>)(<br>|<br\/>|<br \/>)(?=<\/ul>|<li>)/is

可以解决问题，但事实并非如此。我缺少什么？

编辑

我对 DOMDocument 解决方案非常开放，如果有一种方法可以使用 xpath 查询换行符，这可能会解决我的问题。

原文

I have a CMS system that allows people to also use HTML code, but a nl2br is provided at the end of the function, which makes this:

<ul>
<li></li>
</ul>

into this:

<ul><br/>
<li></li><br/>
</ul>

Now I want to remove these <br/>'s that exist between <ul> tags.

I already found another question which asks almost the same, but for newlines. I've integrated this inside my CMS but for one client all the content is already filled in so I have to fix this after the \n's are replaced with <br/>'s.

The other question provides this as a regex to match \n within <ul></ul>:

/(?<=<ul>|<\/li>)\s*?(?=<\/ul>|<li>)/is

I'd think something like this:

/(?<=<ul>|<\/li>)(<br>|<br\/>|<br \/>)(?=<\/ul>|<li>)/is

Would do the trick, but it doesn't. What am I missing?

EDIT

I am very open for DOMDocument solutions, if there's a way to query linebreaks with xpath this would probably fix my problem.

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

因为看清所以看轻 2024-11-24 21:38:32

在您提供的示例中，标记被一些空格包围（至少被换行符包围），因此需要在相应的正则表达式中反映出来。

/(?<=<ul>|<\/li>)(\s*<br>\s*|\s*<br\/>\s*|\s*<br \/>\s*)(?=<\/ul>|<li>)/is

在许多情况下，正则表达式并不是解析 HTML 的最佳方法（我绝对同意上面/下面的评论），但对于某些特定目的来说，它们总是足够好的。

In the example you provided, <br> tags are surrounded by some white-space (at least by new line characters), so this needs to be reflected in the corresponding regular expression.

/(?<=<ul>|<\/li>)(\s*<br>\s*|\s*<br\/>\s*|\s*<br \/>\s*)(?=<\/ul>|<li>)/is

In many cases regular expressions are NOT the best way for parsing HTML (I definitely agree with the comments above/below), but they are always good enough for some particular purposes.

回复收藏 0 原文