PHP正则表达式问题

发布于 2024-10-21 00:52:33 字数 502 浏览 0 评论 0原文

我有一个存储书页的字符串。它是这样的：

///0///
Page1 Text
///1///
Page2 Text
///2///
Page3 Text
///3///

我想提取页面文本（Page1 文本、Page2 文本、Page3 文本）。这是正在使用的正则表达式：

$format = "%///\d*///(.*)///\d*///%";
preg_replace_callback($format, "process_page", $text);

根据此页面我可以在表达式的开头和结尾使用除 / 之外的其他字符。所以我使用 % 来简化我的模式，所以我不必使用像这样的转义字符 \/

对我来说似乎没问题，但它什么也不返回。有人可以告诉我问题出在哪里吗？

原文

I have a string which I store book pages. It's something like this:

///0///
Page1 Text
///1///
Page2 Text
///2///
Page3 Text
///3///

I want to extract page texts (Page1 Text, Page2 Text, Page3 Text). Here is the Regular Expression which is am using:

$format = "%///\d*///(.*)///\d*///%";
preg_replace_callback($format, "process_page", $text);

According to this page I can use other character than / in the start and end of the expression. So I used % to simplify my pattern, so I don't have to use escape character like this \/

It seems okay to me, but it return nothing. Can somebody please tell me where is the problem?

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

甜心 2024-10-28 00:52:33

我认为 preg_split 对您来说可能是更好的选择：

$text = '
Page1 Text
///1///
Page2 Text
///2///
Page3 Text
';

$format = "%///\d+///%";
$arr = preg_split($format, $text);

// $arr = Array
// ( 
//     [0] => Page1 Text
//
//     [1] => 
// Page2 Text
// 
//     [2] => 
// Page3 Text
// )

每个页面现在都有自己的数组元素。

I think preg_split might be a better option for you:

$text = '
Page1 Text
///1///
Page2 Text
///2///
Page3 Text
';

$format = "%///\d+///%";
$arr = preg_split($format, $text);

// $arr = Array
// ( 
//     [0] => Page1 Text
//
//     [1] => 
// Page2 Text
// 
//     [2] => 
// Page3 Text
// )

Each page is now in it's own array element.

回复收藏 0 原文

雨轻弹 2024-10-28 00:52:33

我认为您需要 s 修饰符: $format = "%///\d*///(.*)///\d*///%s";

s (PCRE_DOTALL)
如果设置了此修饰符，模式中的点元字符将匹配所有字符，包括换行符。如果没有它，换行符将被排除。该修饰符相当于 Perl 的 /s 修饰符。负类（例如 [^a]）始终与换行符匹配，与此修饰符的设置无关。

我不确定你想做什么，但我个人不会为此使用正则表达式。您知道要查找的确切字符串（例如///4///），并从那里找到结束字符串（///5///< /code> 或文件结尾）。带有 strpos 的简单 substr 可能是更好的选择。

回复收藏 0 原文

一片旧的回忆 2024-10-28 00:52:33

我会使用类似 preg_spilt 的东西（参见 Tim Cooper 的回答）。

但对于您的正则表达式，请尝试以下操作：

$format = "%///\d+///(.*?)(?=///\d+///)%s";

使用环视断言和 s-修饰符。

I would use something like preg_spilt (see Tim Cooper's answer).

But for your RegEx, try this:

$format = "%///\d+///(.*?)(?=///\d+///)%s";

With Look-around assertion and s-modifier.

回复收藏 0 原文

~没有更多了~

关于作者

木有鱼丸

暂无简介

0 文章

0 评论

23 人气

关注发私信

友情链接

文江博客

PHP正则表达式问题

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（3）

关于作者

相关话题

热门标签

推荐作者

1CH1MKgiKxn9p

ゞ记忆︶ㄣ

JackDx

信远

yaoduoduo1995

霞映澄塘

友情链接

PHP正则表达式问题

如果你对这篇内容有疑问，欢迎到本站社区发帖提问 参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（3）

关于作者

相关话题

热门标签

推荐作者

1CH1MKgiKxn9p

ゞ记忆︶ㄣ

JackDx

信远

yaoduoduo1995

霞映澄塘

友情链接

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。