php preg_replace_call :提取特定值以供稍后重新插入
为了简洁起见...
我想从字符串中取出项目,将它们放入一个单独的数组中,用 ID 标记替换从字符串中提取的值,解析字符串,然后将提取的项目放回到原来的位置(以正确的顺序) 。 (如果这是有意义的,则跳过其余部分:D)
我有以下字符串;
“我的句子包含 [url] 和 [url] 的 URL,这让我的生活变得困难。”
由于各种原因,我想删除这些网址。 但我需要保留它们的位置,并稍后重新插入它们(在操作字符串的其余部分之后)。
因此我想;
“我的句子包含 [url] 和 [url] 的 URL,这让我的生活变得困难。”
成为;
“我的句子包含 [token1fortheURL] 和 [token2fortheURL] 的 URL,这让我的生活变得困难。”
我已经尝试过多次以各种方式做到这一点。 我所做的就是撞砖墙并发明新的脏话!
我使用以下代码进行设置;
$mystring = 'my sentence contains URLs to http://www.google.com/this.html and http://www.yahoo.com which makes my life difficult.';
$myregex = '/(((?:https?|ftps?)\:\/\/)?([a-zA-Z0-9:]*[@])?([a-zA-Z0-9\-\.]+\.[a-zA-Z]{2,3}|([0-9]+))([a-zA-Z0-9-._?,\'\/\+&%\$#\=~:]+)?)/';
$myextractions = array();
然后我执行 preg_replace_callback;
$matches = preg_replace_callback($myregex,'myfunction',$mystring);
我的职能如下;
function myfunction ($matches) {}
正是在这里,砖墙开始出现。 我可以将内容放入空白提取数组中 - 但它们在函数外部不可用。我可以使用令牌更新字符串,但无法访问被替换的 URL。 我似乎无法向 preg_replace_callback 中的函数调用添加其他值。
我希望有人能提供帮助,因为这让我发疯。
更新:
基于@Lepidosteus建议的解决方案, 我想我有以下工作?
$mystring = 'my sentence contains URLs to http://www.google.com/this.html and http://www.yahoo.com which makes my life difficult.';
$myregex = '/(((?:https?|ftps?)\:\/\/)?([a-zA-Z0-9:]*[@])?([a-zA-Z0-9\-\.]+\.[a-zA-Z]{2,3}|([0-9]+))([a-zA-Z0-9-._?,\'\/\+&%\$#\=~:]+)?)/';
$tokenstart = ":URL:";
$tokenend = ":";
function extraction ($myregex, $mystring, $mymatches, $tokenstart, $tokenend) {
$test1 = preg_match_all($myregex,$mystring,$mymatches);
$mymatches = array_slice($mymatches, 0, 1);
$thematches = array();
foreach ($mymatches as $match) {
foreach ($match as $key=>$match2) {
$thematches[] = array($match2, $tokenstart.$key.$tokenend);
}
}
return $thematches;
}
$matches = extraction ($myregex, $mystring, $mymatches, $tokenstart, $tokenend);
echo "1) ".$mystring."<br/>";
// 1) my sentence contains URLs to http://www.google.com/this.html and http://www.yahoo.com which makes my life difficult.
function substitute($matches,$mystring) {
foreach ($matches as $match) {
$mystring = str_replace($match[0], $match[1], $mystring);
}
return $mystring;
}
$mystring = substitute($matches,$mystring);
echo "2) ".$mystring."<br/>";
// 2) my sentence contains URLs to :URL:0: and :URL:1: which makes my life difficult.
function reinsert($matches,$mystring) {
foreach ($matches as $match) {
$mystring = str_replace($match[1], $match[0], $mystring);
}
return $mystring;
}
$mystring = reinsert($matches,$mystring);
echo "3) ".$mystring."<br/>";
// 3) my sentence contains URLs to http://www.google.com/this.html and http://www.yahoo.com which makes my life difficult.
这看起来有效吗?
For the sake of brevity...
I want to take items out of a string, put them into a separate array, replace the values extracted from the string with ID'd tokens, parse the string, then put the extracted items back in their original positions (in the correct order).
(If that makes sense, then skip the rest :D)
I have the following string;
"my sentence contains URLs to [url] and [url] which makes my life difficult."
For various reasons, I would like to remove the URLs.
But I need to keep their place, and reinsert them later (after manipulating the rest of the string).
Thus I would like;
"my sentence contains URLs to [url] and [url] which makes my life difficult."
to become;
"my sentence contains URLs to [token1fortheURL] and [token2fortheURL] which makes my life difficult."
I've tried doing this several times, various ways.
All I do is hit brickwalls and invent new swear words!
I use the following code to setup with;
$mystring = 'my sentence contains URLs to http://www.google.com/this.html and http://www.yahoo.com which makes my life difficult.';
$myregex = '/(((?:https?|ftps?)\:\/\/)?([a-zA-Z0-9:]*[@])?([a-zA-Z0-9\-\.]+\.[a-zA-Z]{2,3}|([0-9]+))([a-zA-Z0-9-._?,\'\/\+&%\$#\=~:]+)?)/';
$myextractions = array();
I then do a preg_replace_callback;
$matches = preg_replace_callback($myregex,'myfunction',$mystring);
And I have my function as follows;
function myfunction ($matches) {}
And it's here that the brickwalls start happening.
I can put stuff into the blank extraction array - but they are nto available outside the function. I can update the string with tokens, but I lose access to the URLs that are replaced.
I cannot seem to add additional values to the function call within the preg_replace_callback.
I'm hoping someone can help, as this is driving me nuts.
UPDATE:
Based on the solution suggested by @Lepidosteus,
I think I have the following working?
$mystring = 'my sentence contains URLs to http://www.google.com/this.html and http://www.yahoo.com which makes my life difficult.';
$myregex = '/(((?:https?|ftps?)\:\/\/)?([a-zA-Z0-9:]*[@])?([a-zA-Z0-9\-\.]+\.[a-zA-Z]{2,3}|([0-9]+))([a-zA-Z0-9-._?,\'\/\+&%\$#\=~:]+)?)/';
$tokenstart = ":URL:";
$tokenend = ":";
function extraction ($myregex, $mystring, $mymatches, $tokenstart, $tokenend) {
$test1 = preg_match_all($myregex,$mystring,$mymatches);
$mymatches = array_slice($mymatches, 0, 1);
$thematches = array();
foreach ($mymatches as $match) {
foreach ($match as $key=>$match2) {
$thematches[] = array($match2, $tokenstart.$key.$tokenend);
}
}
return $thematches;
}
$matches = extraction ($myregex, $mystring, $mymatches, $tokenstart, $tokenend);
echo "1) ".$mystring."<br/>";
// 1) my sentence contains URLs to http://www.google.com/this.html and http://www.yahoo.com which makes my life difficult.
function substitute($matches,$mystring) {
foreach ($matches as $match) {
$mystring = str_replace($match[0], $match[1], $mystring);
}
return $mystring;
}
$mystring = substitute($matches,$mystring);
echo "2) ".$mystring."<br/>";
// 2) my sentence contains URLs to :URL:0: and :URL:1: which makes my life difficult.
function reinsert($matches,$mystring) {
foreach ($matches as $match) {
$mystring = str_replace($match[1], $match[0], $mystring);
}
return $mystring;
}
$mystring = reinsert($matches,$mystring);
echo "3) ".$mystring."<br/>";
// 3) my sentence contains URLs to http://www.google.com/this.html and http://www.yahoo.com which makes my life difficult.
That appears to work?
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
解决这里问题的关键是将 url 列表存储在外部容器中,回调和主代码可以访问该列表,以便对它们进行所需的更改。为了记住您的网址位置,我们将在字符串中使用自定义标记。
请注意,如果您不能使用 php,要访问容器,我使用 闭包 5.3 出于某种原因,您需要将它们替换为另一种从回调中访问 $url_tokens 容器的方法,这应该不是问题。
执行时给出以下结果:
The key to solving your problem here is to store the urls list in an outside container than can be accessed by your callbacks and by your main code to do the changes you need on them. To remember your urls positions, we will use a custom token in the string.
Note that to access the container I use closures, if you can't use php 5.3 for some reason you will need to replace them with another way to access the $url_tokens container from within the callback, which shouldn't be a problem.
Which give this result when executed: