Getting repeated formatted segments from a string

Posted on 2024-10-20 17:38:28

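I'm developing a gridiron-style score game for a forum and need help writing a regex parser to parse sets of games.

Each post may use any of the following formats (some people separate games with commas, some join the scores with a hyphen, and any combination of the two is possible):

TEAMA 25-31 TEAMB TEAMC 28-35 TEAMD TEAME 38-10 TEAMF TEAMG 21-15 TEAMH

TEAMA 25 31 TEAMB TEAMC 28 35 TEAMD TEAME 38 10 TEAMF TEAMG 21 15 TEAMH

TEAMA 25-31 TEAMB, TEAMC 28-35 TEAMD, TEAME 38-10 TEAMF, TEAMG 21-15 TEAMH

TEAMA 25 31 TEAMB, TEAMC 28 35 TEAMD, TEAME 38 10 TEAMF, TEAMG 21 15 TEAMH

Basically, team names are always 5 characters long and the scores sit between the two teams, but the number of games in a single post is not always the same: a post might contain one game or 20. There may also be extra text before or after the games, and the games still need to be extracted. I just need each game to be split out, i.e. [TEAMA] [SCORE] [SCORE] [TEAMB] would be considered one game.

I started with explode() but didn't have much luck, and I don't have much regex experience, so I'm looking for a flexible approach that handles the cases above.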

把回忆走一遍 2024-10-27 17:38:28

It's easier to match each result than to split them, e.g.:

preg_match_all('/(?P<teamA>\w{5})\s+(?P<scoreA>\d+)[\s-](?P<scoreB>\d+)\s+(?P<teamB>\w{5})/', $str, $m, PREG_SET_ORDER);
print_r($m);

This gives you, for each result, something like:

[0] => Array
    (
        [0] => TEAMA 25 31 TEAMB
        [teamA] => TEAMA
        [1] => TEAMA
        [scoreA] => 25
        [2] => 25
        [scoreB] => 31
        [3] => 31
        [teamB] => TEAMB
        [4] => TEAMB
    )
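
As a rough check against the four formats listed in the question, the sketch below runs the same pattern over one sample per format. The sample strings and the `$samples` and `$counts` names are assumptions made for illustration, not part of the original answer.

```php
<?php
// Pattern from the answer above: 5 word characters, a score,
// one whitespace character or hyphen, a score, 5 word characters.
$pattern = '/(?P<teamA>\w{5})\s+(?P<scoreA>\d+)[\s-](?P<scoreB>\d+)\s+(?P<teamB>\w{5})/';

// Hypothetical sample posts, one per format described in the question.
$samples = [
    'hyphen'       => 'TEAMA 25-31 TEAMB TEAMC 28-35 TEAMD',
    'space'        => 'TEAMA 25 31 TEAMB TEAMC 28 35 TEAMD',
    'comma_hyphen' => 'TEAMA 25-31 TEAMB, TEAMC 28-35 TEAMD',
    'comma_space'  => 'TEAMA 25 31 TEAMB, TEAMC 28 35 TEAMD',
];

$counts = [];
foreach ($samples as $label => $post) {
    // preg_match_all() returns the number of full matches, i.e. games found.
    $counts[$label] = preg_match_all($pattern, $post, $m, PREG_SET_ORDER);
    foreach ($m as $game) {
        echo "$label: {$game['teamA']} {$game['scoreA']} - {$game['scoreB']} {$game['teamB']}\n";
    }
}
```

Note that `[\s-]` accepts exactly one separator character between the two scores; if posts may contain more (for example two spaces), `[\s-]+` would be one way to tolerate that.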

春风十里 2024-10-27 17:38:28

You could try a regular expression like this (assumes team names are alphanumeric)

([a-zA-Z0-9]{5})\s+(\d+)[\s-](\d+)\s+([a-zA-Z0-9]{5})

http://rubular.com/r/v4HGNzo3UY
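
The answer gives only the pattern, so here is a minimal sketch of how it might be applied in PHP. The sample post, the `$games` array and its key names are assumptions for illustration.

```php
<?php
// Pattern from the answer above. Numbered groups:
// 1 = first team, 2 and 3 = scores, 4 = second team.
$pattern = '/([a-zA-Z0-9]{5})\s+(\d+)[\s-](\d+)\s+([a-zA-Z0-9]{5})/';

// Hypothetical post with extra text before and after the games.
$post = 'My picks: TEAMA 25-31 TEAMB, TEAMC 28 35 TEAMD. Good luck!';

$games = [];
if (preg_match_all($pattern, $post, $matches, PREG_SET_ORDER)) {
    foreach ($matches as $m) {
        $games[] = [
            'teamA'  => $m[1],
            'scoreA' => (int) $m[2], // cast the captured digits to integers
            'scoreB' => (int) $m[3],
            'teamB'  => $m[4],
        ];
    }
}

print_r($games);
```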

荭秂 2024-10-27 17:38:28

An alternative:

$raw_str = "TEAMA 25-31 TEAMB TEAMC 28-35 TEAMD TEAME 38-10 TEAMF TEAMG 21-15 TEAMH";
preg_match_all('/(?<first_team_name>[A-Z]+)\s+(?<first_team_score>[0-9]+)-(?<second_team_score>[0-9]+)\s+(?<second_team_name>[A-Z]+)/i',$raw_str,$matches);
$scores = array();
foreach($matches[0] as $index => $match)
{
    $scores[] = array(
                    'first_team_name' =>  $matches['first_team_name'][$index],
                    'first_team_score' =>  $matches['first_team_score'][$index],
                    'second_team_name' =>  $matches['second_team_name'][$index],
                    'second_team_score' =>  $matches['second_team_score'][$index]
                    );
}

print_r($scores);

Output:

Array
(
[0] => Array
    (
        [first_team_name] => TEAMA
        [first_team_score] => 25
        [second_team_name] => TEAMB
        [second_team_score] => 31
    )

[1] => Array
    (
        [first_team_name] => TEAMC
        [first_team_score] => 28
        [second_team_name] => TEAMD
        [second_team_score] => 35
    )

[2] => Array
    (
        [first_team_name] => TEAME
        [first_team_score] => 38
        [second_team_name] => TEAMF
        [second_team_score] => 10
    )

[3] => Array
    (
        [first_team_name] => TEAMG
        [first_team_score] => 21
        [second_team_name] => TEAMH
        [second_team_score] => 15
    )

)
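
The re-indexing loop above could plausibly be shortened by asking `preg_match_all()` for one array per game via `PREG_SET_ORDER` and then dropping the numeric keys. This is a sketch of that variation, not the answerer's code; it needs PHP 7.4 or later for the arrow function, and the shortened sample string is an assumption.

```php
<?php
$raw_str = "TEAMA 25-31 TEAMB TEAMC 28-35 TEAMD";

// Same pattern as the answer above (it expects a hyphen between the scores).
$pattern = '/(?<first_team_name>[A-Z]+)\s+(?<first_team_score>[0-9]+)-(?<second_team_score>[0-9]+)\s+(?<second_team_name>[A-Z]+)/i';

// PREG_SET_ORDER groups the captures per match instead of per group.
preg_match_all($pattern, $raw_str, $matches, PREG_SET_ORDER);

// Each match holds both numeric and named keys; keep only the named ones.
$scores = array_map(
    fn(array $m) => array_filter($m, 'is_string', ARRAY_FILTER_USE_KEY),
    $matches
);

print_r($scores);
```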

昵称有卵用 2024-10-27 17:38:28

The question says: "Just need each game to be split out, i.e. [TEAMA] [SCORE] [SCORE] [TEAMB] would be considered one game."

To tightly validate the 5-word-character strings, use word boundaries (\b) on the outside edges of the game segment. To match the unknown non-numeric delimiter between the two scores, use \D+ to match one or more non-digits.

There is no need for any capture groups: match each game as a full-string match and read the results from the first element of the matches array ($matches[0]).

Code:

$round = 'some text TEAMA 25-31 TEAMB TEAMC 28-35 TEAMD TEAME 38-10 TEAMF TEAMG 21-15 TEAMH some other text';
preg_match_all('/\b\w{5} \d+\D+\d+ \w{5}\b/', $round, $matches);
var_export($matches[0]);

Output:

array (
  0 => 'TEAMA 25-31 TEAMB',
  1 => 'TEAMC 28-35 TEAMD',
  2 => 'TEAME 38-10 TEAMF',
  3 => 'TEAMG 21-15 TEAMH',
)

If you want to parse the first game's data into an array, you can use sscanf() to generate an array of strings and integers.

var_export(sscanf($matches[0][0], '%s%d%*[^0-9]%d%s'));

Output:

array (
  0 => 'TEAMA',
  1 => 25,
  2 => 31,
  3 => 'TEAMB',
)

Or declare individual variables:

sscanf($matches[0][0], '%s%d%*[^0-9]%d%s', $team1, $score1, $score2, $team2);
var_dump($team1, $score1, $score2, $team2);

Output:

string(5) "TEAMA"
int(25)
int(31)
string(5) "TEAMB"
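
The examples above parse only the first game. As a sketch of how the same `sscanf()` format might be applied to every match, the loop below collects one parsed row per game; the shortened input string and the `$games` name are assumptions for illustration.

```php
<?php
$round = 'some text TEAMA 25-31 TEAMB TEAMC 28-35 TEAMD TEAME 38-10 TEAMF some other text';

// Same pattern as above: each full match is one game.
preg_match_all('/\b\w{5} \d+\D+\d+ \w{5}\b/', $round, $matches);

$games = [];
foreach ($matches[0] as $game) {
    // %*[^0-9] consumes the non-digit separator between the scores
    // without storing it, so each row has four elements.
    $games[] = sscanf($game, '%s%d%*[^0-9]%d%s');
}

var_export($games);
```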
