使用 php 将 HTML 输出转换为纯文本

发布于 2024-12-20 02:10:12 字数 1734 浏览 7 评论 0原文

我正在尝试将示例 HTML 输出转换为纯文本，但我不知道如何操作。我使用 file_get_contents 但我尝试转换的页面返回的结果最相似。

$raw = "http://localhost/guestbook/profiles.php";
$file_converted = file_get_contents($raw);
echo $file_converted;

profile.php

<html>
    <head>
        <title>Profiles - GuestBook</title>
        <link rel="stylesheet" type="text/css" href="css/style.css">
    </head>
<body>
    <!-- Some Divs -->
    <div id="profile-wrapper">
        <h2>Profile</h2>
        <table>
            <tr>
                <td>Name:</td><td> John Dela Cruz</td>
            </tr>
            <tr>
                <td>Age:</td><td>15</td>
            </tr>
            <tr>
                <td>Location:</td><td> SomewhereIn, Asia</td>
            </tr>
        </table>
    </div>
</body>
</html>

基本上，我试图回显类似的内容（纯文本，无样式），

Profile
Name: John Dela Cruz
Age: 15
Location: SomewhereIn, Asia

但我不知道如何。 :-( 。请帮助我，提前谢谢你们。

编辑：因为我只关注页面的内容，无论它是样式还是纯文本，有没有办法只选择（参见下面的代码））使用 file_get_contents() ？

 <h2>Profile</h2>
        <table>
            <tr>
                <td>Name:</td><td> John Dela Cruz</td>
            </tr>
            <tr>
                <td>Age:</td><td>15</td>
            </tr>
            <tr>
                <td>Location:</td><td> SomewhereIn, Asia</td>
            </tr>
        </table>

原文

I'm trying to convert my sample HTML output into a plain text but I don't know how. I use file_get_contents but the page which I'm trying to convert returns most like the same.

$raw = "http://localhost/guestbook/profiles.php";
$file_converted = file_get_contents($raw);
echo $file_converted;

profiles.php

<html>
    <head>
        <title>Profiles - GuestBook</title>
        <link rel="stylesheet" type="text/css" href="css/style.css">
    </head>
<body>
    <!-- Some Divs -->
    <div id="profile-wrapper">
        <h2>Profile</h2>
        <table>
            <tr>
                <td>Name:</td><td> John Dela Cruz</td>
            </tr>
            <tr>
                <td>Age:</td><td>15</td>
            </tr>
            <tr>
                <td>Location:</td><td> SomewhereIn, Asia</td>
            </tr>
        </table>
    </div>
</body>
</html>

Basically, I trying to echo out something like this (plain text, no styles)

Profile
Name: John Dela Cruz
Age: 15
Location: SomewhereIn, Asia

but i don't know how. :-( . Please help me guys , thank you in advance.

EDIT: Since i am only after of the content of the page, no matter if it's styled or just a plain text , is there a way to select only (see code below) using file_get_contents() ?

 <h2>Profile</h2>
        <table>
            <tr>
                <td>Name:</td><td> John Dela Cruz</td>
            </tr>
            <tr>
                <td>Age:</td><td>15</td>
            </tr>
            <tr>
                <td>Location:</td><td> SomewhereIn, Asia</td>
            </tr>
        </table>

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

温折酒 2024-12-27 02:10:12

使用 php strip_tags

如果 strip_tags 不起作用，那么也许你可以使用正则表达式提取您想要的信息。

尝试使用 PHP preg_match 和 /(。 *?<\/td>)/ 作为模式

回复收藏 0 原文

罪#恶を代价 2024-12-27 02:10:12

看看 simplexml_load_file()：

http://www.php .net/manual/en/function.simplexml-load-file.php

它将允许您将 HTML 数据加载到对象 (SimpleXMLElement) 中并像树一样遍历该对象。

回复收藏 0 原文

旧时光的容颜 2024-12-27 02:10:12

尝试使用 PHP 函数 strip_tags

回复收藏 0 原文

软糯酥胸 2024-12-27 02:10:12

试试这个，

<?php
$data = file_get_contents("your_file");
preg_match_all('|<div[^>]*?>(.*?)</div>|si',$data, $result);
print_r($result[0][0]);
?>

我已经尝试过这个，它似乎对我有用，我希望对你也有用

try this one,

<?php
$data = file_get_contents("your_file");
preg_match_all('|<div[^>]*?>(.*?)</div>|si',$data, $result);
print_r($result[0][0]);
?>

I have try this one, and it seems work for me, for you too i hope

回复收藏 0 原文

大姐，你呐 2024-12-27 02:10:12

您可以使用 strip_tags php 函数来实现此目的。浏览 strip_tags 函数的 php 手册页中的注释，了解如何以良好的方式使用它。

回复收藏 0 原文

~没有更多了~

关于作者

独享拥抱

暂无简介

文章

28 人气

关注发私信

櫻之舞

文章 0 评论 0

关注

弥枳

文章 0 评论 0

关注

m2429

文章 0 评论 0

关注

寻找一个思念的角度

文章 0 评论 0

关注

野却迷人

文章 0 评论 0

关注

我怀念的。

文章 0 评论 0

友情链接

文江博客

使用 php 将 HTML 输出转换为纯文本

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（5）

关于作者

相关话题

热门标签

推荐作者

櫻之舞

弥枳

m2429

寻找一个思念的角度

野却迷人

我怀念的。

友情链接

使用 php 将 HTML 输出转换为纯文本

如果你对这篇内容有疑问，欢迎到本站社区发帖提问 参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（5）

关于作者

相关话题

热门标签

推荐作者

櫻之舞

弥枳

m2429

寻找一个思念的角度

野却迷人

我怀念的。

友情链接

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。