当前位置：文江博客话题详情

在 Web 开发过程中，我将花费多少时间用于用户输入验证？

发布于 2024-07-05 08:27:53 字数 90 浏览 7 评论 0原文

我是网络开发方面的新手。到目前为止，我花了很多时间（50% 左右）来尝试阻止坏人将 sql 注入之类的东西放入我的输入表单中并在服务器端验证它。这是正常的吗？

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

小梨窩很甜 2024-07-12 08:27:53

@Jeremy - 一些 PHP 细节

当涉及到数据库查询时，总是尝试使用准备好的参数化查询。 mysqli 和 PDO 库支持这一点。这比使用 mysql_real_escape_string 等转义函数安全得多。

是的，mysql_real_escape_string 实际上只是一个字符串转义函数。它不是灵丹妙药。它所做的就是转义危险字符，以便它们可以安全地在单个查询字符串中使用。但是，如果您不事先清理您的输入，那么您将很容易受到某些攻击媒介的攻击。

想象一下以下 SQL：

$result = "SELECT fields FROM table WHERE id = ".mysql_real_escape_string($_POST['id']);

您应该能够看到它很容易被利用。
想象一下 id 参数包含常见的攻击向量：

1 OR 1=1

它们的编码中没有危险的字符，因此它将直接通过转义过滤器。留给我们的是：

SELECT fields FROM table WHERE id = 1 OR 1=1

这是一个可爱的 SQL 注入向量。

虽然这些功能很有用，但必须小心使用。您需要确保所有网络输入都在某种程度上得到验证。在这种情况下，我们发现我们可以被利用，因为我们没有检查我们用作数字的变量实际上是数字。在 PHP 中，您应该广泛使用一组函数来检查输入是否为整数、浮点数、字母数字等。但是当涉及 SQL 时，请最注意准备语句的值。如果上面的代码是准备好的语句，那么它是安全的，因为数据库函数会知道 1 OR 1=1 不是有效的文字。

至于htmlspecialchars()。这本身就是一个雷区。

PHP 中存在一个真正的问题，因为它有一系列不同的 html 相关转义函数，并且没有明确说明哪些函数具体做什么。

首先，如果您位于 HTML 标记内，那么您就有麻烦了。看看

echo '<img src= "' . htmlspecialchars($_GET['imagesrc']) . '" />';

我们已经在 HTML 标签内了，所以我们不需要 << 或> 去做任何危险的事情。我们的攻击向量可能只是 javascript:alert(document.cookie)

现在生成的 HTML 看起来像是

<img src= "javascript:alert(document.cookie)" />

攻击直接通过了。

情况变得更糟。为什么？因为 htmlspecialchars 只编码双引号而不是单引号。因此，如果我们的

echo "<img src= '" . htmlspecialchars($_GET['imagesrc']) . ". />";

邪恶攻击者现在可以注入全新的参数

pic.png' onclick='location.href=xxx' onmouseover='...

，那么

<img src='pic.png' onclick='location.href=xxx' onmouseover='...' />

在这些情况下，没有灵丹妙药，您只需自己处理输入即可。如果你尝试过滤掉坏字符，你肯定会失败。采用白名单方法，只允许好的字符通过。请参阅 XSS 备忘单，了解向量的多样性的示例

，即使您使用 htmlspecialchars($ string）在 HTML 标签之外，您仍然容易受到多字节字符集攻击向量的攻击。

最有效的方法是使用 mb_convert_encoding 和 htmlentities 的组合，如下所示。

$str = mb_convert_encoding($str, ‘UTF-8′, ‘UTF-8′);
$str = htmlentities($str, ENT_QUOTES, ‘UTF-8′);

即使这样，IE6 仍然容易受到攻击，因为它处理 UTF 的方式。但是，您可以回退到更有限的编码，例如 ISO-8859-1，直到 IE6 使用率下降。

@Jeremy - some PHP specifics

When it comes to database queries, always try and use prepared parameterised queries. The mysqli and PDO libraries support this. This is infinitely safer than using escaping functions such as mysql_real_escape_string.

Yes, mysql_real_escape_string is effectively just a string escaping function. It is not a magic bullet. All it will do is escape dangerous characters in order that they can be safe to use in a single query string. However, if you do not sanitise your inputs beforehand, then you will be vulnerable to certain attack vectors.

Imagine the following SQL:

$result = "SELECT fields FROM table WHERE id = ".mysql_real_escape_string($_POST['id']);

You should be able to see that this is vulnerable to exploit.
Imagine the id parameter contained the common attack vector:

1 OR 1=1

There's no risky chars in their to encode, so it will pass straight through the escaping filter. Leaving us:

SELECT fields FROM table WHERE id = 1 OR 1=1

Which is a lovely SQL injection vector.

Whilst these functions are useful, they must be used with care. You need to ensure that all web inputs are validated to some degree. In this case, we see that we can be exploited because we didn't check that a variable we were using as a number, was actually numeric. In PHP you should widely use a set of functions to check that inputs are integers, floats, alphanumeric etc. But when it comes to SQL, heed most the value of the prepared statement. The above code would have been secure if it was a prepared statement as the database functions would have known that 1 OR 1=1 is not a valid literal.

As for htmlspecialchars(). That's a minefield of its own.

There's a real problem in PHP in that it has a whole selection of different html-related escaping functions, and no clear guidance on exactly which functions do what.

Firstly, if you are inside an HTML tag, you are in real trouble. Look at

echo '<img src= "' . htmlspecialchars($_GET['imagesrc']) . '" />';

We're already inside an HTML tag, so we don't need < or > to do anything dangerous. Our attack vector could just be javascript:alert(document.cookie)

Now resultant HTML looks like

<img src= "javascript:alert(document.cookie)" />

The attack gets straight through.

It gets worse. Why? because htmlspecialchars only encodes double quotes and not single. So if we had

echo "<img src= '" . htmlspecialchars($_GET['imagesrc']) . ". />";

Our evil attacker can now inject whole new parameters

pic.png' onclick='location.href=xxx' onmouseover='...

gives us

<img src='pic.png' onclick='location.href=xxx' onmouseover='...' />

In these cases, there is no magic bullet, you just have to santise the input yourself. If you try and filter out bad characters you will surely fail. Take a whitelist approach and only let through the chars which are good. Look at the XSS cheat sheet for examples on how diverse vectors can be

Even if you use htmlspecialchars($string) outside of HTML tags, you are still vulnerable to multi-byte charset attack vectors.

The most effective you can be is to use the a combination of mb_convert_encoding and htmlentities as follows.

$str = mb_convert_encoding($str, ‘UTF-8′, ‘UTF-8′);
$str = htmlentities($str, ENT_QUOTES, ‘UTF-8′);

Even this leaves IE6 vulnerable, because of the way it handles UTF. However, you could fall back to a more limited encoding, such as ISO-8859-1, until IE6 usage drops off.

回复收藏 0 原文

听不够的曲调 2024-07-12 08:27:53

为了防止 SQL 注入攻击，只需使用准备好的语句进行查询（具体方式取决于您的平台）。一旦你这样做了，你就再也不用为这个特定的方面而烦恼了。您只需在任何地方使用它即可。

至于一般的输入验证，最好依靠一个公共基础来测试所需的字段、数字等。例如，ASP.Net 的验证器非常易于使用。您应该遵循的一条经验法则是不要相信客户端（javascript）会为您执行此操作，因为很容易绕过它。始终首先在服务器端进行。

需要注意的一个特殊情况是，当您允许引入可能包含 html/javascript 的丰富内容时。这可能允许恶意用户在您的数据中注入 JavaScript，从而在渲染数据时触发您无法控制的代码。不要不要尝试使用自己的验证代码。在网络上搜索免费的、经过测试的、维护的代码，可以为您做到这一点。杰夫在一个播客中就这方面提出了一些建议。

一旦您自动化输入验证代码，执行此操作所花费的时间应该与业务规则的复杂性直接相关。所以作为一般规则：保持简单。

回复收藏 0 原文

远山浅 2024-07-12 08:27:53

不，这不正常。也许您需要：

使用权限组件来避免 SQL 注入（Java 中的 PreparedStatements）
创建一个“过滤”来自用户的消息的组件（Java 中的 servlet Filter）。

任何现代语言都支持这两件事。

亲切的问候

回复收藏 0 原文

黯淡〆 2024-07-12 08:27:53

我很高兴你能注意保护自己。太多人没有。

然而，正如其他人所说，更好的架构选择将使您的问题消失。使用准备好的语句（大多数语言应该支持它）将使 SQL 注入攻击消失。再加上许多数据库，它们将带来显着更好的性能。处理跨站点脚本攻击更加棘手。但基本策略必须是决定如何转义用户输入，决定在哪里转义它，并且始终在同一个地方进行。不要陷入“越多越好”的陷阱！在一个地方始终以一种方式执行此操作就足够了，并且可以避免您必须找出多个转义级别中的哪一个导致了特定的错误。

或者学习如何创建和维护健全的架构的课程需要经验。此外，它需要反思你的糟糕经历。因此，请注意您当前的痛点（看起来您确实是痛点），并思考您可以采取哪些不同的措施来避免它们。如果您有导师，请与您的导师交谈。这并不总是对你的这个项目有很大帮助，但对下一个项目会有帮助。

回复收藏 0 原文