绕过 mysql_real_escape_string() 的 SQL 注入
即使使用 mysql_real_escape_string() 函数,是否也存在 SQL 注入的可能性?
考虑这个示例情况。 SQL 是这样在 PHP 中构建的:
$login = mysql_real_escape_string(GetFromPost('login'));
$password = mysql_real_escape_string(GetFromPost('password'));
$sql = "SELECT * FROM table WHERE login='$login' AND password='$password'";
我听到很多人对我说,即使使用 mysql_real_escape_string()
函数,这样的代码仍然很危险并且可能被黑客攻击。但我想不出任何可能的利用?
像这样的经典注射:
aaa' OR 1=1 --
不起作用。
你知道有什么可能的注入可以通过上面的 PHP 代码吗?
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(4)
简短的回答是是的,是的,有一种方法可以绕过
mysql_real_escape_string()
。#对于非常模糊的边缘情况!!!
长答案并不那么容易。它基于此处演示的攻击。
攻击
那么,让我们首先展示攻击......
在某些情况下,这将返回超过 1 行。让我们剖析一下这里发生的事情:
选择字符集
要使此攻击发挥作用,我们需要服务器期望连接上的编码将
'
编码为 ASCII,即0x27
和 得到一些最终字节为 ASCII\
的字符,即0x5c
。事实证明,MySQL 5.6 默认支持 5 种这样的编码:big5
、cp932
、gb2312
、gbk和<代码>sjis。我们将在这里选择
gbk
。现在,请注意此处
SET NAMES
的使用,这一点非常重要。这将字符集设置在服务器上。如果我们使用对 C API 函数mysql_set_charset()
的调用,那就没问题(自 2006 年以来的 MySQL 版本)。但稍后会详细说明原因...有效负载
我们将用于此注入的有效负载以字节序列
0xbf27
开头。在gbk
中,这是一个无效的多字节字符;在latin1
中,它是字符串¿'
。请注意,在latin1
和gbk
中,0x27
本身就是一个文字'
性格。我们选择此有效负载是因为,如果我们对其调用
addslashes()
,我们将插入一个 ASCII\
即0x5c
,在'
字符之前。因此,我们最终会得到0xbf5c27
,它在gbk
中是一个两个字符序列:0xbf5c
后跟0x27
。或者换句话说,一个有效字符后跟一个未转义的'
。但我们没有使用addslashes()
。那么继续下一步...mysql_real_escape_string()
对
mysql_real_escape_string()
的 C API 调用与addslashes()
的不同之处在于它知道连接字符集。因此它可以对服务器期望的字符集正确执行转义。然而,到目前为止,客户端认为我们仍在使用latin1
进行连接,因为我们从未告诉过它。我们确实告诉服务器我们正在使用gbk
,但客户端仍然认为它是latin1
。因此,对
mysql_real_escape_string()
的调用会插入反斜杠,并且我们的“转义”内容中有一个自由悬挂的'
字符!事实上,如果我们查看gbk
字符集中的$var
,我们会看到:这是到底是什么攻击需要。
查询
这部分只是形式,但这是呈现的查询:
恭喜,您刚刚使用
mysql_real_escape_string()
成功攻击了一个程序...坏处
情况变得更糟。
PDO
默认使用 MySQL模拟准备好的语句。这意味着在客户端,它基本上通过 mysql_real_escape_string() (在 C 库中)执行 sprintf,这意味着以下操作将导致成功注入:现在,值得注意的是,您可以通过禁用模拟准备语句来防止这种情况:
这通常会产生真正的准备语句(即数据在与查询不同的数据包中发送)。但是,请注意,PDO 会默默后备 来模拟 MySQL 本身无法准备的语句:它可以的语句是 在手册中列出,但要注意选择适当的服务器版本)。
丑陋的是
我在一开始就说过,如果我们使用
mysql_set_charset('gbk')
而不是SET NAMES gbk
,我们就可以避免这一切。如果您使用的是 2006 年以来的 MySQL 版本,则确实如此。如果您使用的是早期 MySQL 版本,则 mysql_real_escape_string() 中的“noreferrer">bug 意味着无效的多字节字符(例如我们的有效负载中的字符)出于转义目的被视为单个字节,即使客户端已正确处理得知连接编码,因此此攻击仍然会成功。该错误已在 MySQL 4.1.20,5.0.22 和 5.1.11。
但最糟糕的是
PDO
直到 5.3.6 才公开mysql_set_charset()
的 C API,因此在之前的版本中它不能针对每个可能的命令防止这种攻击!它现在公开为 DSN 参数。
正如
我们在一开始所说的,要使这种攻击起作用,数据库连接必须使用易受攻击的字符集进行编码。
utf8mb4
不是容易受到攻击,但可以支持每个 Unicode 字符:因此您可以选择使用它,但它从 MySQL 5.5.3 起才可用。另一种选择是utf8
,也不易受攻击,并且可以支持整个 Unicode 基本多语言飞机。或者,您可以启用
NO_BACKSLASH_ESCAPES
SQL 模式,它(除其他外)改变mysql_real_escape_string()
的操作。启用此模式后,0x27
将被替换为0x2727
而不是0x5c27
,因此转义进程无法创建有效的以前不存在的任何易受攻击的编码中的字符(即0xbf27
仍然是0xbf27
等),因此服务器仍会拒绝该字符串视为无效。但是,请参阅 @eggyal 的回答,了解使用此 SQL 模式可能出现的不同漏洞。安全示例
以下示例是安全的:
因为服务器期望
utf8
...因为我们已经正确设置了字符集,以便客户端和服务器匹配。
因为我们已经关闭了模拟准备语句。
因为我们已经正确设置了字符集。
因为 MySQLi 始终执行真正的准备好的语句。
总结
如果您:
mysql_set_charset()
/$mysqli->set_charset( )
/ PDO 的 DSN 字符集参数(在 PHP ≥ 5.3.6 中)OR
utf8
/latin1
/ascii
/ 等)您 100% 安全。
否则,即使您使用
mysql_real_escape_string()
,您也很容易受到攻击...The short answer is yes, yes there is a way to get around
mysql_real_escape_string()
.#For Very OBSCURE EDGE CASES!!!
The long answer isn't so easy. It's based off an attack demonstrated here.
The Attack
So, let's start off by showing the attack...
In certain circumstances, that will return more than 1 row. Let's dissect what's going on here:
Selecting a Character Set
For this attack to work, we need the encoding that the server's expecting on the connection both to encode
'
as in ASCII i.e.0x27
and to have some character whose final byte is an ASCII\
i.e.0x5c
. As it turns out, there are 5 such encodings supported in MySQL 5.6 by default:big5
,cp932
,gb2312
,gbk
andsjis
. We'll selectgbk
here.Now, it's very important to note the use of
SET NAMES
here. This sets the character set ON THE SERVER. If we used the call to the C API functionmysql_set_charset()
, we'd be fine (on MySQL releases since 2006). But more on why in a minute...The Payload
The payload we're going to use for this injection starts with the byte sequence
0xbf27
. Ingbk
, that's an invalid multibyte character; inlatin1
, it's the string¿'
. Note that inlatin1
andgbk
,0x27
on its own is a literal'
character.We have chosen this payload because, if we called
addslashes()
on it, we'd insert an ASCII\
i.e.0x5c
, before the'
character. So we'd wind up with0xbf5c27
, which ingbk
is a two character sequence:0xbf5c
followed by0x27
. Or in other words, a valid character followed by an unescaped'
. But we're not usingaddslashes()
. So on to the next step...mysql_real_escape_string()
The C API call to
mysql_real_escape_string()
differs fromaddslashes()
in that it knows the connection character set. So it can perform the escaping properly for the character set that the server is expecting. However, up to this point, the client thinks that we're still usinglatin1
for the connection, because we never told it otherwise. We did tell the server we're usinggbk
, but the client still thinks it'slatin1
.Therefore the call to
mysql_real_escape_string()
inserts the backslash, and we have a free hanging'
character in our "escaped" content! In fact, if we were to look at$var
in thegbk
character set, we'd see:Which is exactly what the attack requires.
The Query
This part is just a formality, but here's the rendered query:
Congratulations, you just successfully attacked a program using
mysql_real_escape_string()
...The Bad
It gets worse.
PDO
defaults to emulating prepared statements with MySQL. That means that on the client side, it basically does a sprintf throughmysql_real_escape_string()
(in the C library), which means the following will result in a successful injection:Now, it's worth noting that you can prevent this by disabling emulated prepared statements:
This will usually result in a true prepared statement (i.e. the data being sent over in a separate packet from the query). However, be aware that PDO will silently fallback to emulating statements that MySQL can't prepare natively: those that it can are listed in the manual, but beware to select the appropriate server version).
The Ugly
I said at the very beginning that we could have prevented all of this if we had used
mysql_set_charset('gbk')
instead ofSET NAMES gbk
. And that's true provided you are using a MySQL release since 2006.If you're using an earlier MySQL release, then a bug in
mysql_real_escape_string()
meant that invalid multibyte characters such as those in our payload were treated as single bytes for escaping purposes even if the client had been correctly informed of the connection encoding and so this attack would still succeed. The bug was fixed in MySQL 4.1.20, 5.0.22 and 5.1.11.But the worst part is that
PDO
didn't expose the C API formysql_set_charset()
until 5.3.6, so in prior versions it cannot prevent this attack for every possible command!It's now exposed as a DSN parameter.
The Saving Grace
As we said at the outset, for this attack to work the database connection must be encoded using a vulnerable character set.
utf8mb4
is not vulnerable and yet can support every Unicode character: so you could elect to use that instead—but it has only been available since MySQL 5.5.3. An alternative isutf8
, which is also not vulnerable and can support the whole of the Unicode Basic Multilingual Plane.Alternatively, you can enable the
NO_BACKSLASH_ESCAPES
SQL mode, which (amongst other things) alters the operation ofmysql_real_escape_string()
. With this mode enabled,0x27
will be replaced with0x2727
rather than0x5c27
and thus the escaping process cannot create valid characters in any of the vulnerable encodings where they did not exist previously (i.e.0xbf27
is still0xbf27
etc.)—so the server will still reject the string as invalid. However, see @eggyal's answer for a different vulnerability that can arise from using this SQL mode.Safe Examples
The following examples are safe:
Because the server's expecting
utf8
...Because we've properly set the character set so the client and the server match.
Because we've turned off emulated prepared statements.
Because we've set the character set properly.
Because MySQLi does true prepared statements all the time.
Wrapping Up
If you:
mysql_set_charset()
/$mysqli->set_charset()
/ PDO's DSN charset parameter (in PHP ≥ 5.3.6)OR
utf8
/latin1
/ascii
/ etc)You're 100% safe.
Otherwise, you're vulnerable even though you're using
mysql_real_escape_string()
...考虑以下查询:
mysql_real_escape_string()
不会保护您免受这种情况的影响。事实上,您在查询中的变量周围使用单引号 (
' '
) 可以防止出现这种情况。以下也是一个选项:Consider the following query:
mysql_real_escape_string()
will not protect you against this.The fact that you use single quotes (
' '
) around your variables inside your query is what protects you against this. The following is also an option:这是另一个(也许更少?)晦涩的边缘案例!
为了向 @ircmaxell 的出色答案致敬(真的,这应该是奉承而不是抄袭!),我将采用他的格式:
攻击
从演示开始...
这将返回
test
表中的所有记录。剖析:选择 SQL 模式
如字符串文字下所述:
<块引用>
有多种方法可以在字符串中包含引号字符:
用“
'
”引用的字符串内的“'
”可以写为“''
”。用“
”
”引用的字符串内的“”
”可以写为“”“
”。在引号字符前添加转义字符(“
\
”)。用“
”
”引用的字符串中的“'
”不需要特殊处理,不需要加倍或转义。同样,“用“'
”引用的字符串内的“
”不需要特殊处理。如果服务器的 SQL 模式包括
NO_BACKSLASH_ESCAPES
,那么这些选项中的第三个(这是mysql_real_escape_string()
采用的常用方法)不可用:必须使用前两个选项之一。请注意,第四个项目符号的效果是,必须知道将用于引用文字的字符,以避免修改数据。有效负载
有效负载完全使用
"
字符启动此注入。没有特定的编码。没有特殊字符。没有奇怪的字节。mysql_real_escape_string()
幸运的是,
mysql_real_escape_string()
确实检查 SQL 模式并相应地调整其行为。请参阅libmysql.c
:
因此,如果使用
NO_BACKSLASH_ESCAPES
SQL 模式,则会调用不同的底层函数escape_quotes_for_mysql()
。如上所述,这样的函数需要知道将使用哪个字符来引用文字,以便重复它,而不会导致其他引用字符被文字重复。但是,此函数任意假设使用单引号
'
字符来引用字符串。请参阅charset.c
:
因此,无论用于引用的实际字符如何,它都会保持双引号
"
字符不变(并将所有单引号'
字符加倍)在我们的例子中,$var
与提供给mysql_real_escape_string()
的参数完全相同 - 就好像没有发生转义< em>完全没有。查询
。
有点正式,呈现的查询是:
正如我博学的朋友所说:恭喜,您刚刚使用
mysql_real_escape_string()<成功攻击了一个程序/code>...
不好的
mysql_set_charset()< /code>
没有帮助,因为这与字符集无关;也不能
mysqli::real_escape_string()
,因为这只是同一函数的不同包装。问题(如果还不是很明显的话)是对 mysql_real_escape_string() 的调用无法知道将用哪个字符引用文字,因为这由开发人员决定稍后。因此,在 NO_BACKSLASH_ESCAPES 模式下,该函数实际上不可能能够安全地转义每个输入以用于任意引用(至少,不加倍不需要的字符)加倍并因此修改你的数据)。
丑陋的
事情变得更糟。
NO_BACKSLASH_ESCAPES
在野外可能并不那么罕见,因为它需要与标准 SQL 兼容(例如,请参阅 SQL-92 规范,即语法产生并且反斜杠没有任何特殊含义)。此外,它的使用被明确推荐作为ircmaxell 的帖子描述的(早已修复)bug 的解决方法 。谁知道呢,一些 DBA 甚至可能将其配置为默认打开,以阻止使用不正确的转义方法,例如
addslashes()
。另外,新连接的 SQL 模式由服务器根据其配置进行设置(
SUPER
用户可以随时更改);因此,为了确定服务器的行为,您必须在连接后始终明确指定您所需的模式。丑陋
只要您始终显式将 SQL 模式设置为不包含
NO_BACKSLASH_ESCAPES
,或者使用单引号字符引用 MySQL 字符串文字,这个错误就不会变得 head:分别不会使用escape_quotes_for_mysql()
,或者它关于哪些引号字符需要重复的假设是正确的。因此,我建议任何使用
NO_BACKSLASH_ESCAPES
的人也启用ANSI_QUOTES
模式,因为它将强制习惯使用单引号字符串文字。请注意,如果碰巧使用双引号文字,这并不能阻止 SQL 注入,它只是降低了发生这种情况的可能性(因为正常的非恶意查询会失败)。在 PDO 中,其等效函数
PDO::quote()
及其准备好的语句模拟器调用mysql_handle_quoter()
— 正是这样做的:它确保转义文字用单引号引起来,因此您可以确定 PDO 始终不受此错误的影响。从 MySQL v5.7.6 开始,此错误已得到修复。请参阅更改日志:
安全示例
结合 ircmaxell 解释的错误,以下示例是完全安全的(假设使用 4.1.20、5.0.22、5.1.11 之后的 MySQL;或者不使用 GBK/Big5 连接编码):
...因为我们明确选择了不包含
NO_BACKSLASH_ESCAPES
的 SQL 模式。...因为我们用单引号引用字符串文字。
...因为 PDO 准备好的语句不受此漏洞的影响(ircmaxell 也是如此,前提是您使用 PHP≥5.3.6 并且已在 DSN 中正确设置字符集;或者准备好的语句模拟已被禁用) 。
...因为 PDO 的
quote()
函数不仅转义文字,而且还引用它(以单引号'
字符);请注意,在这种情况下,为了避免 ircmaxell 的错误,您必须使用 PHP≥5.3.6 并且已在 DSN 中正确设置字符集。...因为 MySQLi 准备好的语句是安全的。
总结
因此,如果您:
OR
OR
此外使用以下之一ircmaxell 总结中的解决方案,至少使用以下之一:
NO_BACKSLASH_ESCAPES
...那么您应该是完全安全的(字符串转义范围之外的漏洞除外)。
This is another, (perhaps less?) obscure EDGE CASE!!!
In homage to @ircmaxell's excellent answer (really, this is supposed to be flattery and not plagiarism!), I will adopt his format:
The Attack
Starting off with a demonstration...
This will return all records from the
test
table. A dissection:Selecting an SQL Mode
As documented under String Literals:
If the server's SQL mode includes
NO_BACKSLASH_ESCAPES
, then the third of these options—which is the usual approach adopted bymysql_real_escape_string()
—is not available: one of the first two options must be used instead. Note that the effect of the fourth bullet is that one must necessarily know the character that will be used to quote the literal in order to avoid munging one's data.The Payload
The payload initiates this injection quite literally with the
"
character. No particular encoding. No special characters. No weird bytes.mysql_real_escape_string()
Fortunately,
mysql_real_escape_string()
does check the SQL mode and adjust its behaviour accordingly. Seelibmysql.c
:Thus a different underlying function,
escape_quotes_for_mysql()
, is invoked if theNO_BACKSLASH_ESCAPES
SQL mode is in use. As mentioned above, such a function needs to know which character will be used to quote the literal in order to repeat it without causing the other quotation character from being repeated literally.However, this function arbitrarily assumes that the string will be quoted using the single-quote
'
character. Seecharset.c
:So, it leaves double-quote
"
characters untouched (and doubles all single-quote'
characters) irrespective of the actual character that is used to quote the literal! In our case$var
remains exactly the same as the argument that was provided tomysql_real_escape_string()
—it's as though no escaping has taken place at all.The Query
Something of a formality, the rendered query is:
As my learned friend put it: congratulations, you just successfully attacked a program using
mysql_real_escape_string()
...The Bad
mysql_set_charset()
cannot help, as this has nothing to do with character sets; nor canmysqli::real_escape_string()
, since that's just a different wrapper around this same function.The problem, if not already obvious, is that the call to
mysql_real_escape_string()
cannot know with which character the literal will be quoted, as that's left to the developer to decide at a later time. So, inNO_BACKSLASH_ESCAPES
mode, there is literally no way that this function can safely escape every input for use with arbitrary quoting (at least, not without doubling characters that do not require doubling and thus munging your data).The Ugly
It gets worse.
NO_BACKSLASH_ESCAPES
may not be all that uncommon in the wild owing to the necessity of its use for compatibility with standard SQL (e.g. see section 5.3 of the SQL-92 specification, namely the<quote symbol> ::= <quote><quote>
grammar production and lack of any special meaning given to backslash). Furthermore, its use was explicitly recommended as a workaround to the (long since fixed) bug that ircmaxell's post describes. Who knows, some DBAs might even configure it to be on by default as means of discouraging use of incorrect escaping methods likeaddslashes()
.Also, the SQL mode of a new connection is set by the server according to its configuration (which a
SUPER
user can change at any time); thus, to be certain of the server's behaviour, you must always explicitly specify your desired mode after connecting.The Saving Grace
So long as you always explicitly set the SQL mode not to include
NO_BACKSLASH_ESCAPES
, or quote MySQL string literals using the single-quote character, this bug cannot rear its ugly head: respectivelyescape_quotes_for_mysql()
will not be used, or its assumption about which quote characters require repeating will be correct.For this reason, I recommend that anyone using
NO_BACKSLASH_ESCAPES
also enablesANSI_QUOTES
mode, as it will force habitual use of single-quoted string literals. Note that this does not prevent SQL injection in the event that double-quoted literals happen to be used—it merely reduces the likelihood of that happening (because normal, non-malicious queries would fail).In PDO, both its equivalent function
PDO::quote()
and its prepared statement emulator call uponmysql_handle_quoter()
—which does exactly this: it ensures that the escaped literal is quoted in single-quotes, so you can be certain that PDO is always immune from this bug.As of MySQL v5.7.6, this bug has been fixed. See change log:
Safe Examples
Taken together with the bug explained by ircmaxell, the following examples are entirely safe (assuming that one is either using MySQL later than 4.1.20, 5.0.22, 5.1.11; or that one is not using a GBK/Big5 connection encoding):
...because we've explicitly selected an SQL mode that doesn't include
NO_BACKSLASH_ESCAPES
....because we're quoting our string literal with single-quotes.
...because PDO prepared statements are immune from this vulnerability (and ircmaxell's too, provided either that you're using PHP≥5.3.6 and the character set has been correctly set in the DSN; or that prepared statement emulation has been disabled).
...because PDO's
quote()
function not only escapes the literal, but also quotes it (in single-quote'
characters); note that to avoid ircmaxell's bug in this case, you must be using PHP≥5.3.6 and have correctly set the character set in the DSN....because MySQLi prepared statements are safe.
Wrapping Up
Thus, if you:
OR
OR
in addition to employing one of the solutions in ircmaxell's summary, use at least one of:
NO_BACKSLASH_ESCAPES
...then you should be completely safe (vulnerabilities outside the scope of string escaping aside).
嗯,除了
%
通配符之外,没有什么可以真正通过它。如果您使用LIKE
语句,则可能会很危险,因为如果您不过滤掉该语句,攻击者可能只将%
作为登录名,并且只能暴力破解以下密码您的任何用户。人们经常建议使用准备好的语句来使其 100% 安全,因为数据不会以这种方式干扰查询本身。
但对于这样简单的查询,执行类似
$login = preg_replace('/[^a-zA-Z0-9_]/', '', $login); 的操作可能会更有效
Well, there's nothing really that can pass through that, other than
%
wildcard. It could be dangerous if you were usingLIKE
statement as attacker could put just%
as login if you don't filter that out, and would have to just bruteforce a password of any of your users.People often suggest using prepared statements to make it 100% safe, as data can't interfere with the query itself that way.
But for such simple queries it probably would be more efficient to do something like
$login = preg_replace('/[^a-zA-Z0-9_]/', '', $login);