当前位置：文江博客话题详情

快速远程PHP技术检测图像404

发布于 2024-08-29 06:27:39 字数 162 浏览 3 评论 0原文

在包含图像之前检测远程图像是否不存在时，哪种 PHP 脚本技术运行速度最快？我的意思是，我不想下载远程图像的所有字节——只要足以检测它是否存在即可。

虽然在主题上有一点点偏差，但我想下载足够的字节来确定 JPEG 的宽度和高度信息。

对于我正在从事的系统设计而言，速度非常重要。

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

夏九 2024-09-05 06:27:39

我也修改了@Volomike的代码来获取宽度。给你...

function get_image_dim($sURL) {
  // note that for jpeg you may need to change 300 to a larger value,
  // as some height/width info is farther out in the header
  try {
    $hSock = @ fopen($sURL, 'rb');
    if ($hSock) {
      while(!feof($hSock)) {
        $vData = fread($hSock, 300);
        break;
      }
      fclose($hSock);
      if (strpos(' ' . $vData, 'JFIF')>0) {
        $vData = substr($vData, 0, 300);
        $asResult = unpack('H*',$vData);        
        $sBytes = $asResult[1];
        $width = 0;
        $height = 0;
        $hex_width = '';
        $hex_height = '';
        if (strstr($sBytes, 'ffc2')) {
          $hex_height = substr($sBytes, strpos($sBytes, 'ffc2') + 10, 4);
          $hex_width = substr($sBytes, strpos($sBytes, 'ffc2') + 14, 4);
        } else {
          $hex_height = substr($sBytes, strpos($sBytes, 'ffc0') + 10, 4);
          $hex_width = substr($sBytes, strpos($sBytes, 'ffc0') + 14, 4);
        }
        $width = hexdec($hex_width);
        $height = hexdec($hex_height);
        return array('width' => $width, 'height' => $height);
      } elseif (strpos(' ' . $vData, 'GIF')>0) {
        $vData = substr($vData, 0, 300);
        $asResult = unpack('h*',$vData);
        $sBytes = $asResult[1];
        $sBytesH = substr($sBytes, 16, 4);
        $height = hexdec(strrev($sBytesH));
        $sBytesW = substr($sBytes, 12, 4);
        $width = hexdec(strrev($sBytesW));
        return array('width' => $width, 'height' => $height);
      } elseif (strpos(' ' . $vData, 'PNG')>0) {
        $vDataH = substr($vData, 22, 4);
        $asResult = unpack('n',$vDataH);
        $height = $asResult[1];        
        $vDataW = substr($vData, 18, 4);
        $asResult = unpack('n',$vDataW);
        $width = $asResult[1];        
        return array('width' => $width, 'height' => $height);
      }
    }
  } catch (Exception $e) {}
  return FALSE;
}

所以，使用它我们...

// jpeg
$url = 'http://upload.wikimedia.org/wikipedia/commons/thumb/c/ce/Quality_comparison_jpg_vs_saveforweb.jpg/250px-Quality_comparison_jpg_vs_saveforweb.jpg';
// png
//$url = 'http://upload.wikimedia.org/wikipedia/commons/thumb/4/47/PNG_transparency_demonstration_1.png/280px-PNG_transparency_demonstration_1.png';
// gif
//$url = 'http://upload.wikimedia.org/wikipedia/commons/e/e2/Sunflower_as_gif_small.gif';

$dim = get_image_dim($url);
print_r($dim);

I've modified the @Volomike's code to get width too. Here you go...

function get_image_dim($sURL) {
  // note that for jpeg you may need to change 300 to a larger value,
  // as some height/width info is farther out in the header
  try {
    $hSock = @ fopen($sURL, 'rb');
    if ($hSock) {
      while(!feof($hSock)) {
        $vData = fread($hSock, 300);
        break;
      }
      fclose($hSock);
      if (strpos(' ' . $vData, 'JFIF')>0) {
        $vData = substr($vData, 0, 300);
        $asResult = unpack('H*',$vData);        
        $sBytes = $asResult[1];
        $width = 0;
        $height = 0;
        $hex_width = '';
        $hex_height = '';
        if (strstr($sBytes, 'ffc2')) {
          $hex_height = substr($sBytes, strpos($sBytes, 'ffc2') + 10, 4);
          $hex_width = substr($sBytes, strpos($sBytes, 'ffc2') + 14, 4);
        } else {
          $hex_height = substr($sBytes, strpos($sBytes, 'ffc0') + 10, 4);
          $hex_width = substr($sBytes, strpos($sBytes, 'ffc0') + 14, 4);
        }
        $width = hexdec($hex_width);
        $height = hexdec($hex_height);
        return array('width' => $width, 'height' => $height);
      } elseif (strpos(' ' . $vData, 'GIF')>0) {
        $vData = substr($vData, 0, 300);
        $asResult = unpack('h*',$vData);
        $sBytes = $asResult[1];
        $sBytesH = substr($sBytes, 16, 4);
        $height = hexdec(strrev($sBytesH));
        $sBytesW = substr($sBytes, 12, 4);
        $width = hexdec(strrev($sBytesW));
        return array('width' => $width, 'height' => $height);
      } elseif (strpos(' ' . $vData, 'PNG')>0) {
        $vDataH = substr($vData, 22, 4);
        $asResult = unpack('n',$vDataH);
        $height = $asResult[1];        
        $vDataW = substr($vData, 18, 4);
        $asResult = unpack('n',$vDataW);
        $width = $asResult[1];        
        return array('width' => $width, 'height' => $height);
      }
    }
  } catch (Exception $e) {}
  return FALSE;
}

So, using it we have...

// jpeg
$url = 'http://upload.wikimedia.org/wikipedia/commons/thumb/c/ce/Quality_comparison_jpg_vs_saveforweb.jpg/250px-Quality_comparison_jpg_vs_saveforweb.jpg';
// png
//$url = 'http://upload.wikimedia.org/wikipedia/commons/thumb/4/47/PNG_transparency_demonstration_1.png/280px-PNG_transparency_demonstration_1.png';
// gif
//$url = 'http://upload.wikimedia.org/wikipedia/commons/e/e2/Sunflower_as_gif_small.gif';

$dim = get_image_dim($url);
print_r($dim);

回复收藏 0 原文

樱桃奶球 2024-09-05 06:27:39

运行一个 cURL 来执行一个 HEAD 请求，而不是完整的 GET

我没有对此进行测试，但希望您能明白这个想法：

<?php
$url = 'http://www.example.com/image.gif';
$ch = curl_init();
curl_setopt($ch, CURLOPT_URL, $url);
curl_setopt($ch, CURLOPT_NOBODY, true); // this is what sets it as HEAD request
curl_exec($ch);

if (curl_getinfo($ch, CURLINFO_HTTP_CODE) == '200') { // 200 = OK
    // image exists ..
}

curl_close($ch);
?>

有关 cURL 的更多信息，请参阅 cURL 文档。

Run a cURL that does a HEAD request insted of a full GET

I didn't test this, but hopefully you'll get the idea:

<?php
$url = 'http://www.example.com/image.gif';
$ch = curl_init();
curl_setopt($ch, CURLOPT_URL, $url);
curl_setopt($ch, CURLOPT_NOBODY, true); // this is what sets it as HEAD request
curl_exec($ch);

if (curl_getinfo($ch, CURLINFO_HTTP_CODE) == '200') { // 200 = OK
    // image exists ..
}

curl_close($ch);
?>

See cURL docuentation for more information about cURL.

回复收藏 0 原文

嗳卜坏 2024-09-05 06:27:39

您应该能够确定 JPEG 的尺寸，而无需加载其全部内容。对于基线 JPEG，即非逐行扫描 JPEG，以字节为单位扫描，直到遇到 0xFFC0。跳过接下来的三个字节。接下来的两个字节表示高度。它们后面还有两个表示宽度的字节。

例如，在“FF C0 00 11 08 01 DE 02 D0”中，01DE代表高度为478，02D0代表宽度为720。

回复收藏 0 原文

梦与时光遇 2024-09-05 06:27:39

我会发送一个包含 RANGE 标头尽可能限制实际数据传输（远程服务器可能不接受 RANGE 请求，但仍然值得一试）。无论您使用套接字（直接）还是使用curl 来发出请求，可能都没有太大区别。但是......如果没有基准，你永远不会知道。对于curl，请查看 http://docs.php.net/ 中的“CURLOPT_RANGE”选项function.curl-setopt

它可能不适合您的配置文件（“一个小时几个小时，在只有少量 CPU 可用功率的服务器上。”），但您可能想尝试一次处理多个 url，即多个活动连接，并且仅处理那些不会阻塞读取操作的连接。如果限制因素主要/仅是CPU功率...忘记这部分。
套接字：看看 stream_select
curl：参见curl_multi_exec()

如果curl模块不可用，您还可以将 http url 包装器与 stream_context_create() 发送包含 RANGE 标头的请求。

看起来您已经知道收到数据后如何处理它。

回复收藏 0 原文

美人骨 2024-09-05 06:27:39

我认为以下例程将仅检索 JPG、GIF 和 PNG 的图像高度，或者在 404 或其他图像类型上返回 === FALSE 条件。该例程还使用最少的服务器资源来执行此操作，因为即使添加了字节限制，file_get_contents() 路由似乎也会实际下载文件，就像 getimagesize() 下载文件一样。与此相比，您可以看到性能受到的影响。

该例程的工作方式是从文件中仅下载 300 字节。不幸的是，与 GIF 或 PNG 不同，JPEG 在文件中将其高度值推得很远，因此我不得不以字节为单位读取文件。然后，它使用这些字节扫描该标头中的 JFIF、PNG 或 GIF，让我们知道它是什么类型。一旦我们有了这个，我们就可以在每个上使用独特的例程来解析标头。请注意，JPEG 必须首先使用带有 H* 的 unpack()，然后扫描 ffc2 或 ffc0 并进行处理。然而，GIF 必须首先使用 h* 进行 unpack()（差别很大）。

这个函数是我通过反复试验创建的，可能是错误的。我在几张图像上运行了它，效果似乎很好。如果您发现其中有问题，请考虑告诉我。

无论如何，这个系统将让我确定图像高度并丢弃该图像并找到另一个（如果太高）。无论我找到什么随机图像，我都会在 HTML 的 IMG 标记中设置宽度，它会自动调整高度 - 但只有当图像低于特定高度时才看起来不错。此外，它还会执行 404 检查，看看另一台服务器返回给我的图像是否不再存在或禁止跨站点链接。由于我手动将图像设置为固定宽度，因此我不在乎读取图像宽度。您可以调整此函数，并且通常只需向前查看几个小字节即可找到图像宽度（如果您愿意的话）。

function getImageHeight($sURL) {
  try {
    $hSock = @ fopen($sURL, 'rb');
    if ($hSock) {
      while(!feof($hSock)) {
        $vData = fread($hSock, 300);
        break;
      }
      fclose($hSock);
      if (strpos(' ' . $vData, 'JFIF')>0) {
        $vData = substr($vData, 0, 300);
        $asResult = unpack('H*',$vData);
        $sBytes = $asResult[1];
        if (strstr($sBytes, 'ffc2')) {
          $sBytes = substr($sBytes, strpos($sBytes, 'ffc2') + 10, 4);
        } else {
          $sBytes = substr($sBytes, strpos($sBytes, 'ffc0') + 10, 4);
        } 
        return hexdec($sBytes);
      } elseif (strpos(' ' . $vData, 'GIF')>0) {
        $vData = substr($vData, 0, 300);
        $asResult = unpack('h*',$vData);
        $sBytes = $asResult[1];
        $sBytes = substr($sBytes, 16, 4);
        $sBytes = strrev($sBytes);
        return hexdec($sBytes);
      } elseif (strpos(' ' . $vData, 'PNG')>0) {
        $vData = substr($vData, 22, 4);
        $asResult = unpack('n',$vData);
        $nHeight = $asResult[1];
        return $nHeight;
      }
    }
  } catch (Exception $e) {}
  return FALSE;
}

I think the following routine will retrieve just the image heights for JPG, GIF, and PNG, or return an === FALSE condition on a 404 or other image type. The routine also does this with the least server resources because the file_get_contents() route appears to actually download the file even with byte restriction added in, as does getimagesize() download the file. You can see the performance hit compared to this.

The way this routine works is that it downloads just 300 bytes from the file. Unfortunately JPEG pushes its height value pretty far out in a file unlike GIF or PNG and so I had to read the file that far out in bytes. Then, with those bytes, it scans for JFIF, PNG, or GIF in that header to let us know which file type it is. Once we have that, we then use unique routines on each to parse the header. Note that JPEG must first use unpack() with H* and then scan for ffc2 or ffc0 and process. GIF, however, must first unpack() with h* (big difference there).

This function was created by me with trial and error, and could be wrong. I ran it on several images and it appears to work good. If you find a fault in it, consider letting me know.

Anyway, this system will let me determine an image height and discard the image and find another if too tall. On whatever random image I find, I set width in the IMG tag of the HTML and it automatically resizes the height -- but looks good only if the image is under a certain height. As well, it does a 404 check to see if the image that was returned by another server to me was not for an image that no longer exists or which prohibits cross-site linking. And since I am manually setting the images to a fixed width, I don't care to read the image width. You can adapt this function and usually look just a few small bytes forward to find image widths should you want to do so.

function getImageHeight($sURL) {
  try {
    $hSock = @ fopen($sURL, 'rb');
    if ($hSock) {
      while(!feof($hSock)) {
        $vData = fread($hSock, 300);
        break;
      }
      fclose($hSock);
      if (strpos(' ' . $vData, 'JFIF')>0) {
        $vData = substr($vData, 0, 300);
        $asResult = unpack('H*',$vData);
        $sBytes = $asResult[1];
        if (strstr($sBytes, 'ffc2')) {
          $sBytes = substr($sBytes, strpos($sBytes, 'ffc2') + 10, 4);
        } else {
          $sBytes = substr($sBytes, strpos($sBytes, 'ffc0') + 10, 4);
        } 
        return hexdec($sBytes);
      } elseif (strpos(' ' . $vData, 'GIF')>0) {
        $vData = substr($vData, 0, 300);
        $asResult = unpack('h*',$vData);
        $sBytes = $asResult[1];
        $sBytes = substr($sBytes, 16, 4);
        $sBytes = strrev($sBytes);
        return hexdec($sBytes);
      } elseif (strpos(' ' . $vData, 'PNG')>0) {
        $vData = substr($vData, 22, 4);
        $asResult = unpack('n',$vData);
        $nHeight = $asResult[1];
        return $nHeight;
      }
    }
  } catch (Exception $e) {}
  return FALSE;
}

回复收藏 0 原文