Typhoeus gem 不会减少响应时间

发布于 2024-12-19 09:41:10 字数 1207 浏览 3 评论 0原文

基本上我有 857 个图像链接需要检查。我用 3 种不同的方法实现它，每种方法运行 3 次。

方法 1：使用 Typhoeus 和 Hydra（并行请求）

hydra = Typhoeus::Hydra.new(:max_concurrency => 50)
st = Time.now
@image_urls.each do |image_url|
  request = Typhoeus::Request.new(image_url)
  hydra.queue(request)
end
hydra.run
et = Time.now
puts "\n" + (et - st).to_s() + " seconds"

耗时：117.65、99.45、102.01 秒

方法 2：使用 Typhoeus（单个请求）

st = Time.now
@image_urls.each do |image_url|
  response = Typhoeus::Request.head(image_url)
end
et = Time.now
puts "\n" + (et - st).to_s() + " seconds"

耗时：33.85、31.89、 30.18 秒

方法 3：使用 Net::HTTP Ruby 库

st = Time.now
@image_urls.each do |image_url|
  url = URI.parse(image_url)
  req = Net::HTTP.new(url.host, url.port)
  res = req.request_head(url.path).code   
end
et = Time.now
puts "\n" + (et - st).to_s() + " seconds"

耗时： 83.30, 67.62, 75.26 秒

最初我认为方法 1：Typhhoeus 和 Hydra 应该通过发送并行请求而不是一次发送 1 个请求来加快 Http 响应时间。然而，上面的结果表明我的响应时间实际上变慢了。

原因之一可能是针对标头的 http 请求的开销比普通的 http GET 请求要少。除此之外，我在这里做错了什么吗？需要建议来优化这个过程，我只需要检索 http 状态代码。

原文

Basically I have 857 image links to check. I implemented it in 3 different methods and run them 3 times each.

Method 1: Using Typhoeus and Hydra (Parallel Requests)

hydra = Typhoeus::Hydra.new(:max_concurrency => 50)
st = Time.now
@image_urls.each do |image_url|
  request = Typhoeus::Request.new(image_url)
  hydra.queue(request)
end
hydra.run
et = Time.now
puts "\n" + (et - st).to_s() + " seconds"

Time taken: 117.65, 99.45, 102.01 seconds

Method 2: Using Typhoeus (Singular Request)

st = Time.now
@image_urls.each do |image_url|
  response = Typhoeus::Request.head(image_url)
end
et = Time.now
puts "\n" + (et - st).to_s() + " seconds"

Time taken: 33.85, 31.89, 30.18 seconds

Method 3: Using Net::HTTP Ruby library

st = Time.now
@image_urls.each do |image_url|
  url = URI.parse(image_url)
  req = Net::HTTP.new(url.host, url.port)
  res = req.request_head(url.path).code   
end
et = Time.now
puts "\n" + (et - st).to_s() + " seconds"

Time taken: 83.30, 67.62, 75.26 seconds

Initially I thought Method 1: Typhoeus and Hydra is suppose to speed up Http response time by sending parallel requests instead of sending 1 at a time. However, the above result show me that I am in fact getting a slower response time.

One reason could be a http request for the header has lesser overhead than a normal http GET request. Other than that, am I doing something wrong here? Need advice to optimize this process, I just need to retrieve the http status code.

分享到QQ

分享到微博