Web server log analysis tools

Posted on 2024-07-09 21:59:19

Any suggestions for an accurate web log analysis tool to generate reports on IIS logs? We used WebTrends, but I don't feel it was accurate.

Comments (8)

软甜啾 2024-07-16 21:59:19

To analyze weblogs, I don't think you can go wrong with Analog: http://www.analog.cx/

If you are analyzing your own logs, which are often huge files, you will want the fastest analyzer you can find. Analog is fast.

You'll want one that's been around a while and is still supported. Analog just celebrated its 10th birthday.

Analog claims to be the most popular logfile analyser in the world.

It supports multiple languages.

Did I say it's free and open source?

As far as accuracy goes, no tool gives perfect results. JavaScript often fails to catch hits. Trying to track individual people's paths through a website (i.e. for analytics purposes) is fraught with problems. And even trying to differentiate hits from visits and screen out the bots is more of a black art than a science.
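To make the hits-versus-visits problem concrete, here is a minimal Python sketch, not how Analog or any other tool mentioned here actually works. It assumes a W3C extended log with a `#Fields:` header that includes `date`, `time`, `c-ip` and `cs(User-Agent)`. The bot markers, the 30-minute inactivity cutoff, and the sample log lines are all illustrative assumptions.

```python
from datetime import datetime, timedelta

# Assumed heuristics: a real bot list is much longer, and the
# 30-minute cutoff is a common convention, not a standard.
BOT_MARKERS = ("bot", "crawler", "spider")
VISIT_TIMEOUT = timedelta(minutes=30)


def parse_w3c(lines):
    """Yield one dict per request from W3C extended log lines."""
    fields = []
    for line in lines:
        line = line.strip()
        if line.startswith("#Fields:"):
            fields = line[len("#Fields:"):].split()
        elif line and not line.startswith("#") and fields:
            values = line.split()
            if len(values) == len(fields):
                yield dict(zip(fields, values))


def summarize(lines):
    """Count hits, hits that look like bots, and approximate visits."""
    hits = bot_hits = visits = 0
    last_seen = {}  # (ip, user agent) -> time of the previous hit
    for row in parse_w3c(lines):
        agent = row.get("cs(User-Agent)", "-")
        if any(marker in agent.lower() for marker in BOT_MARKERS):
            bot_hits += 1
            continue
        hits += 1
        when = datetime.strptime(row["date"] + " " + row["time"],
                                 "%Y-%m-%d %H:%M:%S")
        key = (row["c-ip"], agent)
        previous = last_seen.get(key)
        # A hit starts a new visit if this key was idle past the cutoff.
        if previous is None or when - previous > VISIT_TIMEOUT:
            visits += 1
        last_seen[key] = when
    return {"hits": hits, "bot_hits": bot_hits, "visits": visits}


# Made-up sample data for illustration only.
SAMPLE_LOG = """#Fields: date time c-ip cs-method cs-uri-stem sc-status cs(User-Agent)
2008-10-01 10:00:00 10.0.0.1 GET /index.htm 200 Mozilla/4.0+(compatible;+MSIE+7.0)
2008-10-01 10:00:01 10.0.0.1 GET /logo.gif 200 Mozilla/4.0+(compatible;+MSIE+7.0)
2008-10-01 10:05:00 10.0.0.2 GET /index.htm 200 Googlebot/2.1
2008-10-01 11:00:00 10.0.0.1 GET /index.htm 200 Mozilla/4.0+(compatible;+MSIE+7.0)
""".splitlines()

print(summarize(SAMPLE_LOG))
```

Every number here depends on a guess: a bot with a browser-like user agent counts as a person, and two people behind one proxy with the same browser count as one visit.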

What is best is simply to have a tool that gives decent basic statistics that tell you what you need to know.

I've looked at other tools, such as Deep Log Analyzer: http://www.deep-software.com/, which attempts to do analytics from your weblogs. But speed was a problem. They claim their new version (3.5, April 2008), which I didn't try, has improved performance. The big advantage of a program like this is the advanced reporting you can do, including custom SQL requests. You have to purchase their professional version ($200) to do most of the analytics and custom queries. If Analog is too simple for you, then try the free version of Deep Log Analyzer.

You can also try Microsoft's own Log Parser, which was the recommended answer in https://stackoverflow.com/questions/157677/a-good-iis-log-viewer-for-large-log-files, but you will need some extra skills to use it.

荒芜了季节 2024-07-16 21:59:19

What do you want to analyze from your logs? There are a bunch of tools out there, free or paid, that will go through the logs and spit out a great variety of figures. Some have real meaning; others are best taken with a grain of salt.

What none will show you is "How many people are actually reading my wonderful web pages". Those that attempt to show "distinct site visitors" or any detailed metrics are at best a rough approximation to an indication of a vague trend...

But for what it's worth, we use Analog.

非要怀念 2024-07-16 21:59:19

SHORT ANSWER:

You are correct to question the results; log analysis is not adequate to report actual traffic.

LONGER ANSWER:

WebTrends is a great tool for what it delivers. But as a former administrator of a WebTrends installation, I found that web logs are notoriously bad at capturing metrics of interest.

For instance, if there is any caching in your web delivery stack (or on the consumer's side; *I'm shaking my fist at YOU, AOL!), then your web logs instantly stop reflecting your site's actual activity. This is because log analysis assumes that all user consumption translates to an HTTP request back to the web server, and is thus recorded in the IIS logs. With a cache in the way, that is not the case.

In the future, if you want more reliable results, you ultimately need a way to bust any caching strategy. The obvious answer is dynamic content. But if you do not want to rewrite all of your content in such a fashion, just ensure your web traffic analysis uses a dynamic call.

WebTrends actually offers a solution to this problem, called SDC server. This is exactly what Google Analytics offers as well: it's a JavaScript callback to the analysis server.
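The "dynamic call" idea can be sketched in Python as follows. This is only an illustration of cache busting, not how SDC or Google Analytics is implemented. The names `beacon_url`, `BeaconHandler`, `RECORDED_PAGES` and the `stats.example.com` address are hypothetical. The page asks for a URL that is unique on every view, and the collecting server answers with headers that tell caches not to store the response.

```python
import time
import uuid
from http.server import BaseHTTPRequestHandler
from urllib.parse import parse_qs, urlencode, urlsplit

# Response headers asking browsers and proxies not to cache the beacon.
NO_CACHE_HEADERS = {
    "Cache-Control": "no-store, no-cache, must-revalidate",
    "Pragma": "no-cache",
    "Expires": "0",
}

RECORDED_PAGES = []  # stand-in for a real analytics store


def beacon_url(base, page):
    """Build a tracking URL that differs on every call."""
    params = {
        "page": page,
        "ts": int(time.time() * 1000),
        "nonce": uuid.uuid4().hex,  # random part defeats URL-keyed caches
    }
    return base + "?" + urlencode(params)


class BeaconHandler(BaseHTTPRequestHandler):
    """Record each beacon request and reply with an empty response."""

    def do_GET(self):
        query = parse_qs(urlsplit(self.path).query)
        RECORDED_PAGES.append(query.get("page", ["?"])[0])
        self.send_response(204)  # No Content: nothing worth caching
        for name, value in NO_CACHE_HEADERS.items():
            self.send_header(name, value)
        self.end_headers()


# Hypothetical collector address, for illustration only.
print(beacon_url("http://stats.example.com/hit", "/index.htm"))
```

To try it, the handler could be served with `http.server.HTTPServer(("", 8000), BeaconHandler).serve_forever()`. Because every view reaches the collector even when the page itself came from a cache, the collector's records, rather than the web server logs, become the source for traffic counts.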

...I could go for days on this. If you want more specific information, comment back. ;)

EDIT: With WebTrends specifically, it is quite important to configure session tracking beyond the default IP/userAgent configuration. If your web server assigns a session cookie, you will find that using it increases reliability, especially for differentiating between users who may sit behind the same NAT.
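A small Python sketch of why the cookie helps. The request records and field names (`ip`, `agent`, `cookie`) are made up for illustration and do not reflect WebTrends' configuration or data model. Two people behind one NAT with the same browser collapse into one visitor under IP/userAgent keying, but stay distinct when keyed by session cookie.

```python
def by_ip_agent(request):
    """Identify a visitor by IP address and user agent only."""
    return (request["ip"], request["agent"])


def by_session(request):
    """Prefer the session cookie; fall back to IP/user agent if absent."""
    return request["cookie"] or by_ip_agent(request)


def count_visitors(requests, key):
    """Count distinct visitors under the given identification rule."""
    return len({key(r) for r in requests})


# Made-up requests: two users share one NAT address and browser version.
REQUESTS = [
    {"ip": "203.0.113.7", "agent": "MSIE 7.0", "cookie": "sess-a1"},
    {"ip": "203.0.113.7", "agent": "MSIE 7.0", "cookie": "sess-b2"},
    {"ip": "203.0.113.7", "agent": "MSIE 7.0", "cookie": "sess-a1"},
]

print(count_visitors(REQUESTS, by_ip_agent))  # one visitor seen
print(count_visitors(REQUESTS, by_session))   # two visitors seen
```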

鹿童谣 2024-07-16 21:59:19

I have had really good luck with SmarterStats, from SmarterTools.

鹊巢 2024-07-16 21:59:19

There is a free logging package from MSFT for viewing this information using SQL Reporting Services. Google it.

清浅ˋ旧时光 2024-07-16 21:59:19

Doing it with the logs is only a good idea if it's internal. I'd use Google Analytics for anything on the internet.

梦里兽 2024-07-16 21:59:19

I have been using Summary, which is paid software, for years, and love it. The cost of updates is getting to me, and paying for an update just to get user agent string updates out of the deal is getting bothersome. Not that there aren't other fixes; I just tend not to need them.

Has anyone used Summary and can share how it compares to Analog?

少女的英雄梦 2024-07-16 21:59:19

Look at the XpoLog log analysis platform for web application server and web server logs. It is a log management and analysis platform that integrates with web server logs and creates reports, provides search and a log viewer, and also monitors for problems.
