Tools for comparing the structure of two websites
Does anyone know of any tools for comparing two almost identical websites?
Simply put, I have a sandbox site and a production site, and I would like to find the differences between them so that I know what content to move to production.
Thanks!
Edit:
OK, I see I missed a critical piece of information (sorry!). Both sites are online and based on a CMS (Drupal), so I need a crawling tool that runs over the two sites and shows which pages are present in the sandbox but not in production.
Thanks to everyone who answered regardless!
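For the "pages present in sandbox but not in production" part, here is a minimal sketch using only the Python standard library. It is not one of the tools mentioned in the answers, just an illustration of the idea: crawl each site, collect the internal URL paths, and take the set difference. The host names in the usage note are placeholders, the `limit` of 500 pages is an arbitrary assumption, and it only follows plain `<a href>` links, so pages reachable only through JavaScript or forms would be missed.

```python
import http.client
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse
from urllib.request import urlopen


class LinkCollector(HTMLParser):
    """Collects the href value of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def internal_paths(page_url, html):
    """Return the set of URL paths on the same host linked from one page."""
    collector = LinkCollector()
    collector.feed(html)
    host = urlparse(page_url).netloc
    paths = set()
    for href in collector.hrefs:
        absolute = urlparse(urljoin(page_url, href))
        if absolute.scheme in ("http", "https") and absolute.netloc == host:
            paths.add(absolute.path or "/")
    return paths


def crawl_paths(base_url, limit=500):
    """Breadth-first crawl of one site; stops queueing at roughly `limit` paths."""
    seen, queue = {"/"}, ["/"]
    while queue and len(seen) < limit:
        path = queue.pop(0)
        page_url = urljoin(base_url, path)
        try:
            with urlopen(page_url, timeout=10) as response:
                html = response.read().decode("utf-8", errors="replace")
        except (OSError, ValueError, http.client.HTTPException):
            continue  # unreachable or malformed URL: skip it
        for found in internal_paths(page_url, html) - seen:
            seen.add(found)
            queue.append(found)
    return seen


def only_in_sandbox(sandbox_paths, production_paths):
    """Paths that exist in the sandbox crawl but not in the production crawl."""
    return sorted(set(sandbox_paths) - set(production_paths))
```

Usage would look something like `only_in_sandbox(crawl_paths("http://sandbox.example.com"), crawl_paths("http://www.example.com"))`. Comparing paths rather than full URLs is what lets the two different host names line up.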
Comments (10)
Use httrack to pull down a copy of each site, then use your favourite file comparison tool to review the differences. (I prefer WinMerge, which can recursively run through two folders of files, has options to ignore whitespace differences and blank lines, and even runs well under Linux using WINE.)
P.S. You might even want to run the downloaded HTML files through HTML Tidy to normalise and pretty-print them before doing the comparison.
The other way to do it would be a database comparison. You would still do a file comparison of the raw website files (not the spidered version) as well, though. From memory, the schema of a Drupal database isn't too hard to follow, particularly if you are primarily interested in node content.
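To illustrate why normalising before the comparison helps, here is a rough Python sketch. It is a stand-in for the idea, not for HTML Tidy itself: it only collapses whitespace and drops blank lines (the same kind of noise the WinMerge options ignore) and then produces a unified diff with the standard `difflib` module. The function names are made up for this example.

```python
import difflib
import re


def normalise(html):
    """Collapse runs of whitespace and drop blank lines; returns a list of lines."""
    lines = []
    for line in html.splitlines():
        collapsed = re.sub(r"\s+", " ", line).strip()
        if collapsed:
            lines.append(collapsed)
    return lines


def page_diff(sandbox_html, production_html):
    """Unified diff from the production page to the sandbox page, after normalising."""
    return list(
        difflib.unified_diff(
            normalise(production_html),
            normalise(sandbox_html),
            fromfile="production",
            tofile="sandbox",
            lineterm="",
        )
    )
```

An empty list means the two pages differ only in whitespace. A real tidy pass would also normalise attribute order and tag nesting, which this sketch does not attempt.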
Beyond Compare ... pure awesomeness.
WinMerge is a free, open source tool that lets you compare files and directories.
You can show files which are identical, different, or present in only one directory.
You can then make both directories identical by copying files from one directory to the other.
As sunny said, use a crawler and a diff tool.
For the crawler I'd prefer httrack over wget.
Many sites reference images from CSS nowadays.
wget was not able to load files referenced from CSS when I last checked.
Besides that, httrack has a nice GUI.
As for the diff tool:
I have used FileSync since the Windows 3.1 days.
There are certainly fancier diff tools, but this one is lean and clean and does the job.
On Unix you can always run diff -r.
If you have Eclipse on your machine, it has a nice diff viewer that can compare two directories as well.
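If you would rather script the directory comparison than use a GUI, something along these lines gives roughly the summary that `diff -r` or a folder-compare view shows. It is a sketch using Python's standard `filecmp` module; the function name and the report keys are made up for this example, and the two arguments are assumed to be the local mirror folders produced by the crawler.

```python
import filecmp
import os


def compare_trees(sandbox_dir, production_dir):
    """Recursively list files that exist on one side only or differ in content."""
    report = {"only_sandbox": [], "only_production": [], "changed": []}

    def walk(cmp, prefix):
        report["only_sandbox"] += [os.path.join(prefix, n) for n in cmp.left_only]
        report["only_production"] += [os.path.join(prefix, n) for n in cmp.right_only]
        for name in cmp.common_files:
            left = os.path.join(cmp.left, name)
            right = os.path.join(cmp.right, name)
            # shallow=False forces a byte-by-byte comparison instead of
            # trusting file size and modification time.
            if not filecmp.cmp(left, right, shallow=False):
                report["changed"].append(os.path.join(prefix, name))
        for name, sub in cmp.subdirs.items():
            walk(sub, os.path.join(prefix, name))

    walk(filecmp.dircmp(sandbox_dir, production_dir), "")
    for paths in report.values():
        paths.sort()
    return report
```

The `only_sandbox` list is the part that answers the original question; `changed` is where a visual diff tool is still the better way to look at the details.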
Meld is good for this if you're running Linux. It's available in the Debian/Ubuntu repositories.
Use wget to crawl the sites, then compare the content with a diff tool.
Or use a diff tool on the script files, then dump the databases to text (CSV) files and compare those.
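For the CSV route, a plain text diff can be noisy when the two dumps come out in a different row order, so comparing by key may be easier. The following is only a sketch under assumptions: it supposes each dump has a unique identifier in one column (the first by default, for example a node ID), and the function names are invented for the example.

```python
import csv
import io


def load_rows(csv_text, key_column=0):
    """Index the rows of a CSV dump by the value in `key_column`."""
    rows = {}
    for row in csv.reader(io.StringIO(csv_text)):
        if row:  # skip empty lines
            rows[row[key_column]] = row
    return rows


def compare_dumps(sandbox_csv, production_csv, key_column=0):
    """Return (keys only in sandbox, keys whose rows differ between the dumps)."""
    sandbox = load_rows(sandbox_csv, key_column)
    production = load_rows(production_csv, key_column)
    added = sorted(sandbox.keys() - production.keys())
    changed = sorted(
        key
        for key in sandbox.keys() & production.keys()
        if sandbox[key] != production[key]
    )
    return added, changed
```

The first list is the content that exists only in the sandbox; the second is content that exists on both sides but was edited. Keys are compared as strings, so the sort order is lexical rather than numeric.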
Beyond Compare is great. Syncback (http://www.2brightsparks.com/downloads.html) has a free version, and you can use any merge tool such as Araxis or WinMerge.
Put your code into Subversion or some other source control system. When you are ready to deploy, 'svn update' will update only the things that have changed.
'svn diff' will show the differences.
DiffMerge is another free cross-platform tool that supports directory diffs.
You can also use an online comparison service such as CopyScape or ChangeDetection.