Tools for monitoring website (HTML) changes / taking snapshots
Could you recommend a tool (not an online one) for monitoring HTML changes on a website, or for taking a complete snapshot of a website?
My task is to migrate the current website to the next version of the CMS. The migration also requires code changes because of a new API. The idea is to take a snapshot of the website before and after the migration, then compare the two snapshots to see whether the migration went OK.
Thanks
Comments (3)
If you're migrating from one CMS to another, the content is most likely in a database. Just dump the DB at the desired points, then build a quick import script to pull in the content and map any changes in the DB fields.
Not too long ago I moved a site from an old Perl-based CMS to a much nicer custom CMS based on Zend Framework. While the DB tables were quite different, every CMS seems to have common fields such as Title, Content, Blurb, etc. It's just a matter of identifying what matches, building the import script, and running it. Once the quick script is written, you can pull in updates the same way, which lets you run a beta of the new code and quickly import any updates made between the first import and the launch of the new codebase.
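To make the field-mapping idea concrete, here is a minimal sketch of such an import script. It uses SQLite only so that it runs anywhere; the table names (`articles`, `pages`) and the column names in `FIELD_MAP` are made up for illustration and will differ for any real pair of CMSs.

```python
import sqlite3

# Hypothetical mapping: old CMS column name -> new CMS column name.
FIELD_MAP = {"headline": "title", "body": "content", "teaser": "blurb"}


def import_articles(old_db, new_db, field_map=FIELD_MAP):
    """Copy rows from the old `articles` table into the new `pages` table.

    Both table names are assumptions; column names come from field_map,
    which must only contain trusted identifiers (they are interpolated
    into the SQL text, while the row values are passed as parameters).
    """
    old_cols = list(field_map)
    new_cols = [field_map[col] for col in old_cols]
    rows = old_db.execute(
        f"SELECT {', '.join(old_cols)} FROM articles"
    ).fetchall()
    placeholders = ", ".join("?" for _ in new_cols)
    new_db.executemany(
        f"INSERT INTO pages ({', '.join(new_cols)}) VALUES ({placeholders})",
        rows,
    )
    new_db.commit()
    return len(rows)
```

In practice `old_db` would be a connection to a restored dump of the old database, and the same function could be rerun later to bring in content added after the first import (you would then need some way to skip rows already imported, which this sketch does not handle).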
Screen grabs (images)
As your web pages may well extend beyond the visible area of your browser window, you'll need a specialized tool for this. For Firefox, I have had good experiences with FireShot. It's not fully automatic, though.
Copy of the full HTML structure
For an automated solution, have you considered downloading a complete mirror of the website? I don't mean the old CMS, but the generated HTML output, including all style sheets and scripts. Any dynamic functionality would be lost, of course, but it should be possible to create a running, local HTML copy of the whole thing that way. I have used GetLeft in the past. Just be sure everything gets downloaded and no references to the online version remain.
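One way to check the "no references to the online version" part is to scan the downloaded mirror for links that still point at the live host. The sketch below is not part of GetLeft; the function name and the set of file extensions it scans are my own assumptions.

```python
import re
from pathlib import Path

# File types assumed to contain links worth checking.
TEXT_SUFFIXES = {".html", ".htm", ".css", ".js"}


def find_online_references(mirror_dir, host):
    """Return (relative_path, line_number) for each line that still
    mentions the live host, e.g. "http://www.example.com/..." or
    "//www.example.com/...".
    """
    pattern = re.compile(r"(?:https?:)?//" + re.escape(host), re.IGNORECASE)
    hits = []
    for path in sorted(Path(mirror_dir).rglob("*")):
        if path.is_file() and path.suffix.lower() in TEXT_SUFFIXES:
            text = path.read_text(encoding="utf-8", errors="replace")
            for lineno, line in enumerate(text.splitlines(), start=1):
                if pattern.search(line):
                    hits.append((str(path.relative_to(mirror_dir)), lineno))
    return hits
```

An empty result suggests the mirror is self-contained, at least for the file types scanned; links built at runtime by JavaScript would not be caught.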
Why don't you make a copy of the entire website folder and, after making the changes, use Beyond Compare to see what has changed? It has a filter to show only differences, which will give you what you need (the files that changed).
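If you want the same "only show differences" view from a script rather than a GUI, a rough equivalent is to hash every file in both copies and report what was added, removed, or changed. This is only a sketch with function names of my own choosing, not how Beyond Compare works internally.

```python
import hashlib
from pathlib import Path


def snapshot_hashes(root):
    """Map each file's path (relative to root) to its SHA-256 digest."""
    root = Path(root)
    return {
        path.relative_to(root).as_posix(): hashlib.sha256(path.read_bytes()).hexdigest()
        for path in root.rglob("*")
        if path.is_file()
    }


def compare_snapshots(before_dir, after_dir):
    """Report files removed, added, or changed between two folder copies."""
    before = snapshot_hashes(before_dir)
    after = snapshot_hashes(after_dir)
    common = before.keys() & after.keys()
    return {
        "removed": sorted(before.keys() - after.keys()),
        "added": sorted(after.keys() - before.keys()),
        "changed": sorted(name for name in common if before[name] != after[name]),
    }
```

Note that this compares bytes, so pages containing timestamps or session tokens will show up as changed even when nothing meaningful differs; the files listed under "changed" would still need a line-level diff to judge.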