How should I serve zipped web pages?

Posted 2024-07-14 05:03:46

Background:
Our software generates reports for customers in the usual suspect formats (HTML, PDF, etc.), and each report can contain charts and other graphics unique to that report. For PDFs, everything is held in one place: the PDF file itself. HTML is trickier, as the report is basically the sum of more than one file. The files are available via HTTP through Tomcat.

Problem:
I really want to have a tidy environment and wrap the HTML reports into a single file. There's MHTML, Data URIs, and several other formats to consider. This excellent question posits that, given the lack of cross-browser support for these formats, ZIP is a neat solution. This is attractive to me as I can also offer the zip for download as an "HTML report you can email" option. (In the past, users have complained about losing the graphics when they set about emailing HTML reports.)

The solution seems simple. A request comes in, I locate the appropriate zip, unpack it somewhere on the webserver, point the request at the new HTML file, and after a day or so tidy everything up again.

But something doesn't quite seem right about that. I've got a gut feeling that it's not a good solution, that there's something intrinsically wrong with it, or that maybe a better way exists that I can't see at the moment.

Can anyone suggest whether this is good or bad, and offer an alternative solution?

Edit for more background information!
The reports need to persist on the server. Our customers are users at sites, and the visibility of a single report could be as wide as everyone at the site. The creation process involves the user selecting the criteria for the report and submitting it to the server for creation. Data is extracted from the database and a document is built. A placeholder record goes into the database, and the documents themselves get stored on the fileserver somewhere. It's the 'documents on the fileserver' part that I'd like to be tidier, and zipping also means less disk space used! Once a report is created, it is available to anyone who can see it.

无所谓啦 2024-07-21 05:03:46

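I'd assumed the plan was for the zip file to end up on the client rather than stay on the server.

Without knowing your architecture, I would guess at an approach like this: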

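User requests a report
Server displays the report as HTML
User possibly tweaks some parameters and repeats the request
Server displays the report as HTML (repeat until the user is happy)
On each HTML report, there's a "download as zip" link
User clicks the link
Server regenerates the report, stores it in a zip file and serves it to the user
User saves the zip file somewhere, emails it, etc. The server isn't involved at all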

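This relies on being able to rerun the report to generate the zip file, of course. You could generate a zip file each time you generate some HTML, but that's wasteful if you don't need to do it, and it requires cleanup etc.

Maybe I've misunderstood you though... if this doesn't sound appropriate, could you update your question?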

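EDIT: Okay, having seen the update to your question, I'd be tempted to store the files for each report in a separate directory (e.g. using a GUID as the directory name). Many file systems support compression at the file-system level, so "premature zipping" probably won't save much disk space, and it will make it harder to extract individual files. Then, if the user requests a zip, you only need to build the zip file at that point (possibly just in memory) before serving it.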
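
As a rough sketch of that last idea, assuming a Java backend (the question mentions Tomcat), something along these lines could create a GUID-named directory per report and build the zip in memory only when it is asked for. The class and method names (`ReportZipper`, `newReportDir`, `zipInMemory`) and the file layout are hypothetical, made up for illustration; only the standard `java.util.zip` and `java.nio.file` APIs are used.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.List;
import java.util.UUID;
import java.util.stream.Collectors;
import java.util.stream.Stream;
import java.util.zip.ZipEntry;
import java.util.zip.ZipOutputStream;

public class ReportZipper {

    /** Creates a fresh directory for one report, named by a random GUID (hypothetical layout). */
    public static Path newReportDir(Path reportsRoot) throws IOException {
        return Files.createDirectories(reportsRoot.resolve(UUID.randomUUID().toString()));
    }

    /** Zips every regular file under reportDir into a byte array, without touching the disk. */
    public static byte[] zipInMemory(Path reportDir) throws IOException {
        List<Path> files;
        try (Stream<Path> walk = Files.walk(reportDir)) {
            files = walk.filter(Files::isRegularFile).sorted().collect(Collectors.toList());
        }
        ByteArrayOutputStream buffer = new ByteArrayOutputStream();
        try (ZipOutputStream zip = new ZipOutputStream(buffer)) {
            for (Path file : files) {
                // Entry names are relative to the report directory and use '/' separators.
                String name = reportDir.relativize(file).toString().replace('\\', '/');
                zip.putNextEntry(new ZipEntry(name));
                Files.copy(file, zip);
                zip.closeEntry();
            }
        }
        return buffer.toByteArray();
    }

    public static void main(String[] args) throws IOException {
        // Assumed example data: one HTML page plus one image in a subdirectory.
        Path root = Files.createTempDirectory("reports");
        Path dir = newReportDir(root);
        Files.write(dir.resolve("index.html"),
                "<html><img src=\"img/chart.png\"></html>".getBytes(StandardCharsets.UTF_8));
        Files.createDirectories(dir.resolve("img"));
        Files.write(dir.resolve("img/chart.png"), new byte[] {1, 2, 3});

        byte[] zipped = zipInMemory(dir);
        System.out.println("zip size in bytes: " + zipped.length);
    }
}
```

In a servlet, the returned bytes would presumably be written to the response with a content type of `application/zip`; for very large reports, streaming the `ZipOutputStream` straight to the response may be preferable to buffering everything in memory.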