导入/导出 Apache Jackrabbit 存储库的最佳方法
从 Apache Jackrabbit 存储库导入/导出数据的最佳方法是什么?
现在,我有一个基于存储库的网站,服务器中包含超过 100GB 的数据,但我经常需要在笔记本中获取这些数据的子集进行本地演示。
我怎样才能以最简单、最快的方式做到这一点?
What is the best way to import/export data from a Apache Jackrabbit repository?
Right now, I have a website based on a repository with over 100GB data in a server but I frequently need to take a subset of this data in my notebook for in-loco demonstrations.
How can I do this in the easiest and fastest way?
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(4)
谷歌搜索我最好的选择是这个工具:
http://svn. apache.org/repos/asf/jackrabbit/sandbox/jackrabbit-jcr-import-export-tool/README.txt
但是,仅从 README 文件来看,它看起来不太可靠。
另一个问题是,它要求我提前知道要导出的所有文件的文件名,这会很麻烦,因为我提取的主要标准是复制最新的文件,即使用创建日期作为筛选。
理想的解决方案是让我像在常规文件系统中一样浏览存储库中的文件,并使用简单的复制工具来提取数据。
Googling for it my best bet is this tool:
http://svn.apache.org/repos/asf/jackrabbit/sandbox/jackrabbit-jcr-import-export-tool/README.txt
But, judging solely from the README file, it doesn't look very reliable.
The other problem is that it requires me to know the file names of all files I want to export in advance, which would be bothersome, since my main criteria for extraction is to copy the most recent files, ie, using the creation date as the filter.
The ideal solution would let me browse through the files in the repository as in a regular file-system and use simple copy tools to extract the data.
遗憾的是,没有那么多通用的 JCR 工具。 Sling 有两个 jcr 浏览器,一个基于 GWT,另一个基于 JS,但都没有导出。
Sadly there aren't that many generic JCR tools. Sling has two jcr explorers, one that is GWT based and another JS-based, but neither export.
Day就有CRX,不过要花一些钱
There is CRX from Day, but it costs some money
我自己正在研究这个。 Session.exportSystemView() 和 Session.importXML() 可以工作吗? 这些显然允许您将子树序列化/反序列化为 XML。
I'm looking into this myself. Would Session.exportSystemView() and Session.importXML() work? Those will apparently allow you to serialize/deserialize a subtree as XML.