XML to XML transformation in Java

Posted 2024-08-17 06:24:17

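I need to convert XMI to OWL (RDF/XML serialization) in Java, so this is essentially an XML-to-XML transformation. I could probably get by with regular expressions and replaceAll, but that seems like a very messy way to do it. What would you suggest so that the conversion is easy to customize later (my OWL model may change slightly in the future)?

My idea is to read the XMI into a class hierarchy created according to my OWL model and then use a template engine to output it as OWL (XML). Do you know of a simpler approach that is easier to customize?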

巾帼英雄 2024-08-24 06:24:17

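XSL Transformations (XSLT) are a good fit for this kind of job; in fact, this is exactly what they were designed for :-)

To get started with XSLT, have a look at the zvon reference and its tutorial.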

晨敛清荷 2024-08-24 06:24:17

You can use XSLT to transform XML into XML.

This O'Reilly article is a good place to start.
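As a rough illustration of what running XSLT from Java can look like, here is a minimal sketch using the JAXP API (javax.xml.transform) that ships with the JDK. The input document, the element names, the http://example.org/model# base URI and the class name XmiToOwlSketch are all made up for this example; real XMI is namespaced and far richer.

```java
import javax.xml.transform.OutputKeys;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerException;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.stream.StreamResult;
import javax.xml.transform.stream.StreamSource;
import java.io.StringReader;
import java.io.StringWriter;

class XmiToOwlSketch {
    // Hypothetical, heavily simplified stand-in for an XMI document.
    static final String INPUT =
        "<model><class name='Person'/><class name='Company'/></model>";

    // Hypothetical stylesheet: every <class> becomes an owl:Class.
    static final String STYLESHEET =
        "<xsl:stylesheet version='1.0'"
      + " xmlns:xsl='http://www.w3.org/1999/XSL/Transform'"
      + " xmlns:rdf='http://www.w3.org/1999/02/22-rdf-syntax-ns#'"
      + " xmlns:owl='http://www.w3.org/2002/07/owl#'>"
      + "<xsl:template match='/model'>"
      + "<rdf:RDF><xsl:apply-templates select='class'/></rdf:RDF>"
      + "</xsl:template>"
      + "<xsl:template match='class'>"
      + "<owl:Class rdf:about='http://example.org/model#{@name}'/>"
      + "</xsl:template>"
      + "</xsl:stylesheet>";

    // Applies the stylesheet to the XML string and returns the result as a string.
    static String transform(String stylesheet, String xml) {
        try {
            Transformer t = TransformerFactory.newInstance()
                .newTransformer(new StreamSource(new StringReader(stylesheet)));
            t.setOutputProperty(OutputKeys.OMIT_XML_DECLARATION, "yes");
            StringWriter out = new StringWriter();
            t.transform(new StreamSource(new StringReader(xml)), new StreamResult(out));
            return out.toString();
        } catch (TransformerException e) {
            throw new IllegalStateException("XSLT transformation failed", e);
        }
    }

    public static void main(String[] args) {
        System.out.println(transform(STYLESHEET, INPUT));
    }
}
```

In a real project the stylesheet would normally live in its own .xsl file, so that a change to the OWL model only requires editing the stylesheet and not the Java code.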

冰雪梦之恋 2024-08-24 06:24:17

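XMI is not a great format to transform directly into OWL: many different structures in XMI mean the same thing (@stereotype="foo", stereotype/@name="foo" and stereotype/@xmi:id="{id of the foo stereotype}" all denote the same thing). I strongly recommend a two-stage process in which the XMI is first transformed into a canonical form, where such references are resolved and any information you don't want to map to OWL is removed.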
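A two-stage process of this kind could be wired up in Java roughly as follows. This is only a sketch under assumptions: the input format, both stylesheets, the "entity" stereotype and the http://example.org/model# base URI are invented for illustration. Stage 1 folds two spellings of a stereotype into one attribute, and stage 2 maps only the canonical form to OWL, dropping classes that should not be mapped.

```java
import javax.xml.transform.OutputKeys;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerException;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.dom.DOMResult;
import javax.xml.transform.dom.DOMSource;
import javax.xml.transform.stream.StreamResult;
import javax.xml.transform.stream.StreamSource;
import java.io.StringReader;
import java.io.StringWriter;

class TwoStageSketch {
    // Hypothetical input: the stereotype is given once as an attribute, once as a child.
    static final String INPUT =
        "<model>"
      + "<class name='A' stereotype='entity'/>"
      + "<class name='B'><stereotype name='entity'/></class>"
      + "<class name='C'/>"
      + "</model>";

    // Stage 1: canonical form with the stereotype always in an attribute.
    static final String TO_CANONICAL =
        "<xsl:stylesheet version='1.0' xmlns:xsl='http://www.w3.org/1999/XSL/Transform'>"
      + "<xsl:template match='/model'>"
      + "<canonical><xsl:apply-templates select='class'/></canonical>"
      + "</xsl:template>"
      + "<xsl:template match='class'>"
      + "<class name='{@name}' stereotype='{@stereotype | stereotype/@name}'/>"
      + "</xsl:template>"
      + "</xsl:stylesheet>";

    // Stage 2: only the canonical form is mapped to OWL; other classes are dropped.
    static final String TO_OWL =
        "<xsl:stylesheet version='1.0'"
      + " xmlns:xsl='http://www.w3.org/1999/XSL/Transform'"
      + " xmlns:rdf='http://www.w3.org/1999/02/22-rdf-syntax-ns#'"
      + " xmlns:owl='http://www.w3.org/2002/07/owl#'>"
      + "<xsl:template match='/canonical'>"
      + "<rdf:RDF><xsl:apply-templates select='class[@stereotype=\"entity\"]'/></rdf:RDF>"
      + "</xsl:template>"
      + "<xsl:template match='class'>"
      + "<owl:Class rdf:about='http://example.org/model#{@name}'/>"
      + "</xsl:template>"
      + "</xsl:stylesheet>";

    static Transformer compile(String stylesheet) throws TransformerException {
        return TransformerFactory.newInstance()
            .newTransformer(new StreamSource(new StringReader(stylesheet)));
    }

    // Runs stage 1 into an in-memory DOM tree, then feeds that tree to stage 2.
    static String run(String xml) {
        try {
            DOMResult canonical = new DOMResult();
            compile(TO_CANONICAL).transform(new StreamSource(new StringReader(xml)), canonical);

            Transformer stage2 = compile(TO_OWL);
            stage2.setOutputProperty(OutputKeys.OMIT_XML_DECLARATION, "yes");
            StringWriter out = new StringWriter();
            stage2.transform(new DOMSource(canonical.getNode()), new StreamResult(out));
            return out.toString();
        } catch (TransformerException e) {
            throw new IllegalStateException("pipeline failed", e);
        }
    }

    public static void main(String[] args) {
        System.out.println(run(INPUT));
    }
}
```

Keeping the two stylesheets separate means that a change to the OWL model should only touch the second one, while quirks of a particular XMI exporter stay confined to the first.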

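If you are not already familiar with them, the XSLT key() function and xsl:key element are very useful here. Although you can do this in XSLT 1 (I have, when nothing else was available), working in an XSLT 2 processor such as Saxon makes the transformations more concise. The best place to ask XSLT questions is the Mulberry list.

There is a tool on SourceForge that does this through a GUI, but I can't seem to find it. My intermediate transformations are owned by a former employer. For code generation or XMI to XML, I just use XSLT directly with the two-stage approach.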
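To show what xsl:key and key() buy you, here is a small XSLT 1.0 sketch, run through JAXP, that resolves id-based stereotype references while canonicalizing. The input is a made-up simplification: the stereotypeDef element, the xmi:idref attribute and the namespace URI used for the xmi prefix are assumptions for illustration, not a faithful rendering of any particular XMI version.

```java
import javax.xml.transform.OutputKeys;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerException;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.stream.StreamResult;
import javax.xml.transform.stream.StreamSource;
import java.io.StringReader;
import java.io.StringWriter;

class KeyLookupSketch {
    // Hypothetical input: three spellings of "class has stereotype foo".
    static final String INPUT =
        "<model xmlns:xmi='http://www.omg.org/XMI'>"
      + "<stereotypeDef xmi:id='s1' name='foo'/>"
      + "<class name='A' stereotype='foo'/>"
      + "<class name='B'><stereotype name='foo'/></class>"
      + "<class name='C'><stereotype xmi:idref='s1'/></class>"
      + "</model>";

    static final String STYLESHEET =
        "<xsl:stylesheet version='1.0'"
      + " xmlns:xsl='http://www.w3.org/1999/XSL/Transform'"
      + " xmlns:xmi='http://www.omg.org/XMI'"
      + " exclude-result-prefixes='xmi'>"
        // Index every stereotype definition by its id, once per document.
      + "<xsl:key name='stereotype-by-id' match='stereotypeDef' use='@xmi:id'/>"
      + "<xsl:template match='/model'>"
      + "<canonical><xsl:apply-templates select='class'/></canonical>"
      + "</xsl:template>"
      + "<xsl:template match='class'>"
      + "<class name='{@name}'>"
      + "<xsl:attribute name='stereotype'>"
      + "<xsl:choose>"
      + "<xsl:when test='@stereotype'><xsl:value-of select='@stereotype'/></xsl:when>"
      + "<xsl:when test='stereotype/@name'><xsl:value-of select='stereotype/@name'/></xsl:when>"
        // Otherwise follow the id reference through the key.
      + "<xsl:otherwise>"
      + "<xsl:value-of select='key(\"stereotype-by-id\", stereotype/@xmi:idref)/@name'/>"
      + "</xsl:otherwise>"
      + "</xsl:choose>"
      + "</xsl:attribute>"
      + "</class>"
      + "</xsl:template>"
      + "</xsl:stylesheet>";

    static String transform(String stylesheet, String xml) {
        try {
            Transformer t = TransformerFactory.newInstance()
                .newTransformer(new StreamSource(new StringReader(stylesheet)));
            t.setOutputProperty(OutputKeys.OMIT_XML_DECLARATION, "yes");
            StringWriter out = new StringWriter();
            t.transform(new StreamSource(new StringReader(xml)), new StreamResult(out));
            return out.toString();
        } catch (TransformerException e) {
            throw new IllegalStateException("XSLT transformation failed", e);
        }
    }

    public static void main(String[] args) {
        System.out.println(transform(STYLESHEET, INPUT));
    }
}
```

After this pass all three classes carry the same stereotype attribute, so a later stylesheet only has to handle one spelling.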

久而酒知 2024-08-24 06:24:17

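I agree with rsp and cb160 that XSLT is the tool for this job.

If you are on a Unix platform, consider using xsltproc to test your transformations on the command line. In my experience this really speeds up development if you are not very familiar with XSL.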

淡淡の花香 2024-08-24 06:24:17

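XSLT is designed for processing trees of XML nodes. While there are RDF serializations that are a "tree" of XML nodes (RDF/XML and RDF/XML-Abbrev), the underlying RDF data model is a graph.

If your resulting RDF graph is not also a tree, you are going to have to do dirty things in your XSLT to traverse references, and performance, maintainability and sanity can suffer. Just be aware of this if you modify the OWL format and then want to convert back to non-RDF XML.

A simple (tree) example is as follows: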

## Foo has two types
@prefix e: <uri://example#>.
e:Foo a e:Bar.
e:Foo a e:Baz. # Second statement about e:Foo

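For conversions back to non-RDF XML, if you use the most basic RDF/XML form you will get a list of RDF statements immediately under the top-level rdf:RDF element. Transforming these can involve searching the entire list of statements over and over.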

<rdf:RDF xmlns:e="uri://example#"
         xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
  <rdf:Description rdf:about="uri://example#Foo">
    <rdf:type rdf:resource="uri://example#Baz"/>
  </rdf:Description>
  <rdf:Description rdf:about="uri://example#Foo">
    <rdf:type rdf:resource="uri://example#Bar"/>
  </rdf:Description>
</rdf:RDF>
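One way to avoid rescanning the whole statement list for every subject is to index the rdf:Description elements with xsl:key and group them (the XSLT 1.0 idiom often called Muenchian grouping). The sketch below applies it to the flat RDF/XML above through JAXP. The output vocabulary (resources, resource, type) and the class name are invented for illustration.

```java
import javax.xml.transform.OutputKeys;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerException;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.stream.StreamResult;
import javax.xml.transform.stream.StreamSource;
import java.io.StringReader;
import java.io.StringWriter;

class GroupBySubjectSketch {
    // The flat statement list from the example above.
    static final String INPUT =
        "<rdf:RDF xmlns:e='uri://example#'"
      + " xmlns:rdf='http://www.w3.org/1999/02/22-rdf-syntax-ns#'>"
      + "<rdf:Description rdf:about='uri://example#Foo'>"
      + "<rdf:type rdf:resource='uri://example#Baz'/>"
      + "</rdf:Description>"
      + "<rdf:Description rdf:about='uri://example#Foo'>"
      + "<rdf:type rdf:resource='uri://example#Bar'/>"
      + "</rdf:Description>"
      + "</rdf:RDF>";

    static final String STYLESHEET =
        "<xsl:stylesheet version='1.0'"
      + " xmlns:xsl='http://www.w3.org/1999/XSL/Transform'"
      + " xmlns:rdf='http://www.w3.org/1999/02/22-rdf-syntax-ns#'"
      + " exclude-result-prefixes='rdf'>"
        // Index all statements by subject URI.
      + "<xsl:key name='by-subject' match='rdf:Description' use='@rdf:about'/>"
      + "<xsl:template match='/rdf:RDF'>"
      + "<resources>"
        // Visit only the first rdf:Description of each subject...
      + "<xsl:for-each select='rdf:Description[generate-id() ="
      + " generate-id(key(\"by-subject\", @rdf:about)[1])]'>"
      + "<resource uri='{@rdf:about}'>"
        // ...and pull in the types from every statement about that subject.
      + "<xsl:for-each select='key(\"by-subject\", @rdf:about)/rdf:type'>"
      + "<type uri='{@rdf:resource}'/>"
      + "</xsl:for-each>"
      + "</resource>"
      + "</xsl:for-each>"
      + "</resources>"
      + "</xsl:template>"
      + "</xsl:stylesheet>";

    static String transform(String stylesheet, String xml) {
        try {
            Transformer t = TransformerFactory.newInstance()
                .newTransformer(new StreamSource(new StringReader(stylesheet)));
            t.setOutputProperty(OutputKeys.OMIT_XML_DECLARATION, "yes");
            StringWriter out = new StringWriter();
            t.transform(new StreamSource(new StringReader(xml)), new StreamResult(out));
            return out.toString();
        } catch (TransformerException e) {
            throw new IllegalStateException("XSLT transformation failed", e);
        }
    }

    public static void main(String[] args) {
        System.out.println(transform(STYLESHEET, INPUT));
    }
}
```

This only handles the flat rdf:Description form with rdf:about subjects; blank nodes, nested descriptions and typed node elements would need extra templates.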

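You might find the RDF/XML-Abbrev format easier to read, but it is not easy to process with XSLT, because RDF's data model is unordered and one graph can have many equivalent XML forms that your XSLT may not be written to handle. The example above can serialize as either of the following: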

<!-- Bar is the containing element -->
<rdf:RDF xmlns:e="uri://example#"
         xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
  <e:Bar rdf:about="uri://example#Foo">
    <rdf:type rdf:resource="uri://example#Baz"/>
  </e:Bar>
</rdf:RDF>

<!-- Baz is the containing element -->
<rdf:RDF xmlns:e="uri://example#"
         xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
  <e:Baz rdf:about="uri://example#Foo">
    <rdf:type rdf:resource="uri://example#Bar"/>
  </e:Baz>
</rdf:RDF>

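Pete Kirkham's suggestion of creating a canonical form for serialization will help you in writing your XSLT. In most cases, given the exact same input, an RDF library will serialize the statements to the same format every time, but I would not depend on this in the long run, as the data in an RDF graph is unordered.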
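To make the "many equivalent XML forms" point concrete, the sketch below reads the three serializations shown above with the JDK's DOM parser and reduces each to the same sorted subject-to-types map. It is deliberately tiny and hand-rolled: it only understands rdf:about subjects, rdf:type properties and typed node elements, and the class and method names are invented. For real data, a proper RDF parser is the safer route.

```java
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;
import org.xml.sax.SAXException;

import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.parsers.ParserConfigurationException;
import java.io.IOException;
import java.io.StringReader;
import java.util.SortedMap;
import java.util.SortedSet;
import java.util.TreeMap;
import java.util.TreeSet;

class RdfTypeNormalizer {
    static final String RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#";

    // Wraps a fragment in the rdf:RDF root element used in the examples above.
    static String wrap(String body) {
        return "<rdf:RDF xmlns:e='uri://example#' xmlns:rdf='" + RDF_NS + "'>" + body + "</rdf:RDF>";
    }

    static final String BASIC = wrap(
        "<rdf:Description rdf:about='uri://example#Foo'>"
      + "<rdf:type rdf:resource='uri://example#Baz'/></rdf:Description>"
      + "<rdf:Description rdf:about='uri://example#Foo'>"
      + "<rdf:type rdf:resource='uri://example#Bar'/></rdf:Description>");

    static final String BAR_CONTAINING = wrap(
        "<e:Bar rdf:about='uri://example#Foo'>"
      + "<rdf:type rdf:resource='uri://example#Baz'/></e:Bar>");

    static final String BAZ_CONTAINING = wrap(
        "<e:Baz rdf:about='uri://example#Foo'>"
      + "<rdf:type rdf:resource='uri://example#Bar'/></e:Baz>");

    // Collects subject -> sorted set of type URIs, independent of how the XML was abbreviated.
    static SortedMap<String, SortedSet<String>> typesBySubject(String rdfXml) {
        try {
            DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
            factory.setNamespaceAware(true);
            Document doc = factory.newDocumentBuilder()
                .parse(new InputSource(new StringReader(rdfXml)));

            SortedMap<String, SortedSet<String>> result = new TreeMap<>();
            NodeList nodes = doc.getDocumentElement().getChildNodes();
            for (int i = 0; i < nodes.getLength(); i++) {
                if (!(nodes.item(i) instanceof Element)) {
                    continue; // skip whitespace and comments
                }
                Element node = (Element) nodes.item(i);
                String subject = node.getAttributeNS(RDF_NS, "about");
                SortedSet<String> types = result.computeIfAbsent(subject, k -> new TreeSet<>());

                // A typed node element such as <e:Bar> is shorthand for an rdf:type statement.
                boolean isDescription = RDF_NS.equals(node.getNamespaceURI())
                    && "Description".equals(node.getLocalName());
                if (!isDescription) {
                    types.add(node.getNamespaceURI() + node.getLocalName());
                }
                NodeList typeProps = node.getElementsByTagNameNS(RDF_NS, "type");
                for (int j = 0; j < typeProps.getLength(); j++) {
                    types.add(((Element) typeProps.item(j)).getAttributeNS(RDF_NS, "resource"));
                }
            }
            return result;
        } catch (ParserConfigurationException | SAXException | IOException e) {
            throw new IllegalStateException("could not parse RDF/XML", e);
        }
    }

    public static void main(String[] args) {
        System.out.println(typesBySubject(BASIC));
        System.out.println(typesBySubject(BAR_CONTAINING));
        System.out.println(typesBySubject(BAZ_CONTAINING));
    }
}
```

All three inputs reduce to the same map, which is the property a canonical form needs before any XSLT is written against it.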
