从 XML RAW 数据批量插入 SQL Server 2008 R2

发布于 2025-01-02 10:43:14 字数 1162 浏览 3 评论 0原文

我有一个如下所示的 XML 结构：

<tables>
  <table name="tableName1">
    <row ID="34" col1="data" col2="dom" />
    <row ID="35" col1="data2" col2="dom2" />
  </table>
  <table name="tableName2">
    <row ID="1" col1="data" col2="dom" col3="item1" />
    <row ID="3" col1="data2" col2="dom2" col3="item2" />
    <row ID="7" col1="data4" col3="item3" />
  </table>
  ...
<tables>

基本上，表节点包含通过选择 FOR XML RAW 创建的 RAW 数据。

现在我希望执行相反的操作：读取 XML 并将数据插入到 SQL Server 2008 R2 数据库的相应表中。然而，我希望加载过程是健壮的，这意味着如果列名和表名将来发生变化，我不想弄乱它们。我需要从表节点的 @name 属性读取表名称并将数据插入到节点中的属性指定的列中的过程。我想到了一个存储过程，它获取 XML 作为输入并完成其余的工作。

数据量约为。 70 个表，行数从 10 到 30 000 行不等，总共不超过 100 000 行。我需要尽可能高效地完成，批量加载是最好的。

该过程不应该处理外键，因为 XML 内的表顺序是这样构建的，以便可以通过依次加载一个表来保持 FK 约束就位。

但是，每个表中都有标识列，因此我必须

SET Identity_Insert ON and SET Identity_Insert OFF

在处理每个表之前和之后进行处理。我还需要在插入所有行后重新设定每个表的种子。哦，我需要在事务中完成整个工作，以便在出现问题时可以回滚。

您建议我走哪条路：我应该继续使用 T-SQL 还是尝试用 CLR SQL 编写 SP？我应该使用 XQuery 还是可以使用一些批量插入方法？

感谢您的帮助！

原文

I have an XML structure like the following:

<tables>
  <table name="tableName1">
    <row ID="34" col1="data" col2="dom" />
    <row ID="35" col1="data2" col2="dom2" />
  </table>
  <table name="tableName2">
    <row ID="1" col1="data" col2="dom" col3="item1" />
    <row ID="3" col1="data2" col2="dom2" col3="item2" />
    <row ID="7" col1="data4" col3="item3" />
  </table>
  ...
<tables>

Basically the table nodes contain RAW data created by selecting FOR XML RAW.

Now I wish to do the reverse: read the XML and insert data into respective tables of a SQL Server 2008 R2 database. However I want the loading process to be robust, meaning I do not want to mess with column names and table names if they change in the future. I need the process to read table names from @name attributes of table nodes and insert data into columns specified by attributes in <Row> nodes. I thought of a stored procedure that gets an XML as input and does the rest.

The amount of data is approx. 70 tables ranging from 10 to 30 000 rows, altogether no more than 100 000 rows. I need to do it as efficiently as possible, bulk loading would be the best.

The process should not take care of foreign keys as the order of tables inside the XML is built so that FK constraints can be kept in place by loading one table after the other.

However there are identity columns in each table so I must do a

SET Identity_Insert ON and SET Identity_Insert OFF

before and after processing each table. I also need to reseed each table after inserting all rows. Oh,and I need to do the whole shebang in a transaction so that I could roll back if something goes wrong.

Which way do you suggest I go: should I stay with T-SQL or try to write the SP in CLR SQL? Should I use XQuery or can I use some bulk insert method?

Thanks for all the help!

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

ま柒月 2025-01-09 10:43:14

基本上，您必须循环遍历 XML 并根据结果集编写查询。

尝试这样开始：

declare @i int;
declare @x xml;

------
SELECT @x = N'
<tables>
  <table name="tableName1">
    <row ID="34" col1="data" col2="dom" />
    <row ID="35" col1="data2" col2="dom2" />
  </table>
  <table name="tableName2">
    <row ID="1" col1="data" col2="dom" col3="item1" />
    <row ID="3" col1="data2" col2="dom2" col3="item2" />
    <row ID="7" col1="data4" col3="item3" />
  </table>
</tables>';


exec sp_xml_preparedocument @i output, @x


select ID, col1, col2
from OpenXml(@i, '/tables/table/row')
with (ID int, col1 nvarchar(30), col2 nvarchar(30))

exec sp_xml_removedocument @i

它将为您提供需要插入数据的列的列表（您可以获取前一级的表名称，只需更改 SQL）

34  data    dom
35  data2   dom2
1   data    dom
3   data2   dom2
7   data4   NULL

您接下来需要做的是编写在此结果集上循环的语句。

仅供参考，您不需要编写 XML，您可以从如下文件中读取：

SELECT @x = xCol.BulkColumn FROM OPENROWSET (BULK 'c:\Update.xml', SINGLE_BLOB) AS xCol;

Basically you will have to loop through your XML and write the queries based on the result set.

Try this to start:

declare @i int;
declare @x xml;

------
SELECT @x = N'
<tables>
  <table name="tableName1">
    <row ID="34" col1="data" col2="dom" />
    <row ID="35" col1="data2" col2="dom2" />
  </table>
  <table name="tableName2">
    <row ID="1" col1="data" col2="dom" col3="item1" />
    <row ID="3" col1="data2" col2="dom2" col3="item2" />
    <row ID="7" col1="data4" col3="item3" />
  </table>
</tables>';


exec sp_xml_preparedocument @i output, @x


select ID, col1, col2
from OpenXml(@i, '/tables/table/row')
with (ID int, col1 nvarchar(30), col2 nvarchar(30))

exec sp_xml_removedocument @i

It will get you the list of columns you need to inset data into (you can get the table names one level before, just change the SQL)

34  data    dom
35  data2   dom2
1   data    dom
3   data2   dom2
7   data4   NULL

what you need to do next is write the statements looping on this result set.

FYI, you don't need to write the XML, you can read from a file like this:

SELECT @x = xCol.BulkColumn FROM OPENROWSET (BULK 'c:\Update.xml', SINGLE_BLOB) AS xCol;

回复收藏 0 原文

向地狱狂奔 2025-01-09 10:43:14

当您处理相当大的 XML 文档时，我建议此时使用 .net 粉碎机。您可以在 CLR 过程或外部工具中执行此操作。您还可以使用 SQL Server 内置的 xquery，但这会很慢。

但是，看看这个和您之前的问题（将数据从 MS SQL Server 2008 R2 转储到单个 XML 文件），我认为您最好使用 BCP 实用程序甚至复制之类的东西。您的具体要求是什么？

回复收藏 0 原文

~没有更多了~

关于作者

放血

暂无简介

文章

27 人气

关注发私信

友情链接

文江博客

从 XML RAW 数据批量插入 SQL Server 2008 R2

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（2）

关于作者

相关话题

热门标签

推荐作者

十二

飞烟轻若梦

OPleyuhuo

wxb0109

旧城空念

-小熊_

友情链接

从 XML RAW 数据批量插入 SQL Server 2008 R2

如果你对这篇内容有疑问，欢迎到本站社区发帖提问 参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（2）

关于作者

相关话题

热门标签

推荐作者

十二

飞烟轻若梦

OPleyuhuo

wxb0109

旧城空念

-小熊_

友情链接

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。