批量插入期间出现额外字符

发布于 2024-10-07 08:06:32 字数 956 浏览 8 评论 0原文

我正在尝试将 csv 文件中的第一行批量插入到只有一列的表中。但我在开头得到了一些额外的字符（'n++'），如下所示：

n++First Column;Second Column;Third Column;Fourth Column;Fifth Columnm;Sixth Column

CSV 文件内容如下：

First Column;Second Column;Third Column;Fourth Column;Fifth Columnm;Sixth Column

您可以找到 test.csv 文件 here

这是我用来获取表中第一行数据的代码

declare @importSQL nvarchar(2000)
declare @tempstr varchar(max)
declare @path varchar(100)  

SET @path = 'D:\test.csv'    

CREATE TABLE #tbl (line VARCHAR(max))

SET @importSQL = 
'BULK INSERT #tbl 
FROM ''' + @path + ''' 
WITH ( 
LASTROW = 1,
FIELDTERMINATOR = ''\n'',
ROWTERMINATOR = ''\n''
)' 

EXEC sp_executesql @stmt=@importSQL 

SET @tempstr = (SELECT TOP 1 RTRIM(REPLACE(Line, CHAR(9), ';')) FROM #tbl)

print @tempstr
drop table #tbl

知道这个额外的“n++”在哪里吗来自？

原文

I am trying to bulk insert the first row from a csv file into a table with only one column.
But I am getting some extra characters('n++') in the begining like this:

n++First Column;Second Column;Third Column;Fourth Column;Fifth Columnm;Sixth Column

CSV file contents are like:

First Column;Second Column;Third Column;Fourth Column;Fifth Columnm;Sixth Column

You can find the test.csv file here

And this is the code I am using to get the first row data in a table

declare @importSQL nvarchar(2000)
declare @tempstr varchar(max)
declare @path varchar(100)  

SET @path = 'D:\test.csv'    

CREATE TABLE #tbl (line VARCHAR(max))

SET @importSQL = 
'BULK INSERT #tbl 
FROM ''' + @path + ''' 
WITH ( 
LASTROW = 1,
FIELDTERMINATOR = ''\n'',
ROWTERMINATOR = ''\n''
)' 

EXEC sp_executesql @stmt=@importSQL 

SET @tempstr = (SELECT TOP 1 RTRIM(REPLACE(Line, CHAR(9), ';')) FROM #tbl)

print @tempstr
drop table #tbl

Any idea where this extra 'n++' is coming from?

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

假装爱人 2024-10-14 08:06:32

SQL Server 2005和2008似乎不支持UTF-8文件，只有版本11才支持！

https:/ /connect.microsoft.com/SQLServer/feedback/details/370419/bulk-insert-and-bcp-does-not-recognize-codepage-65001

回复收藏 0 原文

写下不归期 2024-10-14 08:06:32

额外的字符是由编码引起的。您可以使用记事本将编码格式从UTF-8更改为Unicode。这删除了第一行上的“n++”。

回复收藏 0 原文

英雄似剑 2024-10-14 08:06:32

它可能是正在选取的 Unicode 字节顺序标记。

我建议您尝试将 DATAFILETYPE 选项设置为语句的一部分。有关更多详细信息，请参阅 MSDN 文档：http://msdn。 microsoft.com/en-us/library/aa173832%28SQL.80%29.aspx

回复收藏 0 原文

哭泣的笑容 2024-10-14 08:06:32

不幸的是，旧的 SQL Server 版本不支持 utf-8。将代码页参数添加到批量插入方法。在您的问题中，请更改现有的代码。

SET @importSQL = 
'BULK INSERT #tbl 
    FROM ''' + @path + ''' 
    WITH ( LASTROW = 1, 
           FIELDTERMINATOR = ''\n'', 
           ROWTERMINATOR = ''\n'' , 
           CODEPAGE=''65001'')'

请注意，您的文件必须采用 utf-8 格式。
但问题是，如果您将服务器从 2005 年升级到 2008 年，则不支持代码页 65001(utf-8)，然后您将收到“不支持代码页”消息

Unfortunatelly, Old SQL Server versions not supports utf-8. Add the codepage parameter to bulk insert method. In your question please change your code as exists.

SET @importSQL = 
'BULK INSERT #tbl 
    FROM ''' + @path + ''' 
    WITH ( LASTROW = 1, 
           FIELDTERMINATOR = ''\n'', 
           ROWTERMINATOR = ''\n'' , 
           CODEPAGE=''65001'')'

Note that, your file must be in utf-8 format.
But the problem there is, if you're upgrade your server from 2005 to 2008 the codepage 65001(utf-8) not supported and then you will get the " codepage not supported"message

回复收藏 0 原文