Tsql - 在分隔列上执行联接 - 性能和优化问题

发布于 2024-11-07 04:04:09 字数 1452 浏览 0 评论 0原文

我有以下查询（在返回的列中略有简化）。

select Products.Product, Products.ID, Products.Customers
from Products
where Products.orderCompleteDate is null

作为示例，这将返回

productA  1  Bob
productA  1  Jane
productB  2  John,Dave

请注意，客户可以是逗号分隔的列表。我想添加的是“客户位置”列，因此上面变成了

productA  1  Bob        Ireland
productA  1  Jane       Wales
productB  2  John,Dave  Scotland,England

我在下面创建的函数，其中 fn_split 返回每个分隔项的单行。

create FUNCTION [dbo].[GetLocations]  (@CustomerNames Varchar(256) )   

RETURNS @TempLocations table (CustomerLocations varchar(256)) AS begin
declare @NameStr varchar(256)  
declare @temp table(singleLoc varchar(256))

insert into @temp
select CustomerLocation.Location from CustomerLocation
INNER JOIN Customers ON Customers.ID = CustomerLocation.ID
INNER JOIN dbo.fn_Split(@CustomerNames,',') split ON split.Item = Customers.Name

SELECT @NameStr = COALESCE(@NameStr + ',', '') + singleLoc 
FROM @temp 

insert into @TempLocations values (@NameStr)
return
end

并将其应用到原始查询中，如下所示。

select Products.product, Products.ID, Products.Customers, Locations.CustomerLocations
from Products
OUTER APPLY dbo.GetLocations(Products.Customers,',') AS Locations
where Products.orderCompleteDate is null

但是，这非常慢，在只有 2000 行的表上查询大约需要 10 秒（初始查询几乎立即运行）。这表明查询无法优化，并且是逐行生成的。由于这个原因，我远离了标量值函数，并尝试坚持使用表值函数。我的逻辑/代码有什么明显的错误吗？

原文

I have the following (slightly simplified in the columns returned) query.

select Products.Product, Products.ID, Products.Customers
from Products
where Products.orderCompleteDate is null

This would return, as an example

productA  1  Bob
productA  1  Jane
productB  2  John,Dave

Note that Customers can be a comma delimited list. What I want to add, is a column 'Customer Locations', so the above becomes

productA  1  Bob        Ireland
productA  1  Jane       Wales
productB  2  John,Dave  Scotland,England

I created a function below, where fn_split returns a single row per delimited item.

create FUNCTION [dbo].[GetLocations]  (@CustomerNames Varchar(256) )   

RETURNS @TempLocations table (CustomerLocations varchar(256)) AS begin
declare @NameStr varchar(256)  
declare @temp table(singleLoc varchar(256))

insert into @temp
select CustomerLocation.Location from CustomerLocation
INNER JOIN Customers ON Customers.ID = CustomerLocation.ID
INNER JOIN dbo.fn_Split(@CustomerNames,',') split ON split.Item = Customers.Name

SELECT @NameStr = COALESCE(@NameStr + ',', '') + singleLoc 
FROM @temp 

insert into @TempLocations values (@NameStr)
return
end

And applied it to the original query as follows

select Products.product, Products.ID, Products.Customers, Locations.CustomerLocations
from Products
OUTER APPLY dbo.GetLocations(Products.Customers,',') AS Locations
where Products.orderCompleteDate is null

However, this is extremely slow, with the query taking ~10seconds on a table with a mere 2000 rows (initial query runs almost instantly). This suggests that the query was unable to be optimised, and is being generated row by row. I stayed away from scalar value functions for this reason, and tried to stick to table value functions. Is there any glaring fault in my logic/code?

分享到QQ

分享到微博