
SSIS transactional data (different record types, one file)

Posted on 2024-07-24 14:02:08

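An interesting one: we are evaluating ETL tools for pre-processing statement data (e.g. utility bills, bank statements) for printing.

Some of the data arrives in a single flat file that contains different record types.

For example, a record with "01" as its first field is address data and contains name and address fields. A record of type "02" is summary data and contains balances and totals. Record type "03" holds the line items on the statement.

Each statement has one 01 record, one 02 record, and multiple 03 records. I could pre-parse the file and split it into 3 files to load into tables, but that is not ideal.

We take the file, perform some operations on it (for example, adding a few more fields to the address record and possibly doing some totalling/validation), and then send it on in almost the same format (but with the extra fields added) to our print composition program.

How would you do this in SSIS?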


离鸿 2024-07-31 14:02:08

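A big problem with variant records in SSIS is that you get no help from the connection manager with the layout, because a connection manager can handle only a single layout.

So, typically, you end up with a CRLF-terminated flat file with two columns: the record type and the record data. You then add a Conditional Split and parse each type of row on its own path. The parsing has to split the remaining record data into columns and convert them as usual, using a Derived Column transformation or a Script transformation, and possibly conversion transformations.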
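The two-column approach described above can be sketched in plain C#, outside of SSIS. This is only an illustration under assumptions: the `RecordSplitter` class, the `|` delimiter, and the column names (`Name`, `Address`, `Balance`, `Total`, `Description`, `Amount`) are hypothetical and do not come from the original post. In a real package the `switch` would correspond to the Conditional Split, and each `case` to the parsing done on one output path.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical helper: the '|' delimiter and the column names are assumptions
// made for illustration; the original post does not specify the file layout.
public static class RecordSplitter
{
    // Splits a raw line into (record type, remaining record data),
    // mirroring the two-column flat file described above.
    public static KeyValuePair<string, string> SplitTypeAndData(string line)
    {
        int sep = line.IndexOf('|');
        if (sep < 0)
            return new KeyValuePair<string, string>(line, string.Empty);
        return new KeyValuePair<string, string>(
            line.Substring(0, sep), line.Substring(sep + 1));
    }

    // Parses the record data of one type into named columns. Each case plays
    // the role of the parsing done on one Conditional Split output path.
    public static Dictionary<string, string> Parse(string recordType, string data)
    {
        string[] fields = data.Split('|');
        var columns = new Dictionary<string, string>();
        switch (recordType)
        {
            case "01": // address record
                columns["Name"] = Field(fields, 0);
                columns["Address"] = Field(fields, 1);
                break;
            case "02": // summary record
                columns["Balance"] = Field(fields, 0);
                columns["Total"] = Field(fields, 1);
                break;
            case "03": // statement line item
                columns["Description"] = Field(fields, 0);
                columns["Amount"] = Field(fields, 1);
                break;
            default:
                throw new ArgumentException("Unknown record type: " + recordType);
        }
        return columns;
    }

    // Returns the field at the given index, or an empty string if it is missing.
    private static string Field(string[] fields, int index)
    {
        return index < fields.Length ? fields[index] : string.Empty;
    }
}
```

Type conversion (for example, turning `Balance` into a decimal) is left out on purpose; in SSIS that step would normally sit after the parsing on each path.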

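If you have a lot of packages to build, I would seriously consider writing a custom component that produces 3 outputs already converted to your destination types.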


半窗疏影 2024-07-31 14:02:08

Answered my own question; see the script below. AcctNum comes in from a derived column on the flat file source and is populated correctly for 02 record types. The script saves it in a local static variable and puts it back on the rows of the other record types, which do not contain the account number.

/* Microsoft SQL Server Integration Services Script Component
* Write scripts using Microsoft Visual C# 2008.
* ScriptMain is the entry point class of the script.*/

using System;
using System.Data;
using Microsoft.SqlServer.Dts.Pipeline.Wrapper;
using Microsoft.SqlServer.Dts.Runtime.Wrapper;

[Microsoft.SqlServer.Dts.Pipeline.SSISScriptComponentEntryPointAttribute]
public class ScriptMain : UserComponent
{
static String AccountNumber = null;

public override void PreExecute()
{
    base.PreExecute();
    /*
      Add your code here for preprocessing or remove if not needed
    */
}

public override void PostExecute()
{
    base.PostExecute();
    /*
      Add your code here for postprocessing or remove if not needed
      You can set read/write variables here, for example:
      Variables.MyIntVar = 100
    */
}

public override void Input0_ProcessInputRow(Input0Buffer Row)
{
    if (Row.RecordType == "02")
        AccountNumber = Row.AcctNum; // Store incoming Account Number in local script variable
    else if (Row.RecordType == "06" || Row.RecordType == "07" || Row.RecordType == "08" ||
             Row.RecordType == "09" || Row.RecordType == "10")
        Row.AcctNum = AccountNumber; // Put Stored Account Number on this row.
}

}
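
To see the carry-forward logic of the script above in isolation, here is a minimal sketch that runs outside SSIS. The `Row` and `AccountNumberFiller` classes are hypothetical stand-ins (the real `Input0Buffer` is generated by SSIS), and using an instance field instead of a static one is a choice made for this sketch, not something the original answer requires. As in the script, the result depends on rows arriving in file order.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical stand-in for the SSIS-generated Input0Buffer row;
// only the two columns used by the script are modelled.
public class Row
{
    public string RecordType;
    public string AcctNum;

    public Row(string recordType, string acctNum)
    {
        RecordType = recordType;
        AcctNum = acctNum;
    }
}

public class AccountNumberFiller
{
    // Record types that lack an account number, taken from the script above.
    private static readonly HashSet<string> TypesToFill =
        new HashSet<string> { "06", "07", "08", "09", "10" };

    // Instance field (an assumption of this sketch), so every new filler
    // starts without a stored account number.
    private string accountNumber;

    public void ProcessRow(Row row)
    {
        if (row.RecordType == "02")
            accountNumber = row.AcctNum;   // remember the latest account number
        else if (TypesToFill.Contains(row.RecordType))
            row.AcctNum = accountNumber;   // copy it onto rows that lack one
        // All other record types pass through unchanged.
    }
}
```

A quick way to try it is to feed a list of rows through `ProcessRow` in order, for example a 02 row followed by 06 and 10 rows, and then inspect `AcctNum` on each row.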


丶视觉 2024-07-31 14:02:08

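It is possible, but you will have to write custom logic. I did this once with DTS.
If the file is delimited, SSIS will import the fields correctly. You can write a script that checks the record type field and then branches to a different insert depending on the record type. If the file contains undelimited records where each type has its own fixed widths, things get more complicated, because you have to parse and split each imported row, with the record types and their widths hardcoded in the script.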
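
For the fixed-width case, the hardcoded widths could look roughly like the sketch below. Everything specific here is an assumption for illustration: the `FixedWidthParser` class, the 2-character record type prefix, and the field widths per type are hypothetical, since the original post does not give the actual layout.

```csharp
using System;
using System.Collections.Generic;

public static class FixedWidthParser
{
    // Hypothetical field widths per record type, counted after the
    // 2-character type code. Real layouts would replace these values.
    private static readonly Dictionary<string, int[]> Widths =
        new Dictionary<string, int[]>
        {
            { "01", new[] { 10, 20 } }, // name, address
            { "02", new[] { 8, 8 } },   // balance, total
            { "03", new[] { 15, 8 } }   // description, amount
        };

    // Returns the record type followed by the trimmed fields of the line.
    public static string[] Parse(string line)
    {
        if (line == null || line.Length < 2)
            throw new ArgumentException("Line too short to contain a record type.");

        string recordType = line.Substring(0, 2);
        int[] widths;
        if (!Widths.TryGetValue(recordType, out widths))
            throw new ArgumentException("Unknown record type: " + recordType);

        var fields = new string[widths.Length + 1];
        fields[0] = recordType;
        int pos = 2;
        for (int i = 0; i < widths.Length; i++)
        {
            // Tolerate short lines by taking whatever characters remain.
            int available = Math.Max(0, Math.Min(widths[i], line.Length - pos));
            fields[i + 1] = available > 0
                ? line.Substring(pos, available).Trim()
                : string.Empty;
            pos += widths[i];
        }
        return fields;
    }
}
```

Keeping the widths in one table means a layout change touches a single place, though the layout is still embedded in code, which is the drawback the answer points out.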
