Informatica Data Quality - 匹配分析
在我们的重复分析要求中,输入数据有 1418 条记录,其中 1380 条记录是重复记录。
在与 PowerCenter 集成的 IDQ 中使用匹配分析(使用密钥生成器、匹配器、关联器、合并器)时,除 8 条记录外,所有重复项均被消除。
通过排除这些记录来执行工作流时,重复项会出现在上次运行中未出现重复项的其他记录中。
谁能说出为什么会发生这种不匹配?
In our Duplicate analysis requirement the input data has 1418 records out of which 1380 records are duplicate records.
On using the Match Analysis (used Key Generator, Matcher, Associator, Consolidator) in IDQ integrated with PowerCenter except for 8 records all duplicates were eliminated.
On executing the workflow by excluding these records, duplicates appear in other records for which duplicate didnt occur in the previous run.
Can anyone tell why this mismatch occurs?
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。

绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(2)
看起来您的合并器转换没有获得正确的关联 ID,因此插入多个记录会导致重复。
Looks like your Consolidator transformation is not getting correct association ids and hence inserting multiple records resulting in duplicates.
请尝试以下步骤:
1) 尝试通过部署您在 IDQ 中开发的映射来在 IDQ 本身中创建工作流。
2) 还要检查记录的业务键,这些记录构成主键,您可以通过它来识别源中的重复项。
please try the below steps:
1) Try to create a workflow in IDQ itself by deploying the mapping which you developed in IDQ.
2) Also keep a check on the business keys of the records which make a primary key through which you are identifying the dups in source.