数据屏蔽/加扰
这是专门针对医疗保健领域的。 在将数据从生产环境复制到测试环境时,您使用什么解决方案/工具来屏蔽 PHI 数据
This is specifically for healthcare domain. What solutions/tools do you use to mask the PHI data while copying data from production to test environments
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
并非特定于医疗保健行业,但我们对电信客户也有类似的要求。 我们希望能够在测试系统中提供代表性数据,但确保不会复制任何敏感信息。
为此,我们尝试了许多不同的方法,包括我们选择的方法:
将使用视图而不是直接表访问 - 这些视图基本上会将真实表中的非敏感信息与来自真实表的虚拟信息连接起来。单独的表。 虚拟信息替换了真实表中的所有敏感信息。
数据通过清理过滤器传输。 这基本上会卸载生产数据库并将其传递给替换敏感信息的匿名程序。 对于简单的情况,地址之类的东西都设置为相同。
与上面的 (2) 类似,但过滤器可以使用来自特定查找数据的数据。 例如,地址可以由 4 位街道号码、从 1000 个最常见街道名称中选择的名称、街道类型(ST、RD、CRES、WAY、PLACE 等)和城镇数据库组成。 电话号码可以是任何十位数字的序列(符合电信公司的要求),但不能重复。
Not specific to the health care industry but we had a similar requirement for telco customers. We wanted to be able to provide representative data in test systems but ensure that no sensitive information was copied.
To that end, we tried a number of different things, including the ones we opted for:
Views that would be used instead of direct table accesses - these views would basically join non-sensitive information from the real table with dummy information from separate tables. The dummy information replaced all sensitive information from the real table.
Data transfers through a cleanup filter. This would basically unload the production database and pass it through an anonymiser program which replaced the sensitive information. For simple cases, things like addresses were all set identical.
Similar to (2) above but the filter could use data from specific lookup data. For example, an address could be formed with a 4-digit street number, a name selected from the 1000 most common street names, a street type (ST, RD, CRES, WAY, PLACE, etc) and a database of towns. A phone number could be any sequence of ten digits (subject to telco requirements) but with no duplication.