数据库设计:如何研究原始数据并将其建模到现有数据库中?
作为数据库开发人员,当您必须将这些数据建模到现有数据库中时,研究数据库的新原始数据的最佳方法是什么?我的意思是您在文本文件中获取原始数据,并且没有有关该文件中的数据类型或列长度的任何描述的情况。我正在寻找工具和最佳实践来进行这项调查。 SQL Server 中是否有一些可用的预定义工具?
编辑:我正在谈论这个过程的技术方面,例如如何查看列中的数据是否必须作为tinyint、int或bigint映射到SQL Server表中...
As a database developers, what is the best way to investigate new raw data for database when you have to model this data into an existing database? I mean a situation where you get raw data for example in text files and you don't have any description about the data types or column lengths in that file. I'm looking for tools and best practices to do this investigation. Is there available some pre-defined tools for this in SQL Server?
Edit: I'm talking about the technical side of this process, e.g. how to see if data in a columns has to be mapped into SQL Server table as tinyint, int or bigint...
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(2)
没有任何工具可以告诉您数据的含义。
您需要与数据所有者交谈,了解数据的含义,当您对自己的理解感到满意时,您可以对数据库进行建模并创建 ETL 流程。
There are no tools that will tell you what data means.
You need to talk to the owners of the data, understand the meaning of it and when you are comfortable with your understanding, you can model your database and create your ETL process.
在我看来,您正在寻找数据分析工具。你的标题“...模特...”有点误导。
看一下几个选项:
SSIS 数据分析 任务
Talend
数据清理
Seems to me that you are looking for a data-profiling tool. Your title "... to model ..." is a bit misleading.
Take a look at a few options:
SSIS data profiling task
Talend
Data Cleaner