SQLite3 optimization: store external file names in the database, or just have a huge number of rows?

Posted on 2024-11-09 07:26:18

I am a newbie with no comp sci background. So please forgive me for whatever dumb stuff I may say. I am working on a solar power monitoring project to monitor the power output of the solar power systems my company installs. I am writing a client that will query the inverter (for power output, voltage output, current output, system errors/faults, etc--which constitutes one "reading") of each of our monitoring customers every 15 minutes for as long as they have their system--which means roughly 35k readings per year per customer. So I was thinking of organizing my sqlite3 database in one of the two following ways.

(1) Have the database be two tables, one table with regular customer information (name, email, etc) and another much bigger table where each row represents one reading and includes the customer id and timestamp of the reading as identifiers. That means roughly 35k rows will be added to this bigger table per customer per year. (Data more than two years old will be pared down and archived.)

OR

(2) Store all readings in a CSV file (one CSV file per customer) and store the CSV file name in my table with the regular customer information.

This database will be serving a website (built on Rails, if that makes any difference between the options) where customers will be able to view their power output data. I want to minimize the amount of time it takes to load their output data on login. I basically don't have a clear idea of how long it would take my computer to open and read lines from a text file versus open a huge sqlite3 table, look up a customer id, and read in the data--and therefore am having trouble judging between the two options above. I'm also having trouble gauging the limits within which sqlite3 functions optimally, despite having done some reading about it (I don't think I have the background to understand what I read, because it seems to say hundreds of millions of rows are just fine, while other people's comments seem to say just the opposite). I am also open to a completely different option, as I'm not married to anything right now--whatever makes things load faster. Thanks so much in advance!
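For concreteness, option (1) might look roughly like the sketch below, using Python's built-in sqlite3 module. All table and column names here are made up for illustration; the composite primary key on (customer_id, taken_at) doubles as the index a per-customer lookup would need:

```python
import sqlite3

# Hypothetical schema for option (1): one customers table plus one much
# bigger readings table. All names here are invented for illustration.
conn = sqlite3.connect("solar_monitoring.db")
conn.executescript("""
CREATE TABLE IF NOT EXISTS customers (
    id    INTEGER PRIMARY KEY,
    name  TEXT NOT NULL,
    email TEXT
);

CREATE TABLE IF NOT EXISTS readings (
    customer_id INTEGER NOT NULL REFERENCES customers(id),
    taken_at    TEXT    NOT NULL,  -- ISO-8601 timestamp of the reading
    power_w     REAL,              -- power output
    voltage_v   REAL,              -- voltage output
    current_a   REAL,              -- current output
    fault_code  TEXT,              -- system errors/faults, if any
    PRIMARY KEY (customer_id, taken_at)
);
""")
conn.commit()
conn.close()
```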

Comments (3)

无人接听 2024-11-16 07:26:18

Storing the parsed data in sqlite would definitely be a timesaver if you're doing any kind of repeated data mining on it. CSV parsing overhead would almost instantly eat up any space/time savings you'd gain by skipping the database.

As for efficiency, you'd have to test it. There's no hard and fast rule that says "use this database" or "use that database". It's ALWAYS "depends on the scenario". SQLite may be perfect for you in this case, but useless for someone else with a slightly different workload.
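If you want to test it, a quick timing harness along these lines could work. It assumes the hypothetical schema sketched in the question and a per-customer file named readings_<id>.csv; both names are placeholders, and the point is only to measure both paths on your own data:

```python
import csv
import sqlite3
import time

CUSTOMER_ID = 42  # hypothetical customer id to benchmark

# Path 1: fetch one customer's readings from the SQLite table.
t0 = time.perf_counter()
conn = sqlite3.connect("solar_monitoring.db")
rows = conn.execute(
    "SELECT taken_at, power_w FROM readings WHERE customer_id = ?",
    (CUSTOMER_ID,),
).fetchall()
print(f"sqlite: {len(rows)} rows in {time.perf_counter() - t0:.4f}s")

# Path 2: parse the same data out of a per-customer CSV file.
t0 = time.perf_counter()
with open(f"readings_{CUSTOMER_ID}.csv", newline="") as f:
    rows = [(r["taken_at"], float(r["power_w"])) for r in csv.DictReader(f)]
print(f"csv:    {len(rows)} rows in {time.perf_counter() - t0:.4f}s")
```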

ゝ偶尔ゞ 2024-11-16 07:26:18

SQL applications in general do very well with large data sets, as long as the columns being queried are indexed. You should keep them in the same database. It will take far less time to fetch the data from the database than to parse CSV files. Databases are built for storing and retrieving data; CSV files are not.

I use MySQL databases with tens of millions of rows per table and queries return results in fractions of a second. SQLite might be faster.

Just make sure you create indexes for what you will be searching.
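For example, here is a minimal sketch of creating such an index and then checking that queries actually use it (the column names assume the hypothetical schema from the question; if customer_id and taken_at already form the primary key, the extra index is redundant):

```python
import sqlite3

conn = sqlite3.connect("solar_monitoring.db")

# Index the columns you filter and sort on.
conn.execute(
    "CREATE INDEX IF NOT EXISTS idx_readings_customer_time "
    "ON readings (customer_id, taken_at)"
)

# EXPLAIN QUERY PLAN reports whether SQLite will use an index: you want
# to see a 'SEARCH ... USING INDEX' line rather than a full table 'SCAN'.
for row in conn.execute(
    "EXPLAIN QUERY PLAN "
    "SELECT taken_at, power_w FROM readings "
    "WHERE customer_id = ? ORDER BY taken_at",
    (42,),
):
    print(row)
```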

始终不够爱げ你 2024-11-16 07:26:18

I would do option 1, but use a database server such as PostgreSQL instead of SQLite.

SQLite locks the whole database file on update, so you may run into locking issues if you read from and write to the table a lot. SQLite is better suited for single-user applications on the desktop or on a smartphone.

You can easily have millions of rows without it causing any problems.
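If you do stay on SQLite anyway, enabling write-ahead logging can soften the locking issue, since WAL mode lets readers proceed while one writer is active. A minimal sketch, assuming the same hypothetical database file as above:

```python
import sqlite3

# WAL mode is a standard SQLite feature; the setting persists in the
# database file, so it only needs to be enabled once.
conn = sqlite3.connect("solar_monitoring.db", timeout=10.0)
conn.execute("PRAGMA journal_mode=WAL")
conn.execute("PRAGMA synchronous=NORMAL")  # common, safe pairing with WAL
```

Note that WAL still allows only one writer at a time, so for a site with genuinely concurrent write loads the PostgreSQL suggestion above stands.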
