Is there a way to cache the fetch output?

Posted 2024-11-19 09:15:25

I'm working on a closed system running in the cloud.

What I need is a search function that uses a user-typed regexp to filter the rows in a dataset.

import re

phrase = re.compile(request.get("query"))
data = Entry.all().fetch(50000)  # this takes around 10 s when there are 6000 records
result = [x for x in data if phrase.search(x.title)]

Now, the database itself won't change too much, and there will be no more than 200-300 searches a day.

Is there a way to somehow cache all the Entries (I expect there will be no more than 50,000 of them, each no bigger than 500 bytes), so retrieving them won't take more than 10 seconds? Or perhaps to parallelize it? I don't mind 10 CPU-seconds, but I do mind the 10 seconds the user has to wait.

To preempt any answers like "index it and use .filter()" - the query is a regexp, and I don't know of any indexing mechanism that would allow using a regexp.

Comments (3)

千寻… 2024-11-26 09:15:25

You can also use cachepy or performance engine (shameless plug) to store the data on App Engine's local instances, so you get faster access to all entities without being limited by memcache boundaries or datastore latency.

Hint: a local instance gets killed if it surpasses about 185 MB of memory, so you can actually store quite a lot of data in it if you know what you're doing.

迷爱 2024-11-26 09:15:25

Since there is a bounded number of entries, you can memcache all of them and then do the filtering in memory as you've outlined. Note, however, that each memcache entry cannot exceed 1 MB, while you can fetch up to 32 MB of memcache entries in parallel.

So split the entries into subsets, memcache the subsets, and then read them back in parallel by precomputing the memcache keys (see the sketch after the link below).

More here:

http://code.google.com/appengine/docs/python/memcache/functions.html
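
A minimal sketch of that chunking scheme, assuming the App Engine Python runtime; the chunk size, key names, and helper functions are illustrative choices, not part of the original answer:

from google.appengine.api import memcache

CHUNK_SIZE = 1000  # illustrative; pick it so each chunk stays under the 1 MB value limit
COUNT_KEY = "entry_chunk_count"  # hypothetical key name

def cache_entries(entries):
    # Split into chunks and store them all in one batched set_multi call.
    # Values are pickled by the memcache API; storing plain strings
    # (e.g. just the titles) is the safest choice.
    chunks = [entries[i:i + CHUNK_SIZE] for i in range(0, len(entries), CHUNK_SIZE)]
    mapping = dict(("chunk:%d" % i, c) for i, c in enumerate(chunks))
    memcache.set_multi(mapping)
    memcache.set(COUNT_KEY, len(chunks))

def load_entries():
    # The keys are precomputable, so all chunks come back in one get_multi call.
    n = memcache.get(COUNT_KEY)
    if n is None:
        return None  # cache miss: caller falls back to the datastore
    found = memcache.get_multi(["chunk:%d" % i for i in range(n)])
    if len(found) != n:
        return None  # a chunk was evicted: treat the whole cache as a miss
    return [e for i in range(n) for e in found["chunk:%d" % i]]

On a miss you would refetch with Entry.all().fetch(...) and call cache_entries() to repopulate before filtering.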

夏末 2024-11-26 09:15:25

Since your data is on the order of 20 MB, you may be able to load it entirely into local instance memory, which is as fast as you can get (a sketch of this follows below). Alternatively, you could store it as a data file alongside your app; reading that will be faster than accessing the datastore.
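
A minimal sketch of the instance-memory idea, assuming the App Engine Python runtime; the module-level _ENTRIES cache and its warm-up logic are illustrative names, not from the answer:

_ENTRIES = None  # module-level, so it survives across requests on the same instance

def get_entries():
    # Pay the ~10 s datastore fetch once per instance, then serve from RAM.
    global _ENTRIES
    if _ENTRIES is None:
        _ENTRIES = Entry.all().fetch(50000)
    return _ENTRIES

Each newly started instance pays the warm-up cost once, and the cached data stays until the instance is recycled, which fits a dataset that, as stated, rarely changes.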
