如何使用两个字段进行排序？

发布于 2024-10-13 02:56:43 字数 828 浏览 7 评论 0原文

我有一个排序/分组问题，希望有人可以补充一些见解。

我们有一个故事表，其中包含发布日期和更新日期。我正在使用 Django，所以它看起来像这样：

class Story(models.Model):
    pub_date = models.DateTimeField(db_index=True)
    update_date = models.DateTimeField(blank=True, null=True, db_index=True)
    headline = models.CharField(max_length=200)
    ...

我们希望在按天分组的分页页面上显示故事。所以...

Jan 20
    Story 1
    Story 2

Jan 19
    Story 1
    Story 3

挑战是，如果一个故事有 update_date，它应该显示两次，一次在 pub_date 日，一次在 update_day 日期（例如故事 1）。

有成千上万个故事，所以我当然不能用 python 完成所有这些，但我不知道如何在 SQL 中执行此查询。

我现在所拥有的是按 -pub_date 对所有内容进行排序，然后获取给定页面上的最大和最小日期范围。然后，我使用 update_date 查询这些日期之间的任何故事，并在 python 中将它们组合和分组。问题是页面上的项目数量是不规则的。

所以我想我的问题是这样的：查询表中的项目列表并根据两个字段对它们进行排序的最佳方法是什么，如果它在第二个字段中有值，则在查询中复制一个项目，然后根据在两个领域？

希望这是有道理的...

原文

I have a sorting/grouping issue that I'm hoping somebody could add some insight on.

We have a table of stories that have a publish date and an updated date. I'm using Django so it looks like this:

class Story(models.Model):
    pub_date = models.DateTimeField(db_index=True)
    update_date = models.DateTimeField(blank=True, null=True, db_index=True)
    headline = models.CharField(max_length=200)
    ...

We want to display the stories on a paginated page grouped by day. So...

Jan 20
    Story 1
    Story 2

Jan 19
    Story 1
    Story 3

The challenge is that if a story has an update_date it should be displayed twice, once on the pub_date day, and once on the update_day date (e.g. Story 1).

There are 10s of thousands of stories so I can't do it all in python of course, but I don't know of a way to do this query in SQL.

What I have right now is sorting everything by -pub_date and then getting a range of the max and min dates on a given page. I then query for any stories between those dates with an update_date and combine and group them in python. The problem is that the number of items on a page is irregular then.

So I guess my question is this: What is the best way to query a table for a list of items and sort them based on two fields, duplicating an item in the query if it has a value in the second field, and then sorting based on the two fields?

Hope that makes sense...

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

我一向站在原地 2024-10-20 02:56:43

我只能想到“联盟”能够做到这一点。

这是一个示例。不确定数据库经常向数据库发送此类查询有多快或多好 D：

查询假设您的表名称是 stories，并使用列 headline strong>、pub_date 和 update_date。它还假设尚未更新的故事在 update_date 列中具有 null 值。

SELECT      headline,
            the_date,
            DAY(the_date) AS the_day
FROM (
    SELECT      headline,
                pub_date AS the_date
    FROM        stories
    UNION
    SELECT      headline,
                update_date AS the_date
    FROM        stories
    WHERE       update_date IS NOT NULL
) AS publishedandupdated
ORDER BY    the_date DESC;

如果要向查询添加限制，应该在“order by”子句之后最后完成。

i can only think of "union" being able to do this.

here's an example of what that would look like. not sure how fast or good it is for the database to have this type of query sent to it often though D:

the query assumes your table name is stories, and uses the columns headline, pub_date and update_date. it also assumes that a story that hasn't been updated has the value null in the update_date column.

SELECT      headline,
            the_date,
            DAY(the_date) AS the_day
FROM (
    SELECT      headline,
                pub_date AS the_date
    FROM        stories
    UNION
    SELECT      headline,
                update_date AS the_date
    FROM        stories
    WHERE       update_date IS NOT NULL
) AS publishedandupdated
ORDER BY    the_date DESC;

if you want to add a limit to the query, it should be done last, after the "order by" clause.

回复收藏 0 原文

站稳脚跟 2024-10-20 02:56:43

你的问题和我的很相似。我从 Facebook 墙上读到了一些文章。我有两次约会，一次是关于项目创建（用户发布该项目），一次是关于项目检索（我从 Facebook 读取该项目）。我想显示今天发布或检索的项目。

SELECT link,time FROM homeWallItems WHERE 
DATE_SUB(CURDATE(),INTERVAL 1 DAY)<= created 
OR
DATE_SUB(CURDATE(),INTERVAL 1 DAY)<= time
group by time LIMIT 0,30

编辑：我对这句话过于乐观：这是错误的。

在此代码中，代替 CURDATE()，
如果你使用时间，那么它应该可以工作
你。

your question is similar to what I had. I read some items from Facebook walls. I had two dates, one on item creation(user posts the item), one on item retrieval(I read the item from Facebook). I wanted to show items that are posted or retrieved today.

SELECT link,time FROM homeWallItems WHERE 
DATE_SUB(CURDATE(),INTERVAL 1 DAY)<= created 
OR
DATE_SUB(CURDATE(),INTERVAL 1 DAY)<= time
group by time LIMIT 0,30

Edit: I was over optimistic in this sentence: It is wrong.

in this code, instead of CURDATE(),
if you use time, then it should work
you.

回复收藏 0 原文

凡间太子 2024-10-20 02:56:43

对列名进行一些假设，您需要 UNION ALL 来保留两个部分的重复项。

    select headline, actualdate=pub_date
    from story
    where pub_date between /mindate/ and /maxdate/
union all
    select headline, actualdate=update_date
    from story
    where update_date between /mindate/ and /maxdate/
order by actualdate

虚拟字段 Actualdate 用于将 pub_date / update_date 匹配为按其 ORDER BY 的单个列。
union-ed 语句中的 ORDER BY 在 union 完成后应用，因此它只需要出现一次。
日期范围上的过滤器应用于联合的每个部分内，以减少工作表大小（在应用过滤器之前不必提取所有数据）

Making some assumptions on the column names, you need UNION ALL to retain duplicates from both parts.

    select headline, actualdate=pub_date
    from story
    where pub_date between /mindate/ and /maxdate/
union all
    select headline, actualdate=update_date
    from story
    where update_date between /mindate/ and /maxdate/
order by actualdate

The virtual field actualdate is used to match up the pub_date / update_date as a single column on which to ORDER BY.
The ORDER BY in a union-ed statement is applied AFTER the union has been done, so it only needs to appear once.
the filter on the date range is applied within each part of the union, to reduce the worktable size (it shouldn't have to pull in all data unnecessarily before applying the filter)

回复收藏 0 原文

~没有更多了~