删除 MySQL 中的重复行

发布于 2024-09-11 04:22:27 字数 411 浏览 8 评论 0原文

我有一个包含以下字段的表：

id (Unique)
url (Unique)
title
company
site_id

现在，我需要删除具有相同 title、company 和 site_id 的行。一种方法是使用以下 SQL 和脚本 (PHP)：

SELECT title, site_id, location, id, count( * ) 
FROM jobs
GROUP BY site_id, company, title, location
HAVING count( * ) >1

运行此查询后，我可以使用服务器端脚本删除重复项。

但是，我想知道这是否可以仅使用 SQL 查询来完成。

原文

I have a table with the following fields:

id (Unique)
url (Unique)
title
company
site_id

Now, I need to remove rows having same title, company and site_id. One way to do it will be using the following SQL along with a script (PHP):

SELECT title, site_id, location, id, count( * ) 
FROM jobs
GROUP BY site_id, company, title, location
HAVING count( * ) >1

After running this query, I can remove duplicates using a server side script.

But, I want to know if this can be done only using SQL query.

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

疯了 2024-09-18 04:22:27

一个非常简单的方法是在 3 列上添加一个 UNIQUE 索引。当您编写 ALTER 语句时，请包含 IGNORE 关键字。像这样：

ALTER IGNORE TABLE jobs
ADD UNIQUE INDEX idx_name (site_id, title, company);

这将删除所有重复的行。作为一个额外的好处，未来重复的 INSERT 将会出错。与往常一样，您可能需要在运行类似这样的操作之前进行备份...

编辑：在 MySQL 5.7+ 中不再工作

此功能已在 MySQL 5.6 和在 MySQL 5.7 中被删除，所以它不起作用。

A really easy way to do this is to add a UNIQUE index on the 3 columns. When you write the ALTER statement, include the IGNORE keyword. Like so:

ALTER IGNORE TABLE jobs
ADD UNIQUE INDEX idx_name (site_id, title, company);

This will drop all the duplicate rows. As an added benefit, future INSERTs that are duplicates will error out. As always, you may want to take a backup before running something like this...

Edit: no longer works in MySQL 5.7+

This feature has been deprecated in MySQL 5.6 and removed in MySQL 5.7, so it doesn't work.

删除 MySQL 中的重复行

如果你对这篇内容有疑问，欢迎到本站社区发帖提问 参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（28）

编辑：在 MySQL 5.7+ 中不再工作

Edit: no longer works in MySQL 5.7+

技术说明

✔ 保留最后一个条目而不是第一个条目的变体

✔ 对重复项执行某些任务的变体，例如对找到的重复项进行计数

✔ 重新生成自增字段 id 的变体

✔ 其他变体

Technical explanation

✔ Variation for keeping the last entry instead of the first one

✔ Variation for performing some tasks on the duplicates, for example keeping a count on the duplicates found

✔ Variation for regenerating the auto-incremental field id

✔ Further variations

关于作者

相关话题

热门标签

推荐作者

╭⌒浅淡时光〆

慕巷

浅生活

bal

lqwuliang

后来的我们

友情链接

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。