SQL

Identify and Delete Duplicate Rows Efficiently

Discover SQL techniques to find duplicate records in your table using window functions and Common Table Expressions, and safely remove extra duplicates while preserving one unique entry.

WITH Duplicates AS (
  SELECT id, email_address, full_name,
         ROW_NUMBER() OVER (PARTITION BY email_address, full_name ORDER BY id) as rn
  FROM users
)
DELETE FROM users
WHERE id IN (SELECT id FROM Duplicates WHERE rn > 1);

How it works: This query uses a Common Table Expression (CTE) named `Duplicates` to identify redundant rows. `ROW_NUMBER() OVER (PARTITION BY email_address, full_name ORDER BY id)` assigns a sequential number to each row within groups that have identical `email_address` and `full_name`. Any row with `rn > 1` is considered a duplicate. The final `DELETE` statement then removes all identified duplicates, preserving the row with the lowest `id` within each duplicate group.

Need help integrating this into your project?

Our team of expert developers can help you build your custom application from scratch.

Hire DigitalCodeLabs

Identify and Delete Duplicate Rows Efficiently

Related SQL Snippets

Advanced Pagination or Top N per Group using ROW_NUMBER()

Calculate Running Totals Using SQL Common Table Expressions (CTEs)

Generate Pivot-Like Reports Using SQL Conditional Aggregation

Need help integrating this into your project?