Summarize and organize common methods for removing duplicate data from Oracle database-Oracle-php.cn

Home

Database

Oracle

Summarize and organize common methods for removing duplicate data from Oracle database

WBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWB

Aug 22, 2022 pm 05:59 PM

oracle

This article brings you relevant knowledge about Oracle, which mainly introduces the duplicate data in the table that is often cleared during data cleaning. So how to deal with it in Oracle? Let’s take a look at it together, I hope it will be helpful to everyone.

Summarize and organize common methods for removing duplicate data from Oracle database

Recommended tutorial: "Oracle Video Tutorial"

Create test data

create table nayi224_180824(col_1 varchar2(10), col_2 varchar2(10), col_3 varchar2(10));
insert into nayi224_180824
select 1, 2, 3 from dual union all
select 1, 2, 3 from dual union all
select 5, 2, 3 from dual union all
select 10, 20, 30 from dual ;
commit;
select*from nayi224_180824;

COL_1	COL_2	COL_3
1	2	3
1	2	3
5	2	3
10	20	30

for the specified Column, check the result set after deduplication

distinct

select distinct t1.* from nayi224_180824 t1;

##COL_1COL_2COL_3 102030123523

The method is very limited because it can only deduplicate all query columns. If I want to deduplicate col_2 and col3, then my result set can only have col_2 and col_3 columns, but not col_1.

select distinct t1.col_2, col_3 from nayi224_180824 t1

COL_2COL_3##220But it is also the simplest and easiest way to understand.


3
30

row_number()

select *
  from (select t1.*,
               row_number() over(partition by t1.col_2, t1.col_3 order by 1) rn
          from nayi224_180824 t1) t1
 where t1.rn = 1
;

COL_1110 ##It’s a lot more troublesome to write, but it has greater flexibility .

COL_2	COL_3	RN
2	3	1
20	30	1

For the specified column, find all duplicate rows

count having

select *
  from nayi224_180824 t
 where (t.col_2, t.col_3) in (select t1.col_2, t1.col_3
                                from nayi224_180824 t1
                               group by t1.col_2, t1.col_3
                              having count(1) > 1)

COL_1COL_2COL_3123 123523If you need to check the table twice, the efficiency will be relatively low. Not recommended.

count over

select *
  from (select t1.*,
               count(1) over(partition by t1.col_2, t1.col_3) rn
          from nayi224_180824 t1) t1
 where t1.rn > 1
;

COL_1COL_2COL_3RN123312335233#You only need to check the table once, recommended.

Delete all duplicate rows

delete from nayi224_180824 t
 where t.rowid in (
                   select rid
                     from (select t1.rowid rid,
                                   count(1) over(partition by t1.col_2, t1.col_3) rn
                              from nayi224_180824 t1) t1
                    where t1.rn > 1);

The above statement is slightly modified.

Delete duplicate data and retain one

Analytical function method

delete from nayi224_180824 t
 where t.rowid in (select rid
                     from (select t1.rowid rid,
                                  row_number() over(partition by t1.col_2, t1.col_3 order by 1) rn
                             from nayi224_180824 t1) t1
                    where t1.rn > 1);

has the consistent high flexibility of analytical functions. You can do whatever you want with the grouping and change the orderby clause to achieve requirements like "retain the maximum id".

group by

delete from nayi224_180824 t
 where t.rowid not in
       (select max(rowid) from nayi224_180824 t1 group by t1.col_2, t1.col_3);

Sacrifice some flexibility in exchange for higher efficiency.

Recommended tutorial: "

Oracle Video Tutorial

The above is the detailed content of Summarize and organize common methods for removing duplicate data from Oracle database. For more information, please follow other related articles on the PHP Chinese website!

Statement

This article is reproduced at:脚本之家. If there is any infringement, please contact admin@php.cn delete

什么是oracle asmApr 18, 2022 pm 04:16 PM

oracle asm指的是“自动存储管理”，是一种卷管理器，可自动管理磁盘组并提供有效的数据冗余功能；它是做为单独的Oracle实例实施和部署。asm的优势：1、配置简单、可最大化推动数据库合并的存储资源利用；2、支持BIGFILE文件等。

oracle怎么查询所有索引May 13, 2022 pm 05:23 PM

方法：1、利用“select*from user_indexes where table_name=表名”语句查询表中索引；2、利用“select*from all_indexes where table_name=表名”语句查询所有索引。

oracle全角怎么转半角May 13, 2022 pm 03:21 PM

在oracle中，可以利用“TO_SINGLE_BYTE(String)”将全角转换为半角；“TO_SINGLE_BYTE”函数可以将参数中所有多字节字符都替换为等价的单字节字符，只有当数据库字符集同时包含多字节和单字节字符的时候有效。

Oracle怎么查询端口号May 13, 2022 am 10:10 AM

在Oracle中，可利用lsnrctl命令查询端口号，该命令是Oracle的监听命令；在启动、关闭或重启oracle监听器之前可使用该命令检查oracle监听器的状态，语法为“lsnrctl status”，结果PORT后的内容就是端口号。

oracle怎么删除sequenceMay 13, 2022 pm 03:35 PM

在oracle中，可以利用“drop sequence sequence名”来删除sequence；sequence是自动增加数字序列的意思，也就是序列号，序列号自动增加不能重置，因此需要利用drop sequence语句来删除序列。

oracle怎么查询数据类型May 13, 2022 pm 04:19 PM

在oracle中，可以利用“select ... From all_tab_columns where table_name=upper('表名') AND owner=upper('数据库登录用户名');”语句查询数据库表的数据类型。

oracle查询怎么不区分大小写May 10, 2022 pm 05:45 PM

方法：1、利用“LOWER(字段值)”将字段转为小写，或者利用“UPPER(字段值)”将字段转为大写；2、利用“REGEXP_LIKE(字符串,正则表达式,'i')”，当参数设置为“i”时，说明进行匹配不区分大小写。

Oracle怎么修改sessionMay 13, 2022 pm 05:06 PM

方法：1、利用“alter system set sessions=修改后的数值 scope=spfile”语句修改session参数；2、修改参数之后利用“shutdown immediate – startup”语句重启服务器即可生效。

See all articles

Hot AI Tools

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress images for free

Clothoff.io

AI clothes remover

AI Hentai Generator

Generate AI Hentai for free.

Hot Article

Repo: How To Revive Teammates

1 months agoBy尊渡假赌尊渡假赌尊渡假赌

R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)

2 weeks agoBy尊渡假赌尊渡假赌尊渡假赌

Hello Kitty Island Adventure: How To Get Giant Seeds

1 months agoBy尊渡假赌尊渡假赌尊渡假赌

How Long Does It Take To Beat Split Fiction?

4 weeks agoByDDD

R.E.P.O. Save File Location: Where Is It & How to Protect It?

4 weeks agoByDDD

Hot Tools

SublimeText3 English version

Recommended: Win version, supports code prompts!

Safe Exam Browser

Safe Exam Browser is a secure browser environment for taking online exams securely. This software turns any computer into a secure workstation. It controls access to any utility and prevents students from using unauthorized resources.

Zend Studio 13.0.1

Powerful PHP integrated development environment

DVWA

Damn Vulnerable Web App (DVWA) is a PHP/MySQL web application that is very vulnerable. Its main goals are to be an aid for security professionals to test their skills and tools in a legal environment, to help web developers better understand the process of securing web applications, and to help teachers/students teach/learn in a classroom environment Web application security. The goal of DVWA is to practice some of the most common web vulnerabilities through a simple and straightforward interface, with varying degrees of difficulty. Please note that this software

mPDF

mPDF is a PHP library that can generate PDF files from UTF-8 encoded HTML. The original author, Ian Back, wrote mPDF to output PDF files "on the fly" from his website and handle different languages. It is slower than original scripts like HTML2FPDF and produces larger files when using Unicode fonts, but supports CSS styles etc. and has a lot of enhancements. Supports almost all languages, including RTL (Arabic and Hebrew) and CJK (Chinese, Japanese and Korean). Supports nested block-level elements (such as P, DIV),